[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2024-12-09 Thread محمود محمد
I want you to guide me on how to deal with Tesseract jTessBoxEditor to create a training model on 10 images in Arabic and run the model Hello Tesseract with Mahmoud Abdel Aleem I saw your contributions in GitHub about Tesseract and I benefited from you well Thank you for your useful contrib

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2024-02-19 Thread Shravani Adivarekar
Can you please guide me on how to use it and create box files also on the installation...I am new to OCR and need to develop a Handwritten text recognition for Devanagari language. On Thursday, September 26, 2013 at 7:32:13 AM UTC+5:30 Quan Nguyen wrote: > jTessBoxEditor is a Java box editor f

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2018-03-22 Thread Khan
How to train Chinese characters using JTessBoxEditor because it can support only few languages -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2016-09-13 Thread Quan Nguyen
jTessBoxEditor 1.7 Release: - Update Tesseract training executable 3.05dev (2016-08-31) - Generated images are now compressed to reduce file sizes - Additional parameters for text2image command - Use BreakIterator for character boundary analysis http://vietocr.sourceforge.net/tra

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2016-07-13 Thread pham x hoang
Which version of box file are u using now ? 2 or 3 ? On Wednesday, September 25, 2013 at 8:02:13 PM UTC-6, Quan Nguyen wrote: > > jTessBoxEditor is a Java box editor for Tesseract OCR data. It can read > images of common image formats, including multi-page TIFF. The > program requires JRE 6.0 or

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2016-06-09 Thread Meh Hem
Hi Mate, Just wanted to say that I love your program and you are doing a great job. That is all. On Sunday, June 5, 2016 at 5:24:15 AM UTC+8, Quan Nguyen wrote: > > jTessBoxEditor 1.6 Release > > - Upgrade Tesseract training executable 3.05dev (from > https://github.com/UB-Mannheim/tesseract/wi

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2016-06-04 Thread Quan Nguyen
jTessBoxEditor 1.6 Release - Upgrade Tesseract training executable 3.05dev (from https://github.com/UB-Mannheim/tesseract/wiki) - Incorporate new training commands, including text2image (currently not usable on Windows) http://vietocr.sourceforge.net/training.html http://sourceforge.net/project

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-26 Thread Almas Maris
How can i get english TIFF file? On Thursday, September 26, 2013 9:02:13 AM UTC+7, Quan Nguyen wrote: > > jTessBoxEditor is a Java box editor for Tesseract OCR data. It can read > images of common image formats, including multi-page TIFF. The > program requires JRE 6.0 or later. > > Version 1.0 B

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-12 Thread newbie
Quan/Shree, Do u know of some tool that would only leave the fonts on the image ? A preprocessing of the image for tesseract ? Thanks On Tuesday, November 11, 2014 3:41:21 PM UTC-5, Quan Nguyen wrote: > > The buttons, port, signs, symbols, logos -- those non-text elements --

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-11 Thread Quan Nguyen
The buttons, port, signs, symbols, logos -- those non-text elements -- all help confuse Tesseract. On Tuesday, November 11, 2014 2:04:35 PM UTC-6, newbie wrote: > > Quan, >Can u ellaborate on the problems with image processing - what > do u mean by the non text objects ? I have attac

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-11 Thread Quan Nguyen
The buttons, port, signs, symbols, logos -- those non-text elements -- all help confuse Tesseract. On Tuesday, November 11, 2014 2:04:35 PM UTC-6, newbie wrote: > > Quan, >Can u ellaborate on the problems with image processing - what > do u mean by the non text objects ? I have attac

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-11 Thread newbie
Quan, Can u ellaborate on the problems with image processing - what do u mean by the non text objects ? I have attached the image in a thread above to shree. Thanks On Tuesday, November 11, 2014 2:17:30 PM UTC-5, Quan Nguyen wrote: > > Looks like you got yourself a problem of image p

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-11 Thread Quan Nguyen
Looks like you got yourself a problem of image processing, not training. There are many non-text objects in your image; any OCR engine would have problems with. Eliminating them, you'll get better results. On Tuesday, November 11, 2014 9:30:24 AM UTC-6, newbie wrote: > > Shree, > Thank

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-11 Thread newbie
The google link u gave me below does not let me download the file. Just wanted to check if its different from the one I have. On Tuesday, November 11, 2014 1:53:57 PM UTC-5, newbie wrote: > > Shree, > The eng.traindata that comes with tess4j, which I am presuming > is the one from the

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-11 Thread newbie
Shree, The eng.traindata that comes with tess4j, which I am presuming is the one from the google link below, gives me this below. I should be able to read the vip2500 and AT&T Uverse from the image, which it is not doing. Hence I thought I might have to train it. AT&T U-verse rowan

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-11 Thread ShreeDevi Kumar
You don't need to train in order to extract text. Have you tried with the english traineddata .. available from https://code.google.com/p/tesseract-ocr/source/browse/?repo=tessdata ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.rampari

Re: [tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-11 Thread ShreeDevi Kumar
JTessBoxEditor has three tabs Use *Tiff/Box Generator* to generate tiff and box files from a given text file for the chosen font The Box files created by Box/Tiff Generator are based on the rendering of the text in the chosen font and will be accurate - however they may still get errors 'blob not

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-10 Thread Quan Nguyen
To work in the Box Editor, you would need to provide the box file along with the image. The box file can be either generated or made by Tesseract training. There's no need to convert the image files. On Monday, November 10, 2014 12:28:36 PM UTC-6, newbie wrote: > > I have installed JTessBoxEdito

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-11-10 Thread newbie
I have installed JTessBoxEditor to train my images for tess4j. But I am unable to open the file(png,tiff) in the box editor. When I read the tutorial , it says use tiff/box files as input to the editor, but when it browse's for files it seems to be looking for text files. I have an original png

[tesseract-ocr] Re: jTessBoxEditor - Tesseract box editor & trainer

2014-08-19 Thread Quan Nguyen
Version 1.1 beta is released with the following enhancements: - Add training support for Right-to-Left (RTL) text - Add horizontal box split using modifier keys Any comments/feedback are welcome. Thanks. On Wednesday, September 25, 2013 9:02:13 PM UTC-5, Quan Nguyen wrote: > > jTessBoxEditor is