[tesseract-ocr] Re: Reading dot matrix characters

2019-06-11 Thread Balaji Gurunathan
Hi, I've a similar requirement and I'm new to Tesseract. Could you please share steps required to implement this? Thanks. On Thursday, October 23, 2014 at 12:07:03 PM UTC+5:30, up...@6paq.com wrote: > > Hello. > > I have images that contain characters that are made from individual dots, > like

Re: [tesseract-ocr] Changes in Tesseract 4.0 to 4.1 causing loss in precision

2019-06-11 Thread Shree Devi Kumar
Using the latest code from master branch, --oem 1 and --psm 6 I get the following results using the different traineddata files: tessdata -- | =~ 7.2. | BK Medical 1.01<1.50 TIS: 1.2<2.0 _ Res /Hz 1/7 Hz ~ aw > General nz - ¥ Povier > Gan 52 %, PRF 0.5 kHz| “oo, TENN 0.7. { 7.2. oe ¥ [Heas

Re: [tesseract-ocr] Changes in Tesseract 4.0 to 4.1 causing loss in precision

2019-06-11 Thread Beck Olson
Zdenko-- I was able to pull a scrubbed example without confidential information that demonstrates the issue. Here is the difference: Tesseract 4.1Tesseract 4.0 1B-KMedieal | | BK Medical 101150 | 1.01<1.50 TIS:1220 L | TIS: 1.2<2.0 L Res/Hz1/7Hz es S

[tesseract-ocr] dealing with image with text of separate columns

2019-06-11 Thread Jingjing Lin
I'm wondering, what are the parameters to tune to get better result for image with text of several columns, example as attached. Basically I would like to have separate columns separate, instead of getting different columns sticking together. Like the middle part in the .txt file. I used '-c

Re: [tesseract-ocr] Tesseract does not give good output we need some suggestion.

2019-06-11 Thread Lorenzo Bolzani
Try to straighten the text: https://www.pyimagesearch.com/2017/02/20/text-skew-correction-opencv-python/ (I suspect you are already doing this) Small dots will give you problems with this method, so first make a copy of the image, run a light close/erode (google: morphology transformation) to re

[tesseract-ocr] Re: Training help

2019-06-11 Thread Mox Betex
I have, but I have stumbled upon a problem that I can't solve. I am trying to build training data for Tesseract 4.00 When I execute this command: combine_lang_model --input_unicharset data/unicharset --script_dir data/ tessdata --output_dir data/output --pass_through_recoder --lang MyModel I g