[tesseract-ocr] Extra characters showing up

2022-02-24 Thread Ed Dow
Greetings, I'm using tesseract 4.0.0 in a C/C++ application where I capture an image and then "scrape" text/data from it. I am having issues with tesseract recognizing the ROI with just several characters ( see attached). The attached image is: *014* Recognized as: */~—6h014 5* If I get r

[tesseract-ocr] Re: Error : Can't Run the fine-Tune Model after fine-tuning multi- fonts of khmer language

2022-02-25 Thread Ed Dow
Not sure if you figured this out but I noticed in the error message the path has both forward and back slashes. Maybe that is causing issues opening the file? Also from the command line type:echo %TESSDATA_PREFIX% It should return with the tessdata directory. On Sunday, February 6, 2

[tesseract-ocr] Re: OCR Recognition of Aircraft Tail Numbers

2022-02-25 Thread Ed Dow
That's a good question. I'm working on a project right now taking screen shots from TV and extracting data. It's sort of working. I think much of this would depend on picture quality and placement of the tail numbers. Software would need to be developed around tesseract to know where to loo

[tesseract-ocr] Re: PANCARD AADHAAR

2022-02-25 Thread Ed Dow
You could use OpenCV to define a template with regions of interest (ROIs) and then use tesseract to OCR them? On Tuesday, January 25, 2022 at 3:52:57 AM UTC-7 ameers...@gmail.com wrote: > I'm also facing the same issue. Can someone throw some time light here > please > > > > On Saturday, Ja

[tesseract-ocr] Re: Extra characters showing up

2022-03-01 Thread Ed Dow
;SetImage function. This is the number of bits per pixel. After converting to grayscale it's 1 and not 3. I forgot about this and I was getting 3 strings back. Quite strange. On Thursday, February 24, 2022 at 11:02:27 PM UTC-7 Ed Dow wrote: > Greetings, &g