In the last few days it has been a joyful experience getting to know 
Tesseract and using it in one of my projects. Thank you all.

The challenge I am facing is that the OCR output is incorrect for some 
of the PDF files I am working with, so I am writing to this group for 
advice on how to do this better. The PDF files contain identifiers like the 
one below, for which Tesseract returns "*WaJES58865*" 
while the correct text is "*UZJ6358865*". 
[image: after.png]

When I check with the online tool at https://www.imagetotext.info/, the OCR 
text is correct. So I would be happy to hear any tips and tricks for getting the 
right detections with Tesseract too.
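
For reference, here is a minimal sketch of the kind of call I could try on a 
cropped image of one identifier. It rests on assumptions of mine: that the 
identifiers contain only uppercase letters and digits, that each crop holds a 
single text line (page segmentation mode 7), and that the image is roughly 
300 DPI. The helper names build_tesseract_cmd and ocr_identifier are made up 
for illustration, and the whitelist may not be honored by every Tesseract 
version or engine mode.

```python
import string
import subprocess

# Assumption: identifiers use only uppercase letters and digits.
ID_CHARS = string.ascii_uppercase + string.digits


def build_tesseract_cmd(image_path, psm=7, whitelist=ID_CHARS, dpi=300):
    """Build the argument list for a Tesseract CLI call on one cropped image."""
    return [
        "tesseract", image_path, "stdout",   # "stdout" prints the text instead of writing a file
        "--psm", str(psm),                   # 7 = treat the image as a single text line
        "--dpi", str(dpi),                   # resolution hint; an assumed value
        "-c", f"tessedit_char_whitelist={whitelist}",
    ]


def ocr_identifier(image_path):
    """Run Tesseract on the image and return the recognized text, stripped."""
    result = subprocess.run(
        build_tesseract_cmd(image_path),
        capture_output=True,
        text=True,
        check=True,  # raise if Tesseract exits with an error
    )
    return result.stdout.strip()
```

With Tesseract installed and on the PATH, ocr_identifier("after.png") would 
return whatever text is recognized for that crop. I have not confirmed that 
this fixes the misread above.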
