In the last few days it has been a joyful experience getting to know Tesseract and using it in one of my projects. Thank you all.
The challenge I am facing is that the OCR detection is not correct on some of the PDF files I am working with and I am writing to this group seeking advice on how I could do this better. The PDF file has identifiers like below which when OCR'ed with Tesseract gives the result "*WaJES58865"* while the right answer is "*UZJ6358865*". [image: after.png] While checking on an online tool in https://www.imagetotext.info/ the OCR text is correct. So, happy yo hear to any tips and tricks to get the right detections using Tesseract too. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/1c592d37-ae9b-45c8-b327-5afeed6aa1f2n%40googlegroups.com.