Thanks a lot Dmitry, I will try again on my end and let you know Thanks a lot + Harit
On Sunday, March 1, 2015 at 11:59:58 PM UTC-8, Harit Himanshu wrote: > > Consider the attached receipt. > > I am trying to get text from this image. > > I tried all the options that I could > > ➜ receipts tesseract costco.jpg costco -psm 0 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > Error during processing. > > ➜ receipts tesseract costco.jpg costco -psm 1 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > OSD: Weak margin (4.85) for 209 blob text block, but using orientation > anyway: 0 > > ➜ receipts tesseract costco.jpg costco -psm 2 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > ➜ receipts tesseract costco.jpg costco -psm 4 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > set_count == gridheight():Error:Assert failed:in file colfind.cpp, line 648 > > [1] 46598 abort tesseract costco.jpg costco -psm 4 > > ➜ receipts tesseract costco.jpg costco -psm 5 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > ➜ receipts tesseract costco.jpg costco -psm 6 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > ➜ receipts tesseract costco.jpg costco -psm 7 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > ➜ receipts tesseract costco.jpg costco -psm 8 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > ➜ receipts tesseract costco.jpg costco -psm 9 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > ➜ receipts tesseract costco.jpg costco -psm 10 > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > ➜ receipts tesseract costco.jpg costco -psm 6 -l eng > > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > ➜ receipts tesseract -v > > > But the option where I get most data is with -psm 6. But the data is > unreadable (See attached file) > > > How can I read this image? > > Thanks > > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/580be8e2-b649-45ae-9a0e-c1b389b3c407%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

