[tesseract-ocr] Tesseract misinterprets letters in an invoice

Alexandra Mon, 08 Jan 2018 06:59:12 -0800


Hello,



I am using Tesseract 4.0 and I am trying to OCR some invoices. My problem 
is that it gives wrong results for some letters, for example I will get a $ 
or an 8 when the letter is actually S.

The weird things is that some S's are guessed correctly, but some S's or 
not, and this applies to other letters as well.

My question is, how can I train Tesseract to handle these cases better?

Also, I was wonderinf if Tesseract misinterprets S in S.A. as being a 
number because of the dots.

I have attached the image that I am having problems with.


Thanks,

Alexandra

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/f978d5c0-b6f4-4fc6-aabb-012a317f5367%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Tesseract misinterprets letters in an invoice

Reply via email to