Is anyone aware of best practices for recognizing text in a image which could be in english or any other language?
- Is configuring tesseract with all 100+ supported trained datasets and just letting him figure out what the best language dataset to use an option? Does anybody have experience with accuracy and performance in such a configuration? - Is there a suggested alternative ... like trying to guess what the language is, then do language identification on the returned blob and later re-OCR with the recognized language? Thanks! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/7fee0d31-9997-4590-a1d6-36439049711c%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.