Tesseract misses the extraction of some words like "Monthly" and "Total" (under section V) in the attached form. Upon using the PRImA tools I found that "Monthly" was omitted as it wasn't segmented correctly while "Total" even though fell under the segmentation region wasn't extracted.
Any idea what could have caused such a behavior and how to fix this? I used PSM 3. Thank you. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/2df7c54f-bc66-482d-9f77-0fd65a6c2ae0%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

