Hello, You can try the OCR preprocessing in spark NLP, if you are on Python or Scala. Try to use the scaling option.
Alberto. On Feb 24, 2019 2:21 PM, "'Nenad Kocev' via tesseract-ocr" < [email protected]> wrote: > Hello, I recently discovered Tesseract and I've been using it to extract > digits from images using tess4j library. With the settings posted bellow I > get around 85% accuracy of recognition. > Is there a way to get 100% accuracy. I have example of an image in the > attachments. Other images may differ only in number of digits they have and > may also contain special characters like ",+-". Thanks for your help. > > Settings: > > tesseract.setPageSegMode(7); // text is in single line > > tesseract.setTessVariable("tessedit_char_whitelist", ",+-0123456789"); > tesseract.setTessVariable("load_system_dawg ", "false"); > tesseract.setTessVariable("load_freq_dawg ", "false"); > > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/be275b8f-1c58-4793-b2c3-545bc2e5ac74% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/be275b8f-1c58-4793-b2c3-545bc2e5ac74%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CA%2BGGe6QvDytiJ3nyo6kV%3DdihWrULzwNtvHO7uUogm6e80RMeRw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

