I've already configured tesseract to not use dictionaries when processing non-word images, but what is the best practice for tesstraining a non-word traineddata file?
Currently, I'm tesstraining using tessdata_best and my corrected ground-truth. In my situation, is it better not to training with tessdata_best? Does tessdata_best contain word dictionaries that will needlessly bloat my traineddata file? Thank you for your time and guidance. Respectfully, Gary -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/5c649d07-a619-4db3-b5c6-7a6bed286528n%40googlegroups.com.