[tesseract-ocr] Tesstraining Non-Word Traineddata File

'Gary' via tesseract-ocr Thu, 20 Jan 2022 12:47:07 -0800

I've already configured tesseract to not use dictionaries when processing 
non-word images, but what is the best practice for tesstraining a non-word 
traineddata file?


Currently, I'm tesstraining using tessdata_best and my corrected 
ground-truth.

In my situation, is it better not to train with tessdata_best?  Does 
tessdata_best contain word dictionaries that will needlessly bloat my 
traineddata file?
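
For reference, one way to check the second question is to unpack a traineddata file and look at which dictionary (DAWG) components it carries and how large they are. The following is only a minimal sketch: it assumes the combine_tessdata tool that ships with the Tesseract training tools is on the PATH, and the helper names unpack_command and dawg_components, as well as the "unpacked/eng." prefix, are hypothetical names chosen for illustration.

```python
from pathlib import Path


def unpack_command(traineddata, out_prefix):
    """Build the combine_tessdata call that unpacks a traineddata file.

    `combine_tessdata -u <file> <prefix>` writes one file per component,
    each named <prefix><component> (for example <prefix>lstm-word-dawg).
    """
    return ["combine_tessdata", "-u", traineddata, out_prefix]


def dawg_components(unpacked_dir):
    """Map each unpacked dictionary (DAWG) file name to its size in bytes."""
    return {
        path.name: path.stat().st_size
        for path in sorted(Path(unpacked_dir).iterdir())
        # Dictionary components have "dawg" in their component name.
        if path.is_file() and "dawg" in path.name
    }
```

After creating the output directory and running the command (for example with subprocess.run(unpack_command("eng.traineddata", "unpacked/eng."), check=True)), calling dawg_components("unpacked") lists whatever DAWG files were extracted. Comparing their sizes with the size of the whole traineddata file gives a rough idea of how much of it is dictionary data.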

Thank you for your time and guidance.

Respectfully,


Gary

