I see that the default tessdata directory just has English and OSD. I see all the other language data at https://github.com/tesseract-ocr/tessdata. Do I just copy those files into the same tessdata directory? The repo has a much larger version of eng.traineddata than the one that comes with Tesseract. Can I just replace it? And how do the files in the script directory differ?
In the directory from the initial install, not only do I have eng.traineddata, but there is also user-patterns, user-words and other files. Do those files exist for the other languages as well?