[tesseract-ocr] Make russian_with_accent traineddata file

Romain B. (Le Belge) Mon, 05 Feb 2024 00:53:23 -0800

Hi,
<https://stackoverflow.com/posts/77897165/timeline>


I saw that tesseract make the mistakes of turning russian vowels with 
accents(ò,à,...)(used for educational purposes most of the time) into other 
russian letters, and saw that someone, with the same problem 
<https://github.com/tesseract-ocr/langdata/pull/12>, had created trained 
data(if i understood correctly) for russian with accents 
<https://github.com/tesseract-ocr/langdata/tree/main/rus_accent>

The problem is, i can not find a way to make it a traineddata file, to test 
it and later use it in my code. I found the tesstrain 
<https://github.com/tesseract-ocr/tesstrain> git, but was not able to make 
it work with the data found.

I honestly don't know if I am missing something, not understanding 
correctly something, or if we simply don't train data with these types of 
files anymore.

If you got any clue, that would help me a lot.

Thank you!

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/201355ba-dafd-49fd-b0a7-3b42fd8175d8n%40googlegroups.com.

[tesseract-ocr] Make russian_with_accent traineddata file

Reply via email to