I have, but I have stumbled upon a problem that I can't solve. I am trying to build training data for Tesseract 4.00.
When I execute this command:

combine_lang_model --input_unicharset data/unicharset --script_dir data/ tessdata --output_dir data/output --pass_through_recoder --lang MyModel

I get the error "Failed to load script unicharset from:data/ tessdata/Latin.unicharset". The file Latin.unicharset is in the data/tessdata folder, and I don't understand how to fix this. Can you help me?

On Tuesday, June 11, 2019 at 4:10:27 AM UTC+2, ElGato ElMago wrote:
> Did you try the tutorial at all? It's pretty good guidance, though you
> might need help here and there.
>
> On Sunday, June 9, 2019 at 3:27:23 PM UTC+9, Mox Betex wrote:
>> Can someone explain to me how to create training data for Tesseract 4.0?
>> I read the tutorial on the web but I really don't understand it.
>> Is there some GUI software for training?
>> Do I have to create training data with a single font or with images of
>> text lines?
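One way to narrow down the "Failed to load script unicharset" error is to check the path before running the tool. The sketch below is only an illustration under a few assumptions: the helper names build_command and missing_script_files are hypothetical, the script directory is assumed to be data/tessdata as described in the post, and Latin is the only script checked because it is the one named in the error. Only the flags that appear in the command above are used. Whether the space in "data/ tessdata" was actually typed or is just line wrapping in the email is not known; printing the command with shell quoting makes that kind of stray space visible.

```python
import shlex
from pathlib import Path


def build_command(input_unicharset, script_dir, output_dir, lang):
    """Return the combine_lang_model argument list from the post (hypothetical helper)."""
    return [
        "combine_lang_model",
        "--input_unicharset", str(input_unicharset),
        "--script_dir", str(script_dir),
        "--output_dir", str(output_dir),
        "--pass_through_recoder",
        "--lang", lang,
    ]


def missing_script_files(script_dir, scripts=("Latin",)):
    """List the <script>.unicharset files that are absent from script_dir."""
    script_dir = Path(script_dir)
    return [
        script_dir / f"{name}.unicharset"
        for name in scripts
        if not (script_dir / f"{name}.unicharset").is_file()
    ]


if __name__ == "__main__":
    # Assumption: the folder described in the post, written without a space.
    script_dir = "data/tessdata"
    for path in missing_script_files(script_dir):
        print(f"missing: {path}")
    cmd = build_command("data/unicharset", script_dir, "data/output", "MyModel")
    # shlex.join quotes any argument containing whitespace, so a path such as
    # "data/ tessdata" would show up in quotes instead of passing unnoticed.
    print(shlex.join(cmd))
```

If the script prints nothing under "missing" and the printed command shows no quoted path, the directory argument is at least being passed as a single, existing location; the list can then be handed to subprocess.run as-is.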