First of all: if you follow any tutorial on internet - report the problem to the author of the tutorial. Next: use official documentation for training. I see there are a bunch of folks just "generating content" - to gain an audience. Without insight and therefore also without support, using old/outdated information... Tesseract 4 was released 29 Oct 2018. Almost 4 year ago! The recent tesseract version is 5.2 and training process was also improved: https://github.com/tesseract-ocr/tesstrain
Zdenko st 31. 8. 2022 o 0:18 John Alway <[email protected]> napĂsal(a): > Hello, > > I've been following a tutorial on youtube titled "Tesseract OCR - Lesson > 2: Training Tesseract for new font" here: > https://www.youtube.com/watch?v=1v8BPw0Dn0I&ab_channel=TheCode > > I'm using tesseract 4.0 on Window 10. > > I went through the steps he used, and everything seems to go smoothly > until I get to the actual training. When I run "mftraining" the program > hangs. It seems to get stuck and doesn't indicate why are what it's doing. > > I'm using a set of fonts in an image. I have the full alphabet upper and > lower case and the numbers 0 to 9 on the png image. I've attached the > image. Unlike him, I'm using the English. I don't know the font, so I'm > just calling it tiktok to give it a name. My training file is called > *eng.tiktok.exp0.tr > <http://eng.tiktok.exp0.tr> * > > I used* jTessBoxEditor* to correct mistakes and set the box sizes and > positions precisely. > > > When I run this command: > *mftraining -F font_properties -U unicharset -O eng.unicharset > eng.tiktok.exp0.tr <http://eng.tiktok.exp0.tr>* > > The program just hangs. I've waited over twenty minutes. > > Should I wait longer? What could cause it to hang? > > > > Thanks! > ...John > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/534c3f74-420b-4c96-83dd-609bcb002f81n%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/534c3f74-420b-4c96-83dd-609bcb002f81n%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8w5MX1XchQp6jfu2Vz06zWp82HxbDHrgp7%2BQ_Neh%2BDeug%40mail.gmail.com.

