In order to train Tesseract you need two things: an image of each character and the name of that character. These are provided to Tesseract through “box files”. A box file contains the coordinates of the bounding box around each character, together with a label saying which character it is.
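To make the format concrete: in the Tesseract 3.x box files I have seen, each line holds one character followed by its left, bottom, right and top pixel coordinates and a page number, with the origin at the bottom-left corner of the image. Here is a minimal Python sketch for reading such lines, assuming that layout. The names `Box`, `parse_box_line` and `parse_box_text` and the sample coordinates are made up for illustration; they are not part of Tesseract.

```python
from typing import List, NamedTuple


class Box(NamedTuple):
    """One line of a box file (hypothetical helper type, not a Tesseract API)."""
    char: str
    left: int
    bottom: int
    right: int
    top: int
    page: int


def parse_box_line(line: str) -> Box:
    # Split from the right so the label on the left is kept whole,
    # even if it is more than one character long.
    parts = line.strip().rsplit(" ", 5)
    if len(parts) != 6:
        raise ValueError(f"not a box line: {line!r}")
    char, *numbers = parts
    left, bottom, right, top, page = (int(n) for n in numbers)
    return Box(char, left, bottom, right, top, page)


def parse_box_text(text: str) -> List[Box]:
    # Skip blank lines; every other line is expected to be one box.
    return [parse_box_line(l) for l in text.splitlines() if l.strip()]


# Sample content with invented coordinates, in the assumed layout:
# <char> <left> <bottom> <right> <top> <page>
sample = "T 36 92 53 115 0\nh 55 92 65 116 0\n"
boxes = parse_box_text(sample)
for b in boxes:
    print(b.char, "width:", b.right - b.left, "height:", b.top - b.bottom)
```

Reading the file this way is handy for spotting boxes that are clearly the wrong size before you correct the labels by hand.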
The general form of the command is:

    tesseract [lang].[fontname].exp[num].tif [lang].[fontname].exp[num] batch.nochop makebox

For example:

    tesseract eng.timesitalic.exp0.tif eng.timesitalic.exp0 batch.nochop makebox

This writes the box file eng.timesitalic.exp0.box.

On Wednesday, January 9, 2013 11:36:34 AM UTC-5, Stephane Holy wrote:
> Hello,
>
> I'm trying Tesseract training but I'm a bit lost on the process and
> documentation doesn't help me. So, I would like to know what is
> a box file and how to create one ?
>
> Thanks in advance!

