In order to train tesseract you need two things. An image of the character 
and the name of the character. This is provide to tesseract with the help 
of “box files”.  The box files contains the co ordinates of the bounding 
boxes around characters with the labels as what those characters are.


The command is 

tesseract eng.timesitalic.exp0.tif eng.timesitalic.exp0 batch.nochop makebox


tesseract [lang].[fontname].exp[num].tif [lang].[fontname].exp[num] 
batch.nochop makebox




On Wednesday, January 9, 2013 11:36:34 AM UTC-5, Stephane Holy wrote:
>
> Hello,
>
> I'm trying Tesseract training but I'm a bit lost on the process and 
> documentation doesn't help me. So, I would like to know what is 
> a box file and how to create one ?
>
> Thanks in advance!
>
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to