Hi there,

I am using Tesseract 3.01 under Linux.

I can successfully create traineddata  from one *.tif file. But
combining different tif / box files give me an exception:
What are the steps:

Let's say I want to create a traineddata from two tif files: 01.tif
and 02.tif

1. tesseract 01.tif 01 batch.nochop makebox
2. tesseract 02.tif 02 batch.nochop makebox
3. I check the two box files using jTessBoxEditor
4. tesseract 01.tif 01 nobatch box.train
5. tesseract 02.tif 02 nobatch box.train
6. As described under 
http://wiki.ubuntuusers.de/tesseract-ocr/tesseract-ocr_trainieren
(sorry: it is on German, but the commands are the same) I create the
*.tr files:
7. mftraining 01.tr 02.tr

But this results in error: Reading 01.tr ...01 has no defined
properties.
!"Missing font_properties entry is a fatal error!":Error:Assert
failed:in file mftraining.cpp, line 287
Segmentation fault


Also trying to create unicharset with

unicharset_extractor 01.box 02.box

works successfully, but mftraining -U ./unicharset 01.tr 02.tr fails
with the same error.


Somebody has an idea what I am doing wrong. Also using the group e.g.
with the search word "combine" did not result in any fitting
solution.

Thanks for any advice,

Holm from Germany

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesseract-ocr@googlegroups.com
To unsubscribe from this group, send email to
tesseract-ocr+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to