It it written on training doc[1]: "*…**each .tr filename must match an entry in the font_properties file, or mftraining will abort.*"
So you could save your time if you read documentation. Zdenko [1] http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#font_properties_(new_in_3.01) On Wed, Jun 1, 2011 at 4:33 PM, Holm Dressler <velovity1...@googlemail.com>wrote: > Hi there, > > OK, found it out by myself: here are the steps: > > 1. Create 01.tr with tesseract 01.tif 01 nobatch box.train > 2. Create 02.tr with tesseract 02.tif 02 nobatch box.train > 3. Create unicharset with: unicharset_extractor 01.box 02.box > 4. Just copy it (maybe it is not necessary) cp unicharset > 02.unicharset > 5. echo 01 0 0 0 0 0 > font_properties > 6. echo 02 0 0 0 0 0 >> font_properties > 7. mftraining -F font_properties -U unicharset 01.tr 02.tr > > SO YOU SEE: step 6 was missing (with >> which means you should have > two lines in your font_properties) > > > So Jimmi: now it is your turn :-) > > Talk soon > > Holm > > > > On May 26, 2:23 pm, zdenko podobny <zde...@gmail.com> wrote: > > On Thu, May 26, 2011 at 2:02 PM, Sarel van der Merwe < > sfvdme...@gmail.com>wrote: > > > > > Hi, > > > > > Do you know where i can locate the version 3 manual or reference guide > > > for Tesseract.. > > > > > The I know is in download section (tessdoc-html-3.0.0-preview1.tar.gz) > ;-) > > > > Maybe Jimmi will update it for 3.01 :-) > > Some good information could be found in tesseract forums. > > All links are on main project page. Surprisingly ;-) > > > > Zdenko > > > > Thanks > > > > > > > > > Sarel > > > > > On Thu, May 26, 2011 at 1:33 PM, zdenko podobny <zde...@gmail.com> > wrote: > > > > Hi, > > > > Problem is that you use the latest version and you do not read the > latest > > > > manual [1]. If I correctly understood that German manual (via google > > > > translate), it is for version 3.00 so it do not follow changes in > 3.01 > > > > version. > > > > Another "problem": 3.01 is not released yet. It is for developers and > > > > experienced tester for testing and bug reporting. IMHO 3.01 training > is > > > not > > > > fully documented. > > > > > > Zdenko > > > > [1]http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 > > > > On Thu, May 26, 2011 at 10:59 AM, Holm Dressler > > > > <velovity1...@googlemail.com> wrote: > > > > > >> Hi there, > > > > > >> I am using Tesseract 3.01 under Linux. > > > > > >> I can successfully create traineddata from one *.tif file. But > > > >> combining different tif / box files give me an exception: > > > >> What are the steps: > > > > > >> Let's say I want to create a traineddata from two tif files: 01.tif > > > >> and 02.tif > > > > > >> 1. tesseract 01.tif 01 batch.nochop makebox > > > >> 2. tesseract 02.tif 02 batch.nochop makebox > > > >> 3. I check the two box files using jTessBoxEditor > > > >> 4. tesseract 01.tif 01 nobatch box.train > > > >> 5. tesseract 02.tif 02 nobatch box.train > > > >> 6. As described under > > > >>http://wiki.ubuntuusers.de/tesseract-ocr/tesseract-ocr_trainieren > > > >> (sorry: it is on German, but the commands are the same) I create the > > > >> *.tr files: > > > >> 7. mftraining 01.tr 02.tr > > > > > >> But this results in error: Reading 01.tr ...01 has no defined > > > >> properties. > > > >> !"Missing font_properties entry is a fatal error!":Error:Assert > > > >> failed:in file mftraining.cpp, line 287 > > > >> Segmentation fault > > > > > >> Also trying to create unicharset with > > > > > >> unicharset_extractor 01.box 02.box > > > > > >> works successfully, but mftraining -U ./unicharset 01.tr 02.trfails > > > >> with the same error. > > > > > >> Somebody has an idea what I am doing wrong. > > > > > >> Also using the group e.g. > > > >> with the search word "combine" did not result in any fitting > > > >> solution. > > > > > >> Thanks for any advice, > > > > > >> Holm from Germany > > > > > >> -- > > > >> You received this message because you are subscribed to the Google > > > >> Groups "tesseract-ocr" group. > > > >> To post to this group, send email to tesseract-ocr@googlegroups.com > > > >> To unsubscribe from this group, send email to > > > >> tesseract-ocr+unsubscr...@googlegroups.com > > > >> For more options, visit this group at > > > >>http://groups.google.com/group/tesseract-ocr?hl=en > > > > > > -- > > > > You received this message because you are subscribed to the Google > > > > Groups "tesseract-ocr" group. > > > > To post to this group, send email to tesseract-ocr@googlegroups.com > > > > To unsubscribe from this group, send email to > > > > tesseract-ocr+unsubscr...@googlegroups.com > > > > For more options, visit this group at > > > >http://groups.google.com/group/tesseract-ocr?hl=en > > > > > -- > > > You received this message because you are subscribed to the Google > > > Groups "tesseract-ocr" group. > > > To post to this group, send email to tesseract-ocr@googlegroups.com > > > To unsubscribe from this group, send email to > > > tesseract-ocr+unsubscr...@googlegroups.com > > > For more options, visit this group at > > >http://groups.google.com/group/tesseract-ocr?hl=en > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to tesseract-ocr@googlegroups.com > To unsubscribe from this group, send email to > tesseract-ocr+unsubscr...@googlegroups.com > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en