Dňa 02.05.2012 16:46, nkantan r wrote / napísal(a): > hi all! > find below the log on generating a tr file; > > ==== > Page 0 > APPLY_BOXES: > Boxes read from boxfile: 3312 > Boxes failed resegmentation: 0 > Found 3312 good blobs and 3 unlabelled blobs in 0 words. > 0 remaining unlabelled words deleted. > TRAINING ... Font name = TAMKambanNarrow > Generated training data for 220 words > Page 1 > APPLY_BOXES: > Boxes read from boxfile: 3312 > Boxes failed resegmentation: 0 > Found 3312 good blobs and 3 unlabelled blobs in 0 words. > 0 remaining unlabelled words deleted. > Generated training data for 232 words > ============ > > normally i get "0 unlabelled blobs in 0 words" and if i deliberately > deleted any boxes i get "nn boxes in 0 words"; but in this particular > tif and box files all orginally generated boxes are labelled (either > individually or after merging or splitting); so no blob is left > unlabelled; i went through the box/tif file using jTess box editor; > but i could not locate any unlabelled blobs; > is there a way to generate the box-coordinates in the log file so that > i can definitely check that all boxes are covered? > > regards > rnkantan > I am not sure if I understand you correctly. Do you need to visualize (e.g. draw rectangle) base on this king of message [1]?
Or in your log file there in no such message ([1])? It would be good to post you file somewhere for further testing... [1] APPLY_BOXES: Unlabelled word at :Bounding box=(239,3113)->(396,3153) -- Zdenko -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en