Dear Adam, > Tool allows to "cut" images on top of glyph data from PAGE file and afterwards > create Tesseract training page with respective box file. This can be used for > Tesseract training. I was testing this using script: > https://github.com/psnc-dl > /page-generator/blob/master/src/etc/train.sh and it seems that it can produce > valid Tesseract profile.
That sounds a lot like the tool that Matthew announced a few days ago (in this very thread). Can you explain the differences a little, please? > Page-generator supports also output from our tool -- Cutouts (http:// > wlt.synat.pcss.pl/cutouts, https://confluence.man.poznan.pl/community/display/ > WLT/Cutouts+application) which allows to work on preparation of training > material. That's interesting. Am I correct in thinking that this replaces Aletheia as a tool to extract glyph images in your workflow? Is the code available? Is it freely licenced? Many thanks, I look forward to learning more. Nick -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

