Hi Matt,

Firstly, I share the general feeling that depending so strongly on a proprietary tool sucks a great deal (and would do even if they weren't so bad at processing registrations).

Aside from that, though, my main question is: what does this tool do that box editors like jTessBoxEditor don't? Is the workflow nicer? Are there other useful features it brings? Are the existing tools not well set up for training from historical documents?

Am I right in thinking that the main feature you bring is essentially the ability to remove parts of a scanned page you don't want (whether because the character samples aren't very representative or for some other reason)? That's what I got from reading the webpage you linked to. But I don't see why that's preferable to just not including boxes around the parts you don't care for. Am I missing something?

Thanks, I look forward to learning more.

Nick

