I've been doing some research on using Tesseract for both document scans and text in scenery, and I was wondering what image processors are best? Given I have a lot of images, I cannot process each batch by hand, so I will have to make a pretty smart pre-processing app before I get started. This will be developed against a Linux OS, so right now I am looking at:
Leptonica ImageMagick OpenCV GIMP ImageJ I also thought about unpaper or scantailor, but the level of documentation is pretty small relative to the above technologies, and I highly value strong documentation support. Any thoughts which of these are best, or if I am missing anything obvious? Thanks for your help. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/14bc5af1-697a-4016-81a3-92288530f256%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.