Re: A reliable ocr program for Fedora

dwoody5654 Tue, 15 Dec 2015 13:12:24 -0800

On 12/15/2015 03:00 PM, Tom Horsley wrote:

> If you have PDF files with actual characters, the
> pdftotext tool works well for extracting the text
> (though not necessarily the layout).

There is an option for that: -layout
It does a good job of preserving the layout.
David
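
For anyone scripting this, here is a minimal sketch of calling
pdftotext with -layout from Python. The helper names (pdftotext_cmd,
extract_text) and the file names are made up for illustration, and it
assumes pdftotext (from poppler-utils) is installed and on the PATH.

```python
import subprocess


def pdftotext_cmd(pdf_path, txt_path, keep_layout=True):
    """Build the pdftotext argument list (hypothetical helper)."""
    cmd = ["pdftotext"]
    if keep_layout:
        # -layout asks pdftotext to keep the physical layout of the page
        cmd.append("-layout")
    cmd += [pdf_path, txt_path]
    return cmd


def extract_text(pdf_path, txt_path, keep_layout=True):
    """Run pdftotext; raises CalledProcessError on a non-zero exit."""
    subprocess.run(pdftotext_cmd(pdf_path, txt_path, keep_layout), check=True)
```

Something like extract_text("report.pdf", "report.txt") would then
write the text to report.txt; the file names are only placeholders.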

> As far as doing OCR from actual image files,
> I always found tesseract to work better than most
> (but it was still pretty feeble).
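
A rough sketch of driving tesseract the same way, in case it helps.
The helper names and file names are hypothetical, "eng" is only an
assumed default language, and it assumes the tesseract binary plus the
matching language data are installed.

```python
import subprocess


def tesseract_cmd(image_path, output_base, lang="eng"):
    """Build the tesseract argument list (hypothetical helper).

    tesseract appends ".txt" to output_base on its own.
    """
    return ["tesseract", image_path, output_base, "-l", lang]


def ocr_image(image_path, output_base, lang="eng"):
    """Run tesseract; raises CalledProcessError on a non-zero exit."""
    subprocess.run(tesseract_cmd(image_path, output_base, lang), check=True)
```

With this, ocr_image("scan.png", "scan") should leave the recognized
text in scan.txt; how good that text is depends heavily on the scan.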

--
users mailing list
users@lists.fedoraproject.org
To unsubscribe or change subscription options:
https://admin.fedoraproject.org/mailman/listinfo/users
Fedora Code of Conduct: http://fedoraproject.org/code-of-conduct
Guidelines: http://fedoraproject.org/wiki/Mailing_list_guidelines
Have a question? Ask away: http://ask.fedoraproject.org
