Quote/Cytat - matthew christy <[email protected]> (Fri 06 Dec 2013 09:10:56 PM CET):

Hi All,

The Initiative for Digital Humanities, Media, and Culture (IDHMC) at Texas
A&M University, as part of its Early Modern OCR Project (eMOP<http://emop.tamu.edu/>)
has created a new tool, called Franken+, that provides a way to create font
training for the Tesseract OCR engine using page images. This is in
contrast to Tesseract's documented method<http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3>of font training which involves using a word processing program with a
modern font. Franken+ has now been released for beta testing and we invite
anyone who's interested to give it a try and to please provide feedback.

Franken+ works in conjunction with PRImA's open source Aletheia tool<http://www.primaresearch.org/tools.php>

Aletheia is not an open source tool. Not only the source is not available, but you can download it only for "personal research" after registration.

It's a pity your very interesting tool has non-free prerequisites.

Best regards

Janusz

--
Prof. dr hab. Janusz S. Bień - Uniwersytet Warszawski (Katedra Lingwistyki Formalnej)
Prof. Janusz S. Bień - University of Warsaw (Formal Linguistics Department)
[email protected], [email protected], http://fleksem.klf.uw.edu.pl/~jsbien/

--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to