Re: Franken+ Released -- New Tool For Training Tesseract on Fonts from Page Images

Janusz S. Bien Fri, 06 Dec 2013 13:07:03 -0800

Quote/Cytat - matthew christy <[email protected]> (Fri 06 Dec2013 09:10:56 PM CET):

Hi All,
The Initiative for Digital Humanities, Media, and Culture (IDHMC) at Texas
A&M University, as part of its Early Modern OCR Project(eMOP<http://emop.tamu.edu/>)
has created a new tool, called Franken+, that provides a way to create font
training for the Tesseract OCR engine using page images. This is in
contrast to Tesseract's documentedmethod<http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3>offont training which involves using a word processing program with a
modern font. Franken+ has now been released for beta testing and we invite
anyone who's interested to give it a try and to please provide feedback.
Franken+ works in conjunction with PRImA's open source Aletheiatool<http://www.primaresearch.org/tools.php>

Aletheia is not an open source tool. Not only the source is notavailable, but you can download it only for "personal research" afterregistration.


It's a pity your very interesting tool has non-free prerequisites.

Best regards

Janusz

--

Prof. dr hab. Janusz S. Bień - Uniwersytet Warszawski (KatedraLingwistyki Formalnej)

Prof. Janusz S. Bień - University of Warsaw (Formal Linguistics Department)
[email protected], [email protected], http://fleksem.klf.uw.edu.pl/~jsbien/

--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

---You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.

To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Re: Franken+ Released -- New Tool For Training Tesseract on Fonts from Page Images

Reply via email to