As you mentioned tesseract 4.0 is only support for the unicode fonts. What 
is the procedure if we want to trained with non-unicode fonts. Since most 
of the documents written in Sri Lanka are in non-unicode fonts and there 
are lots of historical books available which written on non-unicode forms.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/a280b31b-f2c3-494e-a69e-ac3e36f02382%40googlegroups.com.

Reply via email to