As you mentioned, Tesseract 4.0 only supports Unicode fonts. What is the procedure if we want to train it with non-Unicode fonts? Most documents written in Sri Lanka use non-Unicode fonts, and there are many historical books written in non-Unicode form.

