It is now 2 years since this answer was posted. Is it possible to train Tesseract 4 on real images now?

On Thursday, January 11, 2018 at 2:27:43 PM UTC+5:30 shree wrote:
> Currently, Ray/Google has NOT released info on how to train Tesseract 4
> (LSTM) with real life images. The only supported option is to use synthetic
> training data created by tesstrain.sh script using training text and
> unicode fonts.
>
> To train an LSTM model from scratch requires a large amount of training
> data and huge computing resources and time (in days/weeks).
>
> As a user, your best bet for training is to try finetuning for a
> particular font or adding a couple of characters.