Of course you can! Just check out the tesstrain tool, which trains Tesseract 4 (LSTM) models from real line images and their transcriptions: https://github.com/tesseract-ocr/tesstrain
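In case it helps, here is a rough sketch of a sanity check you might run before training. As far as I recall from the tesstrain README, each line image (`.tif` or `.png`) in the ground-truth directory needs a transcription file with the same base name and the extension `.gt.txt`. The helper name `find_unpaired` is my own invention and not part of tesstrain, and the sketch only handles plain `.tif` and `.png` names, not other suffix variants.

```python
from pathlib import Path

# Assumption: only plain .tif and .png line images are considered here.
IMAGE_SUFFIXES = {".tif", ".png"}


def find_unpaired(ground_truth_dir):
    """Return (images lacking a .gt.txt, .gt.txt files lacking an image).

    Hypothetical helper for illustration; it is not part of tesstrain.
    Both results are sorted lists of base names.
    """
    root = Path(ground_truth_dir)
    entries = list(root.iterdir())

    # Base names of line images, e.g. "line_001.png" -> "line_001".
    images = {
        p.name[: -len(p.suffix)]
        for p in entries
        if p.suffix.lower() in IMAGE_SUFFIXES
    }
    # Base names of transcriptions, e.g. "line_001.gt.txt" -> "line_001".
    texts = {
        p.name[: -len(".gt.txt")]
        for p in entries
        if p.name.endswith(".gt.txt")
    }
    return sorted(images - texts), sorted(texts - images)
```

You would call it as `find_unpaired("data/foo-ground-truth")`, where `foo` stands in for whatever model name you pick, and fix anything it reports before starting the training run.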
Cheers,
Kay

On Fri, Jan 8, 2021 at 9:32 AM Murtuza Dahodwala <murtuzamda...@gmail.com> wrote:
>
> It is now 2 years since this answer was posted. Is it possible to train
> tesseract 4 on real images now?
>
> On Thursday, January 11, 2018 at 2:27:43 PM UTC+5:30 shree wrote:
>>
>> Currently, Ray/Google has NOT released info on how to train Tesseract 4
>> (LSTM) with real life images. The only supported option is to use synthetic
>> training data created by tesstrain.sh script using training text and unicode
>> fonts.
>>
>> To train an LSTM model from scratch requires a large amount of training data
>> and huge computing resources and time (in days/weeks).
>>
>> As a user, your best bet for training is to try finetuning for a particular
>> font or adding a couple of characters.