Tesstrain only uses single lines of text for training. I want to train on several blocks of text.
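
A possible workaround, sketched under assumptions: as far as I understand the tesstrain README, it reads line images paired with `.gt.txt` transcription files from a ground-truth directory. If each block image has already been cropped into line images by some other means, the block transcription can be split into matching per-line files. The function name `write_line_ground_truth` and the file names below are hypothetical, not part of tesstrain.

```python
from pathlib import Path


def write_line_ground_truth(block_text, line_images, out_dir):
    """Write one .gt.txt file per text line, named after its line image.

    Assumes the block image was already cropped into line images
    (line_images), listed in the same top-to-bottom order as the text.
    """
    # Keep only non-empty lines of the block transcription.
    lines = [ln.strip() for ln in block_text.splitlines() if ln.strip()]
    if len(lines) != len(line_images):
        raise ValueError(
            f"{len(lines)} text lines but {len(line_images)} line images"
        )

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    written = []
    for text, image in zip(lines, line_images):
        # e.g. block1_0.png -> block1_0.gt.txt
        gt_path = out / (Path(image).stem + ".gt.txt")
        gt_path.write_text(text + "\n", encoding="utf-8")
        written.append(gt_path)
    return written
```

Calling `write_line_ground_truth("first line\nsecond line", ["block1_0.png", "block1_1.png"], "data/foo-ground-truth")` would write two transcription files; the line images still have to be copied into the same directory, and `foo` stands in for whatever model name is used.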

On Saturday, January 9, 2021 at 8:04:18 PM UTC+5:30 wuer...@gmail.com wrote:

> Of course you can! Just checkout the tesstrain tool:
> https://github.com/tesseract-ocr/tesstrain
>
> Cheers,
> Kay
>
> On Fri, Jan 8, 2021 at 9:32 AM Murtuza Dahodwala <murtuz...@gmail.com> wrote:
> >
> > It is now 2 years since this answer was posted. Is it possible to train
> > tesseract 4 on real images now?
> >
> > On Thursday, January 11, 2018 at 2:27:43 PM UTC+5:30 shree wrote:
> >>
> >> Currently, Ray/Google has NOT released info on how to train Tesseract 4
> >> (LSTM) with real life images. The only supported option is to use
> >> synthetic training data created by tesstrain.sh script using training
> >> text and unicode fonts.
> >>
> >> To train an LSTM model from scratch requires a large amount of training
> >> data and huge computing resources and time (in days/weeks).
> >>
> >> As a user, your best bet for training is to try finetuning for a
> >> particular font or adding a couple of characters.