Take a look at tesseract-ocr/tesstrain On Tue, Jan 14, 2020 at 10:13 PM 'Fabio Lugli' via tesseract-ocr < tesseract-ocr@googlegroups.com> wrote:
> Hello everyone, i'm trying to train tesseract on handwriting, knowing that > it's not the best option, using the latest version available for Windows. I > have access to a huge amount of .tif files, lines of handwritten text, i'm > able to obtain the .box files, which I later edit to be compliant to the > latest requirements (boxes all over the line, spaces between words, tab at > the end). After that i did not understand how to improve eng.traineddata or > how to create an own .traineddata file, also following the instructions on > https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00. > So which are the next passages to obtain a correct training dataset? > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/b736f06c-0627-41ad-bd2a-6dcad01b4576%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/b736f06c-0627-41ad-bd2a-6dcad01b4576%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUmONZCXetsLonNqoUgv%3DmgND7%2B8iCbcybifCi7BmEoUA%40mail.gmail.com.