Re: [tesseract-ocr] training doubts

2020-10-20 Thread Shree Devi Kumar
For English, preprocessing your images and using the official traineddata will, most of the time, give better results than trying to train. For fine-tuning (https://tesseract-ocr.github.io/tessdoc/TrainingTesseract-4.00.html#fine-tuning-for-impact), what is recommended is using the existing trai

[tesseract-ocr] training doubts

2020-10-20 Thread Kumar Rajwani
Hey, I need a little help, as I have to train Tesseract on my documents. I have already read some training issues and I have steps that I can perform. 1. !tesseract "document.png" "document" -l eng --psm 11 wordstrbox It will give me line-level boxes; correct the OCR. Copy the image file and
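
For anyone scripting step 1 outside a notebook, here is a minimal Python sketch of the same command. The helper names (build_wordstrbox_command, run_wordstrbox) are made up for illustration, and the ".box" output name is an assumption about what the wordstrbox config writes, so check it against your Tesseract version.

```python
import shutil
import subprocess


def build_wordstrbox_command(image_path, output_base, lang="eng", psm=11):
    """Build the argument list for the command quoted in step 1.

    Hypothetical helper: it only mirrors the flags from the message
    (-l, --psm, and the wordstrbox config name).
    """
    return [
        "tesseract",
        image_path,
        output_base,
        "-l", lang,
        "--psm", str(psm),
        "wordstrbox",
    ]


def run_wordstrbox(image_path, output_base):
    """Run the command if the tesseract binary is on PATH.

    Returns the expected box file name, or None when tesseract is missing.
    The ".box" suffix is an assumption, not something stated in the thread.
    """
    if shutil.which("tesseract") is None:
        return None
    subprocess.run(build_wordstrbox_command(image_path, output_base), check=True)
    return output_base + ".box"
```

Calling run_wordstrbox("document.png", "document") would then correspond to the notebook line above; the box file still has to be corrected by hand before it is used for training.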