For English, most of the times, preprocessing your images and using
official traineddata will give better results than trying to do training.
For finetuning, (
https://tesseract-ocr.github.io/tessdoc/TrainingTesseract-4.00.html#fine-tuning-for-impact)
what is recommended is using the existing trai
hey i need small help as i have to train tesseract on my documents.
I have already read some training issues and i have steps that i can
perform.
1.
!tesseract "document.png" "document" -l eng --psm 11 wordstrbox it will
give me line lavel box
correct ocr. copy image file and
2 matches
Mail list logo