Re: [tesseract-ocr] Tesseract Performance

2020-12-24 Thread Shree Devi Kumar
>testing an unseen image, the performance was exactly the same. Can you share the image (preferably a page) and expected result? On Thu, Dec 24, 2020 at 8:36 PM Soumik Ranjan Dasgupta < ranjansou...@gmail.com> wrote: > Hi everyone, > I wanted to do fine-tune the ben.traineddata model by using so

Re: [tesseract-ocr] Tesseract Performance

2020-12-24 Thread Lorenzo Bolzani
If the results are exactly the same the most likely explanation is that you are still using the old model. Try to move or rename the new model and see if something change. Did you see an improvement during the training? Mean rms, char train, word train, ecc. Bye Lorenzo Il giorno gio 24 dic

[tesseract-ocr] Tesseract Performance

2020-12-24 Thread Soumik Ranjan Dasgupta
Hi everyone, I wanted to do fine-tune the ben.traineddata model by using some ancient text that were supposedly printed with typeset. I have roughly around 1k lines of text and tried the normal fine-tuning approach with around 25k iterations. The thing that surprised me the most was even after