[tesseract-ocr] Tesseract unstable font property prediction

2022-09-02 Thread Kehinde Adeoya
Tesseract 3.0.5 TessData 3.0.4 Tesseract 5Java binding. I am using Tesseract 3.0.5 in a project, which is awesome. It works brilliantly well. Lately, I noticed its predictability changes when the same code is run multiple times for the same image text. I was able to train new fonts in different

[tesseract-ocr] Incorrect recognition of Latin words inside Arabic text

2022-09-02 Thread Naourass Derouichi
Hi all, I'm trying to ocr images similar to the attached one, but the error rate of Latin words is too high. I tried all PSMs with the following models from tessdata_best: *ara*, *eng* , *fra*, *Ara (*in different orders)*. *I even tried finetuning them on the font used in the input images. *Sa

Re: [tesseract-ocr] Incorrect recognition of Latin words inside Arabic text

2022-09-02 Thread Zdenko Podobny
Please stop abusing the tesseract forum. Why are you sending the same email again and again? Zdenko pi 2. 9. 2022 o 20:24 Naourass Derouichi napísal(a): > Hi all, I'm trying to ocr images similar to the attached one, but the > error rate of Latin words is too high. > > I tried all PSMs with th

Re: [tesseract-ocr] Incorrect recognition of Latin words inside Arabic text

2022-09-02 Thread Naourass Derouichi
Sorry everyone, I didn't know that an email is distributed for each new post. I forgot to attach the image and didn't find out how to edit the post so I created a new one. This is my first time using this forum. Have a good day :) Le vendredi 2 septembre 2022 à 19:35:29 UTC+1, zdenop a écrit :

Re: [tesseract-ocr] Training Fonts, mftraining hangs

2022-09-02 Thread Jaspreet Kaur
Preprocessing helps when your images are not clear to enhance the image quality. You can also work with box file by correcting the box files and put right characters into it. On Wed, Aug 31, 2022 at 8:40 PM Adrian Paul Ciobanita < adrian.cioban...@gmail.com> wrote: > Can you recomend tutorials, o

Re: [tesseract-ocr] Training Fonts, mftraining hangs

2022-09-02 Thread Adrian Paul Ciobanita
Hello Jaspreet, Do you know of resources / documentation that explain step by step how to correctly use those box files, with ground truth files? I know jsTextEditor can help out with that, but I never been able to correctly use those box files, after. I cannot find a good article to explain how t