Re: [tesseract-ocr] Train new font using Tesseract 5 with legacy tessdata 3.0.5

2022-05-20 Thread Zdenko Podobny
Can you please clarify what exactly you want to do / achieve? Training LSTM model or legacy model? Zdenko št 19. 5. 2022 o 16:12 Kehinde Adeoya napísal(a): > Are the tutorials where it is detailed on how to train a new font using > the latest Tesseract-5 and Tessdata-3.0.5? I have not found an

Re: [tesseract-ocr] Support for multithreading build by CMake doesn't work

2022-05-20 Thread Zdenko Podobny
Best way would be to try it ;-) AFAIR there were similar approaches e.g. ([1], [2], [3] - IMHO using GNU Parallel was quite popular; search for "tesseract parallel" - google provide 1.28 Mio results), but please be aware of this open issue[4]... [1] https://appliedmachinelearning.blog/2018/06/30/p

[tesseract-ocr] Tesseract not recognising trained font

2022-05-20 Thread Kehinde Adeoya
I have newly trained new fonts successfully. I trained Ubuntu and Inter fonts. Likewise, 1. I noticed Tesseract does not recognize them, but kept returning a strange name for the fonts. It returned the 1809_Homer font name for Ubuntu, and kept me wondering if there is anything wrong with the t

[tesseract-ocr] Tesseract not recognising trained fonts

2022-05-20 Thread Kehinde Adeoya
I have newly trained new fonts successfully. I trained Ubuntu and Inter fonts. I am using Tesseract 3.0.5, and Tessdata-3.0.4. 1. I noticed Tesseract does not recognize them, but kept returning a strange name for the fonts. It returned the 1809_Homer font name for Ubuntu, and kept me wondering i

Re: [tesseract-ocr] Train new font using Tesseract 5 with legacy tessdata 3.0.5

2022-05-20 Thread Kehinde Adeoya
Thanks, @Zdenko I have newly trained new fonts successfully. I trained Ubuntu and Inter fonts. I am using Tesseract 3.0.5, and Tessdata-3.0.4. 1. I noticed Tesseract does not recognize them, but kept returning a strange name for the fonts. It returned the 1809_Homer font name for Ubuntu, and kep

[tesseract-ocr] Tesseract not detecting Ubuntu and Inter Google fonts but returning the wrong font - 1809_Homer

2022-05-20 Thread Kehinde Adeoya
I have newly trained new fonts successfully. I trained Ubuntu and Inter fonts. I am using Tesseract 3.0.5, and Tessdata-3.0.4. 1. I noticed Tesseract does not recognize them, but kept returning a strange name for the fonts. It returned the 1809_Homer font name for Ubuntu, and kept me wondering i

[tesseract-ocr] Tesseract unable to recognise Ubuntu and Inter fonts, it returned - 1809_Homer

2022-05-20 Thread Kehinde Adeoya
I have newly trained new fonts successfully. I trained Ubuntu and Inter fonts. I am using Tesseract 3.0.5, and Tessdata-3.0.4. 1. I noticed Tesseract does not recognize them, but kept returning a strange name for the fonts. It returned the 1809_Homer font name for Ubuntu, and Inter. This kept m