[tesseract-ocr] fine tuning on images

2024-03-14 Thread roei shlezinger
Hello, I have relatively clear images in Hebrew and Tesseract produces reasonable but not perfect results. I thought about continuing to train the model to make them better but ran into a problem. Here is the command I run: "bash-4.4# make training MODEL_NAME=test11 GROUND_TRUTH_DIR=/home/tesst

[tesseract-ocr] Re: why are there no new trained models since 2018?

2024-03-14 Thread W.t
https://github.com/tesseract-ocr/tessdata_best/releases/tag/4.1.0 has models uploaded in 2021. There may be newer ones for 5 but I don't know where they are. 2021 is still a pretty long time though, I suppose they achieved as much as they could for general application and anything more requires