As the first step, I would suggest you read
https://github.com/tesseract-ocr/tessdoc/blob/main/ImproveQuality.md

Next: LSTM model is training on words/lines of text so it could have
a problem with "code". For images like these legacy mode is perfect. E.g.:

tesseract WCAZ.png - --psm 6 --oem 0
W C A Z
tesseract DVEO.png - --psm 6 --oem 0
D V E O

The legacy engine model is available in languages files in tessdata
repository (https://github.com/tesseract-ocr/tessdata). Many
installations prefer to use fast model (without legacy model)

Zdenko


so 25. 3. 2023 o 8:39 Kyle Zeneki <kylezen...@gmail.com> napĂ­sal(a):

> Hello, I have these images and I'm trying to print their output using
> Tesseract. I spent 2 hours fine-tuning Tesseract for a specific font, and
> the error rate was 0.163. I used multiple font-detecting websites, and the
> closest match was "Futura Now." However, Tesseract sometimes fails to read
> the "E" from "D V E O" but successfully reads the "E" from "EOPEO." It also
> occasionally misreads "S E G I E" as "Ss Ee G I E." etc. I'm wondering if
> there's a way to train Tesseract by image rather than by font.
> Alternatively, is there a better tool than Tesseract, such as EasyOCR?"
> [image: capture9.png][image: capture4.png][image: capture5.png][image:
> capture6.png][image: capture7.png][image: capture8.png]
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/fffda6e4-5754-4b87-b397-0365793d8c4en%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/fffda6e4-5754-4b87-b397-0365793d8c4en%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8xGoNHX0u6Xcod%3DV0_E8yXarV3rZStUdwjcr%3DXaN1WAzA%40mail.gmail.com.

Reply via email to