Thanks for the version and model information. That'll be useful for anyone trying to help.
My best guess is that something in the Farsi training data is causing this, but I don't know what (and I don't speak Farsi). One thing you might try is the Arabic script model, to see whether it does any better. Other than that, I'm afraid I don't have any good suggestions.

Tom

On Thursday, February 29, 2024 at 1:20:33 AM UTC-5 Iman Firouzian wrote:

> I've installed it using:
> !sudo apt install tesseract-ocr
> in Google Colab.
>
> it says it's the latest version: tesseract-ocr is already the newest
> version (4.1.1-2.1build1).
>
> and the language model is "fas" and is installed by:
> !sudo apt install tesseract-ocr-fas
>
> thanks for helping
>
> On Thursday, February 29, 2024 at 1:50:50 AM UTC+3:30 tfmo...@gmail.com
> wrote:
>
>> On Wednesday, February 28, 2024 at 3:28:51 AM UTC-5 Iman Firouzian wrote:
>>
>>> Please help me with this
>>
>> Please include more details about what version of the software you are
>> using and which language (or script) model(s).
>>
>> Tom
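
For anyone who wants to try the Arabic script model suggestion, here is a minimal sketch of how the two models might be compared on the same image. It assumes the script model has been installed from the Debian/Ubuntu package `tesseract-ocr-script-arab` (in Colab, presumably `!sudo apt install tesseract-ocr-script-arab`) and that it is selected with `-l script/Arabic`. The file name `page.png`, the output names, and the helper `tesseract_command` are hypothetical and only there for illustration.

```python
import shlex


def tesseract_command(image_path, output_base, lang="script/Arabic"):
    """Build a tesseract CLI invocation as an argument list.

    Assumption: `lang` names an installed model, e.g. "fas" or
    "script/Arabic" (the latter from the tesseract-ocr-script-arab package).
    """
    return ["tesseract", image_path, output_base, "-l", lang]


# Print one command per model so the two outputs can be compared side by side.
# "page.png" is a placeholder for the image that is giving trouble.
for lang in ("fas", "script/Arabic"):
    output_base = "out_" + lang.replace("/", "_")
    print(shlex.join(tesseract_command("page.png", output_base, lang)))
```

The printed commands can be pasted into a Colab cell with a leading `!`, or the argument list can be passed to `subprocess.run` directly. Comparing the two resulting text files should show whether the problem is specific to the "fas" model.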