I downloaded it as you suggested, and as the terminal output below shows, the file is now present at the correct place :
$file /home/mbalambala/tesseract/tessdata/eng.traineddata /home/mbalambala/tesseract/tessdata/eng.traineddata : HTML document, UTF-8 Unicode text, with very long lines $ echo TESSDATA_PREFIX /home/mbalambala/tesseract/tessdata but the error message stays exactly the same : $ tesseract Downloads/p1.pdf p1 Error opening data file /home/mbalambala/tesseract/tessdata/eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory. Failed loading language 'eng' Tesseract couldn't load any languages! Could not initialize tesseract. Whatever the real problem is, the error message is not detecting it. On Sunday, January 17, 2021 at 10:37:22 AM UTC+1 ... wrote: > Run the following command in order to get the eng.traineddata file within > the tessdata directory: *wget > https://github.com/tesseract-ocr/tessdata/blob/master/eng.traineddata > <https://github.com/tesseract-ocr/tessdata/blob/master/eng.traineddata>* > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/47e8b734-5de9-4624-8872-ed91ac8775b4n%40googlegroups.com.