[tesseract-ocr] Beginner question : could not initialize tesseract, missing eng.traineddata file in tessdata

2021-01-16 Thread Roparzh Hemon
Hello, I am a complete beginner to Tesseract. I just installed it on my Ubuntu machine. Here is a snippet from my terminal:

$ echo $TESSDATA_PREFIX
/home/mbalambala/tesseract/tessdata
$ tesseract Downloads/p1.pdf p1
Error opening data file /home/mbalambala/tesseract/tessdata/eng.traineddata
P

Re: [tesseract-ocr] Beginner question : could not initialize tesseract, missing eng.traineddata file in tessdata

2021-01-19 Thread Roparzh Hemon
I downloaded it as you suggested, and as the terminal output below shows, the file is now present in the correct place:

$ file /home/mbalambala/tesseract/tessdata/eng.traineddata
/home/mbalambala/tesseract/tessdata/eng.traineddata: HTML document, UTF-8 Unicode text, with very long lines
$ ech

Re: [tesseract-ocr] Beginner question : could not initialize tesseract, missing eng.traineddata file in tessdata

2021-01-19 Thread Roparzh Hemon
>> wget https://github.com/tesseract-ocr/tessdata/blob/master/eng.traineddata
>>
>> On Tue, Jan 19, 2021 at 9:49 PM Roparzh Hemon wrote:
>>>
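
The `file` output quoted earlier in this thread reports the downloaded eng.traineddata as an HTML document. That is what one would expect if the `/blob/` URL returned GitHub's web page for the file rather than the file itself. Below is a minimal shell sketch for checking this before running tesseract. `looks_like_html` is a hypothetical helper name, not part of Tesseract, and the 512-byte window is an arbitrary choice.

```shell
# Hypothetical helper (not part of Tesseract): succeeds (exit status 0)
# if the given file appears to start with an HTML page.
looks_like_html() {
    # Read only the first 512 bytes, drop NUL bytes so binary data is
    # handled cleanly, then search case-insensitively for HTML markers.
    head -c 512 "$1" | tr -d '\000' | LC_ALL=C grep -qiE '<!doctype html|<html'
}
```

For example, `looks_like_html "$TESSDATA_PREFIX/eng.traineddata" && echo "HTML page, not model data"`. If the check succeeds, re-downloading from a raw-file link is the likely fix. On GitHub this is commonly obtained by replacing `/blob/` with `/raw/` in the URL, though that is an assumption about the hosting layout and is worth confirming with `file` afterwards.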