intallation
!apt-get install python-dev libxml2-dev libxslt1-dev antiword unrtf 
poppler-utils pstotext tesseract-ocr 
!sudo apt-get install libenchant1c2a

!sudo add-apt-repository -y ppa:alex-p/tesseract-ocr-devel
!sudo apt-get update
# !sudo apt-get install tesseract-ocr
!sudo apt install -y tesseract-ocr tesseract-ocr-eng tesseract-ocr-osd
!fc-cache -vf
!apt-get install libtesseract-dev libleptonica-dev pkg-config
!pip install pytesseract pdf2image

!tesseract -v


*download image*
!wget 
https://s29843.pcdn.co/blog/wp-content/uploads/sites/2/2020/11/TechSmith-Blog-ExtractText-1024x768.png


*Tesseract cmd*
!tesseract "TechSmith-Blog-ExtractText-1024x768.png" "out"

Tesseract Open Source OCR Engine v5.0.0-alpha-20201231-417-g2d805 with 
Leptonica 
Estimating resolution as 845 
Detected 12 diacritics
*Result*
out.txt file is empty. i also checked with other images. i think the latest 
push have some issues.

can you tell me how can i downgrade to last tesseract 5 version.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/fccfede8-a83b-4805-9add-5287781a57e5n%40googlegroups.com.

Reply via email to