Re: [tesseract-ocr] Re: Traineddata files

Tom Morris Mon, 19 Feb 2024 14:36:15 -0800

On Monday, February 19, 2024 at 1:30:37 AM UTC-5 argo...@gmail.com wrote:


... My question now is why Tesseract does not accept PDF files. PDFs are images, aren't they?


PDF files can contain text, graphics, images, or a mix of them all.

If you have PDF files that contain images, you can extract them using
utilities like Poppler's pdfimages. https://askubuntu.com/a/150106
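
As a rough illustration of that workflow, here is a minimal Python sketch that
extracts the embedded images with pdfimages and then runs tesseract on each
one. It assumes both command-line tools are installed and on your PATH. The
helper names, the "page" file prefix, and the working-directory layout are
made up for this example and are not part of either tool.

```python
import subprocess
from pathlib import Path


def pdfimages_command(pdf_path, prefix):
    """Build the pdfimages call that writes embedded images as PNG files."""
    # -png asks pdfimages to save each extracted image in PNG format;
    # the output files are named <prefix>-000.png, <prefix>-001.png, ...
    return ["pdfimages", "-png", str(pdf_path), str(prefix)]


def tesseract_command(image_path):
    """Build the tesseract call for one image."""
    # "tesseract <image> <outputbase>" writes the text to <outputbase>.txt
    image_path = Path(image_path)
    return ["tesseract", str(image_path), str(image_path.with_suffix(""))]


def ocr_pdf_images(pdf_path, work_dir):
    """Extract images from a PDF, OCR each one, return the text file paths."""
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)
    # Hypothetical prefix "page": chosen for this sketch only
    subprocess.run(pdfimages_command(pdf_path, work_dir / "page"), check=True)
    text_files = []
    for image in sorted(work_dir.glob("page-*.png")):
        subprocess.run(tesseract_command(image), check=True)
        text_files.append(image.with_suffix(".txt"))
    return text_files
```

Calling ocr_pdf_images("scan.pdf", "out") would leave one .txt file next to
each extracted PNG. Note that this only helps when the PDF really contains
page images; if the PDF already contains text, there may be nothing for
pdfimages to extract.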

Tom

