On Monday, February 19, 2024 at 1:30:37 AM UTC-5 argo...@gmail.com wrote:
> ... My question now is why Tesseract does not take PDF. PDFs are images, no?

PDF files can contain text, graphics, images, or a mix of them all. If you have PDF files that contain images, you can extract them using utilities like Poppler's pdfimages.

https://askubuntu.com/a/150106

Tom
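For anyone who wants to script this, here is a minimal sketch of the extract-then-OCR approach described above. It assumes the `pdfimages` (Poppler) and `tesseract` executables are installed and on the PATH. The function names, the `page` output prefix, and the working directory layout are illustrative choices, not part of either tool.

```python
import subprocess
from pathlib import Path


def pdfimages_cmd(pdf_path, out_prefix):
    """Build the pdfimages call that writes embedded images as PNG files.

    pdfimages names its output <out_prefix>-NNN.png.
    """
    return ["pdfimages", "-png", str(pdf_path), str(out_prefix)]


def tesseract_cmd(image_path):
    """Build the tesseract call for one image.

    The second argument is the output base; tesseract appends .txt to it.
    """
    image_path = Path(image_path)
    return ["tesseract", str(image_path), str(image_path.with_suffix(""))]


def ocr_pdf_images(pdf_path, work_dir):
    """Extract the images from a PDF, OCR each one, and return the .txt paths.

    Assumption: the PDF actually contains images (for example scanned pages).
    A PDF that holds only text or vector graphics yields no images here.
    """
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)

    # "page" is a hypothetical prefix chosen for this sketch.
    subprocess.run(pdfimages_cmd(pdf_path, work_dir / "page"), check=True)

    text_files = []
    for image in sorted(work_dir.glob("page-*.png")):
        subprocess.run(tesseract_cmd(image), check=True)
        text_files.append(image.with_suffix(".txt"))
    return text_files
```

Calling `ocr_pdf_images("scan.pdf", "out")` would leave one `.txt` file next to each extracted PNG in `out/`. An empty list means pdfimages found no embedded images in the file.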