I exported a png from a pdf that seemed to be a scanned image of the original text. I installed the latest tesseract and leptonica via Homebrew. I then ran
tesseract Downloads/foundations-of-mathematics.tiff foundations-of-mathematics and it consistently outputs the first page only. On Thursday, August 8, 2019 at 11:58:57 PM UTC-7, zdenop wrote: > > Provide exact information what you did. > Make sure you use the latest tesseract and leptonica. > > Zdenko > > > pi 9. 8. 2019 o 7:41 ilevy <textr...@gmail.com <javascript:>> napĂsal(a): > >> I'm trying tesseract for the first time with a png of a multipage >> document I saved out of a pdf (which itself was just an image). >> >> When I run tesseract, I get an output of the first page, but that's all. >> I notice that there's a control-L (^L) at the end of the text file. >> >> How do I get the entire file output to txt? >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesser...@googlegroups.com <javascript:>. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/4067da33-b1d1-4bbe-9909-9b5552c49549%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/4067da33-b1d1-4bbe-9909-9b5552c49549%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/802fb21f-602c-473a-8675-54f98df3e11a%40googlegroups.com.