I exported a png from a pdf that seemed to be a scanned image of the 
original text. I installed the latest tesseract and leptonica via Homebrew. 
I then ran

tesseract Downloads/foundations-of-mathematics.tiff 
foundations-of-mathematics

and it consistently outputs the first page only.

On Thursday, August 8, 2019 at 11:58:57 PM UTC-7, zdenop wrote:
>
> Provide exact information what you did.
> Make sure you use the latest tesseract and leptonica.
>
> Zdenko
>
>
> pi 9. 8. 2019 o 7:41 ilevy <textr...@gmail.com <javascript:>> napĂ­sal(a):
>
>> I'm trying tesseract for the first time with a png of a multipage 
>> document I saved out of a pdf (which itself was just an image).
>>
>> When I run tesseract, I get an output of the first page, but that's all. 
>> I notice that there's a control-L (^L) at the end of the text file.
>>
>> How do I get the entire file output to txt?
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesser...@googlegroups.com <javascript:>.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/4067da33-b1d1-4bbe-9909-9b5552c49549%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/4067da33-b1d1-4bbe-9909-9b5552c49549%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/802fb21f-602c-473a-8675-54f98df3e11a%40googlegroups.com.

Reply via email to