That's a good question. The png was exported from a pdf, so there may have been some notion of pages encoded into it, but that's a guess. What I can say is that the result is consistent. Running
tesseract Downloads/foundations-of-mathematics.tiff foundations-of-mathematics always yields the first page in foundations-of-mathematics.txt. On Thursday, August 8, 2019 at 11:28:34 PM UTC-7, ElGato ElMago wrote: > > Is it possible to have multiple pages in a png file in the first place? > > 2019年8月9日金曜日 14時41分15秒 UTC+9 ilevy: >> >> I'm trying tesseract for the first time with a png of a multipage >> document I saved out of a pdf (which itself was just an image). >> >> When I run tesseract, I get an output of the first page, but that's all. >> I notice that there's a control-L (^L) at the end of the text file. >> >> How do I get the entire file output to txt? >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3acf4d42-c94a-414a-a248-286e568f8b87%40googlegroups.com.