That's right, that initial "TO" and this is just a fraction of the text, there are dozens of examples like "TO" on a single page. But since it spreads to two lines there's nothing I can do I assume?
On Tuesday, August 4, 2020 at 7:39:21 PM UTC+2 zdenop wrote: > Not sure what do you mean... > > tesseract big_low.jpeg - --psm 6 > Warning: Invalid resolution 0 dpi. Using 70 instead. > FY, MINERS.—TO LET, ON LEASE, on such terms as may > be agreed on, the MINERALS in the ESTATE of KNOCKSHINNOCK, lying in > the parish of New Cumnock, and county of Ayr. Acdead vein has been lately > discovered > > Problem is there only with initial TO which is IMO caused by T with size > of two lines with following smaller size letters. > > Zdenko > > > ut 4. 8. 2020 o 13:07 tlit...@gmail.com <tlit...@gmail.com> napísal(a): > >> Hello, >> >> Is it possible to train for bigger fonts in the beginning of the >> sentences, since it seems that tesseract always misses them. >> >> Thanks in advance. >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-oc...@googlegroups.com. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/0f97a784-e8e4-4c05-8296-b95dc2211e78n%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/0f97a784-e8e4-4c05-8296-b95dc2211e78n%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/288a05c0-9c53-4729-9aa3-5b5202388a16n%40googlegroups.com.