See https://tesseract-ocr.github.io/tessdoc/FAQ.html#what-page-separators-are-used-in-txt-output-by-tesseract-400
On Thu, Feb 18, 2021, 12:15 J Cassar <johnkcas...@gmail.com> wrote: > Good Day, > > I've used tesseract on a number of jpeg images ( see input image attached) > and it works fine as it outputs the text. However it also outputs a symbol > in the next line below the text ( see output text attached) even though > there is no symbol in the image file. > > Is there a reason why a symbol is there and is there a way to prevent it > from showing up in the output file ? > > I've ran tesseract in cmd promt and in python and it always gives me a > symbol below the text. > > Thanks, Appreciate anyone assistance in this matter. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/96332d23-f52b-485d-b719-54999db5f8aan%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/96332d23-f52b-485d-b719-54999db5f8aan%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUxzp5tn5jkBWimDE31a2TK7Bt%2B%2BpMeabNmwj%2BMLKVfVQ%40mail.gmail.com.