Probably FF. Tesseract adds a page break (normally form feed) by default.
It is still possible to suppress page breaks by setting an empty page_separator. ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Fri, Feb 23, 2018 at 12:29 PM, <ada...@turningcloud.com> wrote: > Is there any way to remove the End of page symbol that appears in the > image? It looks like a box with some 000c written at the end. > > Regards > Adarsh > > > On Thursday, February 22, 2018 at 4:22:21 PM UTC+5:30, shree wrote: >> >> What --psm are you using? >> >> Tesseract might be treating the last portion as a different column. >> >> Try PSM 4 or 6. >> >> On 22-Feb-2018 3:48 PM, <ada...@turningcloud.com> wrote: >> >>> >>> <https://lh3.googleusercontent.com/-q5owZImroPI/Wo6YuWowGqI/AAAAAAAAABU/K2W_51vsFrMkC9Zxf8QrSVr2y3UEUqUgACLcBGAs/s1600/output.jpg> >>> >>> >>> <https://lh3.googleusercontent.com/-zxXVbpgcpZY/Wo6YMtjsdPI/AAAAAAAAABM/Xfy8Cmd_lU8GPBBUPjuqhx2pQZj7Q8qaQCLcBGAs/s1600/page-3.tif> >>> >>> >>> <https://lh3.googleusercontent.com/-zxXVbpgcpZY/Wo6YMtjsdPI/AAAAAAAAABM/Xfy8Cmd_lU8GPBBUPjuqhx2pQZj7Q8qaQCLcBGAs/s1600/page-3.tif> >>> The issue I am facing is that when i scan a file which has coumn data >>> separeated by "|" , OR, then in a single line, tesseract is printing the >>> last column data after the last line of the file. >>> I'll be attaching the image for your referral. Hope i receive some help >>> soon. The output image has the discrepancy on the last line . >>> >>> Can anyone suggest some solution. @shree much help needed. >>> >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to tesseract-oc...@googlegroups.com. >>> To post to this group, send email to tesser...@googlegroups.com. >>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>> To view this discussion on the web visit https://groups.google.com/d/ms >>> gid/tesseract-ocr/76378a71-f459-454e-9c6c-a0e3f682b1b9%40goo >>> glegroups.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/76378a71-f459-454e-9c6c-a0e3f682b1b9%40googlegroups.com?utm_medium=email&utm_source=footer> >>> . >>> For more options, visit https://groups.google.com/d/optout. >>> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To post to this group, send email to tesseract-ocr@googlegroups.com. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/8f7a5127-f9ee-40c9-abfe-7843ff4c1a71% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/8f7a5127-f9ee-40c9-abfe-7843ff4c1a71%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUAsHKZL4TuE234g0WoZ%3Dt%3DGvAfiKs4Cp4PrkxXqN7p%2BQ%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.