Is there any way to remove the End of page symbol that appears in the image? It looks like a box with some 000c written at the end.
Regards Adarsh On Thursday, February 22, 2018 at 4:22:21 PM UTC+5:30, shree wrote: > > What --psm are you using? > > Tesseract might be treating the last portion as a different column. > > Try PSM 4 or 6. > > On 22-Feb-2018 3:48 PM, <ada...@turningcloud.com <javascript:>> wrote: > >> >> <https://lh3.googleusercontent.com/-q5owZImroPI/Wo6YuWowGqI/AAAAAAAAABU/K2W_51vsFrMkC9Zxf8QrSVr2y3UEUqUgACLcBGAs/s1600/output.jpg> >> >> >> <https://lh3.googleusercontent.com/-zxXVbpgcpZY/Wo6YMtjsdPI/AAAAAAAAABM/Xfy8Cmd_lU8GPBBUPjuqhx2pQZj7Q8qaQCLcBGAs/s1600/page-3.tif> >> >> >> <https://lh3.googleusercontent.com/-zxXVbpgcpZY/Wo6YMtjsdPI/AAAAAAAAABM/Xfy8Cmd_lU8GPBBUPjuqhx2pQZj7Q8qaQCLcBGAs/s1600/page-3.tif> >> The issue I am facing is that when i scan a file which has coumn data >> separeated by "|" , OR, then in a single line, tesseract is printing the >> last column data after the last line of the file. >> I'll be attaching the image for your referral. Hope i receive some help >> soon. The output image has the discrepancy on the last line . >> >> Can anyone suggest some solution. @shree much help needed. >> >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-oc...@googlegroups.com <javascript:>. >> To post to this group, send email to tesser...@googlegroups.com >> <javascript:>. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/76378a71-f459-454e-9c6c-a0e3f682b1b9%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/76378a71-f459-454e-9c6c-a0e3f682b1b9%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/8f7a5127-f9ee-40c9-abfe-7843ff4c1a71%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.