Is there any way to remove the end-of-page symbol that appears in the 
output image? It looks like a box with "000c" written in it, at the end. 
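
A minimal sketch of one possible workaround, assuming the box labelled "000c" is the form-feed character (U+000C) that Tesseract writes at the end of each page of text output. The function name strip_page_separator is hypothetical and used only for illustration; it post-processes the recognized text and does not change Tesseract itself.

```python
# Assumption: the "000c" box is the form-feed character (U+000C) that
# appears at the end of the recognized text.
FORM_FEED = "\x0c"


def strip_page_separator(text: str) -> str:
    """Return the OCR text with every form-feed character removed."""
    return text.replace(FORM_FEED, "")


if __name__ == "__main__":
    # Hypothetical sample: one line of "|"-separated columns followed by
    # the page separator.
    sample = "col1 | col2 | col3\n" + FORM_FEED
    print(repr(strip_page_separator(sample)))
```

Tesseract 4.x builds may also accept a page_separator config variable on the command line (for example -c page_separator=""), which would avoid the post-processing step; whether that is available depends on the installed version, so it is worth checking before relying on it.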

Regards
Adarsh


On Thursday, February 22, 2018 at 4:22:21 PM UTC+5:30, shree wrote:
>
> What --psm are you using?
>
> Tesseract might be treating the last portion as a different column.
>
> Try PSM 4 or 6.
>
> On 22-Feb-2018 3:48 PM, <ada...@turningcloud.com> wrote:
>
>>
>> <https://lh3.googleusercontent.com/-q5owZImroPI/Wo6YuWowGqI/AAAAAAAAABU/K2W_51vsFrMkC9Zxf8QrSVr2y3UEUqUgACLcBGAs/s1600/output.jpg>
>>
>>
>> <https://lh3.googleusercontent.com/-zxXVbpgcpZY/Wo6YMtjsdPI/AAAAAAAAABM/Xfy8Cmd_lU8GPBBUPjuqhx2pQZj7Q8qaQCLcBGAs/s1600/page-3.tif>
>>
>> The issue I am facing is that when I scan a file that has column data 
>> separated by "|" within a single line, Tesseract prints the last 
>> column's data after the last line of the file.
>> I am attaching the image for your reference. I hope to receive some help 
>> soon. The output image shows the discrepancy on the last line.
>>
>> Can anyone suggest a solution? @shree, your help is much needed.
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/8f7a5127-f9ee-40c9-abfe-7843ff4c1a71%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
