Probably FF.

Tesseract adds a page break (normally form feed) by default.

It is still possible to suppress page breaks by setting an empty
page_separator.


ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Fri, Feb 23, 2018 at 12:29 PM, <ada...@turningcloud.com> wrote:

> Is there any way to remove the End of page symbol that appears in the
> image? It looks like a box with some 000c written at the end.
>
> Regards
> Adarsh
>
>
> On  Thursday, February 22, 2018 at 4:22:21 PM UTC+5:30, shree wrote:
>>
>> What --psm are you using?
>>
>> Tesseract might be treating the last portion as a different column.
>>
>> Try PSM 4 or 6.
>>
>> On 22-Feb-2018 3:48 PM, <ada...@turningcloud.com> wrote:
>>
>>>
>>> <https://lh3.googleusercontent.com/-q5owZImroPI/Wo6YuWowGqI/AAAAAAAAABU/K2W_51vsFrMkC9Zxf8QrSVr2y3UEUqUgACLcBGAs/s1600/output.jpg>
>>>
>>>
>>> <https://lh3.googleusercontent.com/-zxXVbpgcpZY/Wo6YMtjsdPI/AAAAAAAAABM/Xfy8Cmd_lU8GPBBUPjuqhx2pQZj7Q8qaQCLcBGAs/s1600/page-3.tif>
>>>
>>>
>>> <https://lh3.googleusercontent.com/-zxXVbpgcpZY/Wo6YMtjsdPI/AAAAAAAAABM/Xfy8Cmd_lU8GPBBUPjuqhx2pQZj7Q8qaQCLcBGAs/s1600/page-3.tif>
>>> The issue I am facing is that when i scan a file which has coumn data
>>> separeated by "|" , OR, then in a single line, tesseract is printing the
>>> last column data after the last line of the file.
>>> I'll be attaching the image for your referral. Hope i receive some help
>>> soon. The output image has the discrepancy on the last line .
>>>
>>> Can anyone suggest some solution. @shree much help needed.
>>>
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-oc...@googlegroups.com.
>>> To post to this group, send email to tesser...@googlegroups.com.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit https://groups.google.com/d/ms
>>> gid/tesseract-ocr/76378a71-f459-454e-9c6c-a0e3f682b1b9%40goo
>>> glegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/76378a71-f459-454e-9c6c-a0e3f682b1b9%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/8f7a5127-f9ee-40c9-abfe-7843ff4c1a71%
> 40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/8f7a5127-f9ee-40c9-abfe-7843ff4c1a71%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUAsHKZL4TuE234g0WoZ%3Dt%3DGvAfiKs4Cp4PrkxXqN7p%2BQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to