Try adding a slight white border to images and see if that helps.

On Fri, Jun 22, 2018 at 7:35 PM <ahka.anyrea...@gmail.com> wrote:

>
> <https://lh3.googleusercontent.com/-2rjsO_cGMOk/Wy0CCWVozgI/AAAAAAAAAAM/6vwqggQjeXs3mbfkFKvGpVxgaanAbfQUQCLcBGAs/s1600/24-block-0-L-42.png>
>
>
> <https://lh3.googleusercontent.com/-E88ArfnXFP4/Wy0CMbscrVI/AAAAAAAAAAQ/YUhFh9aYMx0_CiqhK-qBVnX3l5YsyZ6FwCLcBGAs/s1600/24-block-0-L-25.png>
>
> Thanks for the reply
> Those are two line examples.
>
> On Friday, June 22, 2018 at 3:59:23 PM UTC+2, shree wrote:
>>
>> Please try with a different psm and see if you get better results. If you
>> share a sample image we can test and respond.
>>
>> On Fri, Jun 22, 2018 at 5:29 PM <ahka.an...@gmail.com> wrote:
>>
>>> Could someone please try to give me an answer for my language.
>>>
>>> On Friday, June 15, 2018 at 2:42:00 PM UTC+2, ahka.an...@gmail.com
>>> wrote:
>>>>
>>>> Dear All,
>>>>
>>>> In the project that I am currently working in, I have a pure text line
>>>> cropped from an document image.
>>>>
>>>> As a next step, I need to recognize the text using and at the same
>>>> time, I need to get the words coordinates.
>>>>
>>>> To get that coordinates I am passing the hocr parameters to the command
>>>> line and assign the page segmentation mode to 7 (line).
>>>>
>>>> tesseract file.png out.txt --psm 7 hocr.
>>>>
>>>> However, the output is really bad because by passing these parameters,
>>>> the line will be conisders as a page and some words will not be detected at
>>>> the output.
>>>>
>>>> Is there another way to get the word coordinate of that line?
>>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-oc...@googlegroups.com.
>>> To post to this group, send email to tesser...@googlegroups.com.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/d24b268f-5cfa-4d20-89c0-9dfd2360f0dc%40googlegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/d24b268f-5cfa-4d20-89c0-9dfd2360f0dc%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>>
>> --
>>
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/38f2e418-76a3-4c0b-8ec3-71e6ebe62d83%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/38f2e418-76a3-4c0b-8ec3-71e6ebe62d83%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVP0OdjJc0Qh4YWWJjq-yWwtpU57vbMsiqD3L9eVDDUeQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to