I think PDF creation only adds a text layer, and there isn't an option to
add hOCR to it.

@jbreiden can confirm.
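
For reference, here is a rough sketch of getting both outputs side by side from one run. As far as I know, tesseract accepts several config names in one invocation and appends the extension to the output base itself, so the base is given without ".pdf". The helper names (build_tesseract_cmd, expected_outputs, run_ocr) are made up for illustration and are not part of tesseract, and the filename mapping is only assumed for the "hocr" and "pdf" configs.

```python
import subprocess
from pathlib import Path


def build_tesseract_cmd(image, out_base, lang="eng", configs=("hocr", "pdf")):
    """Build the argv list for a single tesseract run with several outputs.

    Hypothetical helper: out_base is passed WITHOUT an extension because
    tesseract adds one per config (e.g. scanned.hocr and scanned.pdf).
    """
    return ["tesseract", str(image), str(out_base), "-l", lang, *configs]


def expected_outputs(out_base, configs=("hocr", "pdf")):
    """Files we expect afterwards; assumed to hold for 'hocr' and 'pdf' only."""
    return [Path(f"{out_base}.{name}") for name in configs]


def run_ocr(image, out_base, lang="eng"):
    """Run tesseract (must be on PATH) and return the expected output paths."""
    subprocess.run(build_tesseract_cmd(image, out_base, lang), check=True)
    return expected_outputs(out_base)
```

Calling run_ocr("scannedFile.png", "scanned") should leave scanned.hocr next to scanned.pdf. The PDF still carries only its own text layer; the hOCR file is a separate output and is not merged into it.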

On Mon, Sep 17, 2018 at 6:10 PM, Monica <monicakumari...@gmail.com> wrote:

> I have tried this, but it shows the default behaviour. I think the default
> output is what gets overlaid on the PDF, not the hOCR output.
>
>
> On Mon, Sep 17, 2018 at 5:47 PM Monica <monicakumari...@gmail.com> wrote:
>
>> Thanks, Zdenko, for your response.
>> Will "tesseract scannedFile.png scanned.pdf -l eng hocr pdf" overlay on
>> the PDF file?
>>
>> On Mon, Sep 17, 2018 at 5:44 PM Zdenko Podobny <zde...@gmail.com> wrote:
>>
>>> Something like this?
>>>
>>> tesseract scannedFile.png scanned.pdf -l eng hocr pdf
>>>
>>> Zdenko
>>>
>>>
>>> On Mon, Sep 17, 2018 at 2:12 PM monica kumari <monicakumari...@gmail.com>
>>> wrote:
>>>
>>>> To OCR a scanned PDF, it is first converted to an image, then OCRed,
>>>> which gives a temporary file in PDF/text format that is overlaid on the
>>>> original scanned PDF.
>>>> I want the output format to be hOCR. For this, I ran the command
>>>> "convert scannedFile.pdf scannedFile.png" and then "tesseract
>>>> scannedFile.png scanned.pdf -l eng hocr".
>>>> I got the hOCR format as output.
>>>> Now I need help overlaying it on the scanned PDF file.
>>>>
>>>> Does anybody have any idea how to do this?
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to tesseract-ocr+unsubscr...@googlegroups.com.
>>>> To post to this group, send email to tesseract-ocr@googlegroups.com.
>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>> To view this discussion on the web visit https://groups.google.com/d/
>>>> msgid/tesseract-ocr/c5b4f9c7-67e5-41d8-8c24-b4e5e4c39ed3%
>>>> 40googlegroups.com
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/c5b4f9c7-67e5-41d8-8c24-b4e5e4c39ed3%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>> .
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>



-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
