I have tried this, but this is showing the default behaviour. I think the
default output is overlaying on pdf instead of hocr out.


On Mon, Sep 17, 2018 at 5:47 PM Monica <monicakumari...@gmail.com> wrote:

> Thanks Zdenko for you response.
> will "tesseract scannedFile.png scanned.pdf -l eng hocr pdf" overlay on
> pdf file ?
>
> On Mon, Sep 17, 2018 at 5:44 PM Zdenko Podobny <zde...@gmail.com> wrote:
>
>> Something like this?
>>
>> tesseract scannedFile.png scanned.pdf -l eng hocr pdf
>>
>> Zdenko
>>
>>
>> po 17. 9. 2018 o 14:12 monica kumari <monicakumari...@gmail.com>
>> napĂ­sal(a):
>>
>>> for OCRing a scanned pdf,
>>> first it is converted to image format then OCRed and gives a temperory
>>> file of pdf/text format and overlays on original scanned pdf.
>>> I want the output format to be hocr. for this, I ran the command
>>> "convert scannedFile.pdf scannedFile.png" and then "tesseract
>>> scannedFile.png scanned.pdf -l eng hocr"
>>> I got the hocr fomat as output.
>>> Now I need a help to overlay it on scannned pdf file.
>>>
>>> Anybody have any idea about it ?
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-ocr+unsubscr...@googlegroups.com.
>>> To post to this group, send email to tesseract-ocr@googlegroups.com.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/c5b4f9c7-67e5-41d8-8c24-b4e5e4c39ed3%40googlegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/c5b4f9c7-67e5-41d8-8c24-b4e5e4c39ed3%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to tesseract-ocr+unsubscr...@googlegroups.com.
>> To post to this group, send email to tesseract-ocr@googlegroups.com.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8xGPRx0ZriLS%2BH7kyNHEFaAFHweKJc5KhycfLKT87XG8A%40mail.gmail.com
>> <https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8xGPRx0ZriLS%2BH7kyNHEFaAFHweKJc5KhycfLKT87XG8A%40mail.gmail.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAPgEwRjWnOe%3DXwxbZp_F9ZUFFPVDtDztcTiq%3DRyychterctsVQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to