Re: [tesseract-ocr] Tesseract 4.0 - Multiline text

2022-03-23 Thread Shree Devi Kumar
Use the hocr option. On Thu, Mar 24, 2022, 10:52 Muraliraj DK wrote: > I am not sure if you have looked at the image. What i meant on Multi line > text is when the sentence is wrapped to next line i would like to extract > as single sentence instead of 2 lines (paragraph). > > Single line is - s

Re: [tesseract-ocr] Tesseract 4.0 - Multiline text

2022-03-23 Thread Muraliraj DK
I am not sure if you have looked at the image. What i meant on Multi line text is when the sentence is wrapped to next line i would like to extract as single sentence instead of 2 lines (paragraph). Single line is - sentence which is not wrapped to next line. Tessearct can read line by line but i

[tesseract-ocr] api->GetUTF8Text() error

2022-03-23 Thread flavi...@gmail.com
I have implemented OCR with Tesseract 4.1 (tesseract41.lib/tesseract41.dll), used on a Win10, 64 bit. Tesseract library also compiled on 64 bit, of course. I created a VC++ VS2017 test project, with the following code: do { std::shared_ptr api = std::make_shared

Re: [tesseract-ocr] Sub-second OCR?

2022-03-23 Thread Zdenko Podobny
https://github.com/tesseract-ocr/tessdoc/blob/main/Benchmarks.md Zdenko st 23. 3. 2022 o 7:07 Landon Barnickle napĂ­sal(a): > What is the expected for a standard full page 300DPI (eng-fast) on a > single core? > Does any one have links to benchmarks or a guide for maximizing > performance? > >