[tesseract-ocr] Not getting complete text from JPG

2019-07-01 Thread Leena
Dear Developer, Attached code, input image and output file that is used to extract text from input image. Used tessract OCR to extract the text from flow diagram, but not getting text of each box, few are extracted. Please analyse and suggest changes to get full text. Thanks -- You received

[tesseract-ocr] Bug in tesseract - eat too much RAM in windows

2019-07-01 Thread _ Flaviu
I am using tesseract in a VC++ app. Which goes fine, but if I use this library on and on without close the app, the RAM size is increasing a lot. Actually, if I read 10 images, I am not close the app, and then I read other 10 images, and then other 10 images, and so on, my app is taken huge

[tesseract-ocr] Choice Iterator only shows one choice for each character

2019-07-01 Thread Jochen Naumann
Hi, I am using the official api example for iterating over the choices for characters and getting their confidence levels. https://github.com/tesseract-ocr/tesseract/wiki/APIExample#example-of-iterator-over-the-classifier-choices-for-a-single-symbol However if I run the program, the iterator only

Re: [tesseract-ocr] Choice Iterator only shows one choice for each character

2019-07-01 Thread Shree Devi Kumar
Take a look at https://github.com/tesseract-ocr/tesseract/blob/ab09b09da66f458002f01d0bc4ffeee8eff58f6e/src/ccmain/tesseractclass.cpp#L524 On Mon, Jul 1, 2019 at 2:45 PM Jochen Naumann wrote: > Hi, I am using the official api example for iterating over the choices for > characters and getting th

[tesseract-ocr] How to use TessBaseAPI::GetUnichar() safely?

2019-07-01 Thread db4
I'm trying to get the actual charset loaded by Tesseract from a .traineddata file. I can call TessBaseAPI::GetUnichar(id++) in a loop but how to find what the maximum acceptable id is? TessBaseAPI::GetUnichar() implementation is just const char* TessBaseAPI::GetUnichar

Re: [tesseract-ocr] issue #1393: Android NDK: LOCAL_MODULE definition in jni/Android.mk must not contain space

2019-07-01 Thread JB Data31
As a first step of a *tesseract* *androïd* use, I try/achieve the build of *tesseract* and execute with a shell on my device. That's not a stock way to execute app on Androïd, but very instructive. According NDK guides , § *C++ STL

Re: [tesseract-ocr] Choice Iterator only shows one choice for each character

2019-07-01 Thread Jochen Naumann
Thanks, this seems to be what I need. But how do I set this lstm_choice_mode with the api? Am Montag, 1. Juli 2019 11:55:02 UTC+2 schrieb shree: > > Take a look at > https://github.com/tesseract-ocr/tesseract/blob/ab09b09da66f458002f01d0bc4ffeee8eff58f6e/src/ccmain/tesseractclass.cpp#L524 > > On

Re: [tesseract-ocr] GPU for Tesseract

2019-07-01 Thread JB Data31
A recent post about GPU / OpenCL. Le vendredi 28 juin 2019 13:32:30 UTC, Timothy Snyder a écrit : > > I think it means that Tesseract doesn't support nor require hardware > acceler

Re: [tesseract-ocr] Custom Tiff/Box pairs support in tesstrain.sh

2019-07-01 Thread hrishikesh kaulwar
Okay thanks for the suggestion. I will try it. On Friday, June 21, 2019 at 2:15:28 PM UTC+5:30, shree wrote: > > Dewarp the image for better recognition, without training. > > I used scantailor. > > tesseract dewarp.tif - --psm 6 > Page 1 > SAFE SURGERY CHECKLIST > I dapted from WHO Safe Surgery

[tesseract-ocr] how to improve accuracy around image border

2019-07-01 Thread Jingjing Lin
I noticed that tesseract can give very bad result near the border of an image. What is causing this? What can we do to improve the accuracy at image border? Example as attached. As you can see the result for the first four lines of the image is very bad. The result around the left border is also

[tesseract-ocr] use tesseract for handwritten text recognition

2019-07-01 Thread swayam pal
How to use tesseract 4.0 for handwritten text recognition. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post t