[tesseract-ocr] Re: Training error "Couldn't find a matching blob"

2018-06-04 Thread Paul Kitchen
ZDenko, I checked out the latest tesseract code and updated to branch 3.05. I see that the int64_t area bug is already fixed (thanks!). I also see that the buffer read overrun is partially fixed. There is this line in ReadAllBoxes(): box_data.push_back('\0'); Since the memory will have to be

Re: [tesseract-ocr] Re: Training error "Couldn't find a matching blob"

2018-06-04 Thread Paul Kitchen
Here is a sample of the problem it causes. I run the following to train the attached image and box file: tesseract gdt.symbols.exp0.tif gdt.symbols.exp0 box.train And here is the output: Tesseract Open Source OCR Engine v3.05.00dev with Leptonica Page 1 Bad box coordinates in boxfile string! ²

Re: [tesseract-ocr] Re: Preprocess Image

2018-06-04 Thread ShreeDevi Kumar
Take a look at http://www.fmwconcepts.com/imagemagick/textcleaner/ and other scripts by Fred ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Mon, Jun 4, 2018 at 10:52 PM, Hongguo An wrote: > Can anybody help? thanks i

[tesseract-ocr] Re: Preprocess Image

2018-06-04 Thread Hongguo An
Can anybody help? thanks in advance On Thursday, May 31, 2018 at 12:57:20 PM UTC-7, Hongguo An wrote: > > > > Hi: > When trying to OCR the above image, the dat

Re: [tesseract-ocr] Re: Training error "Couldn't find a matching blob"

2018-06-04 Thread Zdenko Podobny
Stefan, Paul suggest to modified also LoadDataFromFile (ccutil/genericvector.h). That modification is not needed? Zdenko po 4. 6. 2018 o 17:32 'Stefan Weil' via tesseract-ocr < tesseract-ocr@googlegroups.com> napísal(a): > As far as I see 4.0.0 is good. I have sent a pull request which backpor

[tesseract-ocr] Re: Problem compiling tesseract 4.0 on macOS

2018-06-04 Thread shree
please see https://github.com/tesseract-ocr/tesseract/issues/1028#issuecomment-394415918 On Monday, June 4, 2018 at 7:33:19 PM UTC+5:30, Ning Zhao wrote: > > I'm trying to install tesseract 4.0 on macOS High Sierra from source > following https://github.com/tesseract-ocr/tesseract/wiki/Compilin

Re: [tesseract-ocr] Re: Training error "Couldn't find a matching blob"

2018-06-04 Thread 'Stefan Weil' via tesseract-ocr
As far as I see 4.0.0 is good. I have sent a pull request which backports the fix from 4.0.0 (a simplified variant of Paul's fix) to 3.05. Stefan -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receivin

[tesseract-ocr] Problem compiling tesseract 4.0 on macOS

2018-06-04 Thread Ning Zhao
I'm trying to install tesseract 4.0 on macOS High Sierra from source following https://github.com/tesseract-ocr/tesseract/wiki/Compiling#macos (macOS with Homebrew part). When I did the `make -j` part, after popping up over 1000 lines of output, it gave me these lines at the end: 1296 /bin/sh