Re: [tesseract-ocr] Tips and advice for preprocessing images before feeding them to tesseract.

2019-04-15 Thread Shilpa Shinde
I am trying to train tesseract , but not getting any proper documentation to do that ? On Mon, Apr 15, 2019 at 7:58 PM JB Data31 wrote: > I only focus on *example_02.jpg*. > > 1/ *example_02_0.jpg *→ tesseract → *02_0.txt* shows : > The target. > *--psm 6* because block of text. > It's not real

Re: [tesseract-ocr] Tips and advice for preprocessing images before feeding them to tesseract.

2019-04-15 Thread Lorenzo Bolzani
This is very hard to do reliably for general images. You may use something like EAST to detect text regions, then a few tests to understand if it's black on white text or the opposite. Then you can crop the image and rescale it to a standard size (this may not be the final size you'll feed to tess

Re: [tesseract-ocr] How to create a box file

2019-04-15 Thread Shree Devi Kumar
see https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#creating-training-data On Mon, Apr 15, 2019 at 6:34 PM anne wrote: > Hi, I'm really new to Tesseract and I want to train it to recognize a new > script but until now, I still don't understand how to make a box file. I've

[tesseract-ocr] Tesseract - OpenCL installation

2019-04-15 Thread Dave
Hi everyone, I'm quite unclear as to how OpenCL is installed and there's no clear instructions anywhere (specifically I want to install OpenCL to speed up tesseract) As I understand it the nvidia drivers and cuda toolkit come with components needed for OpenCL, but do they contain the headers

[tesseract-ocr] How to create a box file

2019-04-15 Thread anne
Hi, I'm really new to Tesseract and I want to train it to recognize a new script but until now, I still don't understand how to make a box file. I've tried running this example from github https://github.com/tesseract-ocr/tesseract/wiki/Training-Tesseract-%E2%80%93-Make-Box-Files Run Tessera