Re: [tesseract-ocr] Recognize music symbols

2014-08-21 Thread Sriranga(80yrs)
I remember there was discussion on subject in the forum - search for the same. On Fri, Aug 22, 2014 at 8:50 AM, Kiran Babu G wrote: > Hi, > > I want to recognize music symbols with tesseract along with english. > Ideally, I would want to modify existing english trained data to add the > additi

[tesseract-ocr] Recognize music symbols

2014-08-21 Thread Kiran Babu G
Hi, I want to recognize music symbols with tesseract along with english. Ideally, I would want to modify existing english trained data to add the additional symbols. I understand that there is no access to box/tiff files for the orignial english trained data. So, I tried the following: - Train m

Re: [tesseract-ocr] Re: Tesseract compilation on code blocks (gcc + mingw)

2014-08-21 Thread Nick White
On Thu, Aug 21, 2014 at 11:29:09AM -0700, shree wrote: > zdenko, > > the current problem also seems related to strtok_r > > please see > > http://stackoverflow.com/questions/12973750/ > fatal-error-strtok-r-h-no-such-file-or-directory-while-compiling-tesseract-oc > > http://sourceforge.net/p/mi

[tesseract-ocr] Re: Tesseract compilation on code blocks (gcc + mingw)

2014-08-21 Thread shree
zdenko, the current problem also seems related to strtok_r please see http://stackoverflow.com/questions/12973750/fatal-error-strtok-r-h-no-such-file-or-directory-while-compiling-tesseract-oc http://sourceforge.net/p/mingw/feature-requests/64/ On Tuesday, March 22, 2011 2:06:58 PM UTC+5:30, zd

Re: [tesseract-ocr] tesseract trained successfully but gives:Tesseract Open Source OCR Engine v3.03 with Leptonica Segmentation fault (core dumped)

2014-08-21 Thread Dovhani Foneworx
The following is the process happened during training. fonew...@foneworxtest.foneworx.co.za:~/DM/Tesseracting/TESTDIR/Test2/images2/kfc$ tesseract kfc.Times_New_Roman.exp0.tif kfc.Times_New_Roman.exp0 box.train Tesseract Open Source OCR Engine v3.03 with Leptonica row xheight=40, but median xh

Re: [tesseract-ocr] tesseract trained successfully but gives:Tesseract Open Source OCR Engine v3.03 with Leptonica Segmentation fault (core dumped)

2014-08-21 Thread Nick White
In that case it must be a problem with your training data. Can you let us know the exact commands you used to create it? Alternatively, you could post a gdb backtrace, if you know how to do that. Nick On Thu, Aug 21, 2014 at 04:19:40PM +0200, Dovhani Foneworx wrote: > Hi Nick, this happens whe

Re: [tesseract-ocr] tesseract trained successfully but gives:Tesseract Open Source OCR Engine v3.03 with Leptonica Segmentation fault (core dumped)

2014-08-21 Thread Dovhani Foneworx
Hi Nick, this happens when I test with all images. and also, I have 4 images that i have joined together to make one single big image using imagemagic, and now im testing with all those individual image and also the big image aswell. this images are till slip from same shop. same problem happen.

Re: [tesseract-ocr] tesseract trained successfully but gives:Tesseract Open Source OCR Engine v3.03 with Leptonica Segmentation fault (core dumped)

2014-08-21 Thread Nick White
Hi Dovhani, Does this happen with all images when using your training, or just one? Nick On Thu, Aug 21, 2014 at 03:03:47AM -0700, Dovhani Foneworx wrote: > Hi guys, I have a problem, I have succesfully trained tesseract 3.03 in Ubunt > 14.04 but when i run tesseract it is giving errors on an i

Re: [tesseract-ocr] Makefile:372: recipe for target 'all' failed - using current version with leptonica 1.71 on cygwin

2014-08-21 Thread Nick White
On Thu, Aug 21, 2014 at 01:41:23PM +0530, Shree Devi Kumar wrote: > Hi Zdenko, > > ./ confusing for me :-) :-) ./ is a common idiom for unix. '.' means 'current directory', so ./ means 'in the current directory'. You have to do it to run programs in the current directory (or just do something

Re: [tesseract-ocr] Can I remove some fonts from an existing traineddata?

2014-08-21 Thread Nick White
On Wed, Aug 20, 2014 at 07:39:50PM -0700, SHEN Fei wrote: > hi Nick, > > I'm trying to use tesseract in my mobile phone so the tessdata size is > critical. > Since I only care about very few fonts, it would be convenient if I could add/ > remove a special font. > > Maybe removing some dictionary

[tesseract-ocr] Re: Language file for MICR font

2014-08-21 Thread Juned Khan
Hi Santhos Here is the updated code for intialization TessBaseAPI baseApi = new TessBaseAPI(); baseApi.setDebug(true); baseApi.init(DATA_PATH, "mcr"); baseApi.setImage(bitmap); String recognizedText = baseApi.getUTF8Text(); baseApi.end();

[tesseract-ocr] Re: Language file for MICR font

2014-08-21 Thread Juned Khan
Hi Santhosh this is how I am initializing Tesseract engine. TessBaseAPI baseApi = new TessBaseAPI(); baseApi.setDebug(true); baseApi.init(DATA_PATH, lang); baseApi.setImage(bitmap); String recognizedText = baseApi.getUTF8Text(); baseApi.end(

[tesseract-ocr] tesseract trained successfully but gives:Tesseract Open Source OCR Engine v3.03 with Leptonica Segmentation fault (core dumped)

2014-08-21 Thread Dovhani Foneworx
Hi guys, I have a problem, I have succesfully trained tesseract 3.03 in Ubunt 14.04 but when i run tesseract it is giving errors on an image and the image was part of the image. I have join 4 images with imagemagice and make one big image and when i run tesseract it do the following: fonew...

Re: [tesseract-ocr] 3.03 compilation problems on FreeBSD.

2014-08-21 Thread Shree Devi Kumar
On windows the following as given in http://vorba.ch/2014/tesseract-cygwin.html worked for me. I am using leptonica 1.71 ./configure LDFLAGS=-L/usr/local/lib Shree Devi Kumar भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Thu

Re: [tesseract-ocr] Makefile:372: recipe for target 'all' failed - using current version with leptonica 1.71 on cygwin

2014-08-21 Thread Shree Devi Kumar
Hi Zdenko, ./ confusing for me :-) I tried yesterday with mingw and msys but process was hanging while compiling leptonica, so tried today with cygwin. Here is the version info under cygwin gcc version 4.8.3 (GCC) automake 1.14 autoheader 2.69 autoconf 2.69 Shree Devi Kumar _

Re: [tesseract-ocr] Makefile:372: recipe for target 'all' failed - using current version with leptonica 1.71 on cygwin

2014-08-21 Thread zdenko podobny
I was able to build current tesseract code with mingw&msys (but in xp mode on windows 7 and was not clear install). I will have a look on this later. In meantime can you provided version information about autotools, gcc etc? BTW: "/autogen.sh" did exists in your instalation. But "./autogen.sh" yes

Re: [tesseract-ocr] Makefile:372: recipe for target 'all' failed - using current version with leptonica 1.71 on cygwin

2014-08-21 Thread Shree Devi Kumar
zdenko, yes, but the file is there and I was able to run by giving sh autogen.sh Please see the messages below. Shree Devi Kumar भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Thu, Aug 21, 2014 at 12:28 PM, zdenko podobny wr

Re: [tesseract-ocr] Can I remove some fonts from an existing traineddata?

2014-08-21 Thread SHEN Fei
hi Nick, I'm trying to use tesseract in my mobile phone so the tessdata size is critical. Since I only care about very few fonts, it would be convenient if I could add/remove a special font. Maybe removing some dictionary files is an option. 在 2014年8月20日星期三UTC+8下午9时17分00秒,Nick White写道: > > Hi

[tesseract-ocr] 3.03 compilation problems on FreeBSD.

2014-08-21 Thread samuelharmer
Dear all, I'm trying to set up a document server, part of which is a Tesseract installation to convert scanned image to OCRd PDF. FreeBSD only has 3.02 and the hOCR output from that doesn't play very well with hocr2pdf from the ExactImage toolset with default settings. Seeing as 3.03 has built