Error when using trained language file: tessdata_manager.SeekToStart(TESSDATA_INTTEMP):Error:Assert failed:in file adaptmatch.cpp, line 555 - Tesseract 3.02

2013-03-04 Thread A. Naut
I'm trying to train the attached files (Tesseract 3.02, following the instructions at http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 ) , and although I can compete the training process successfully I can't get tesseract to work with the produce trainneddata file - I always recei

Using Tesseract OCR in C++ development environment

2013-03-04 Thread Oluwatobi Williams
Hi, I'm working on a C++ project on Ubuntu Linux 12.04 LTS. I'm using GNU GCC C++ compiler and I want to integrate tesseract ocr into my code without working with the command line. I would appreciate help regarding this matter. Thanks. -- -- You received this message because you are subscri

Re: How to get desire words coordinate from characters coordinate

2013-03-04 Thread Zdenko Podobný
Depending on your skills: a) You can analyze space between boxes to identify words (if you want to use box file) b) You can parse tesseract hocr output (if you have no clue what is hocr, search in this forum) c) You can use C++/C API of tesseract to create your own output - have a look at hocr

Re: "Unable to load library 'libtesseract302': The specified module could not be found." error

2013-03-04 Thread Sudarshan Sreenivasan
Any work around for this ? On Saturday, 19 January 2013 01:42:00 UTC+5:30, Quan Nguyen wrote: > > JVM 64-bit cannot load Tesseract and Leptonica 32-bit DLLs. You would need > JVM 32-bit. > > On Friday, January 18, 2013 8:11:56 AM UTC-6, Deniz Atak wrote: >> >> Hi, >> >> I am trying to run Tess4J

Re: How to get desire words coordinate from characters coordinate

2013-03-04 Thread SUBHADIP SINHA
Please help me if anybody know the solution !!! THANK YOU .. On Sunday, March 3, 2013 12:32:32 PM UTC+5:30, SUBHADIP SINHA wrote: > > Hi,ALL > > I finally got the .box file with all characters coordinate from .png > file,Now i want to group the charecters from the .box file with words > and n