[tesseract-ocr] Tesseract 3.05 Linking error on VS2015 x64

2017-06-20 Thread Kramer Baretta
Hi, I just ran the test code and got a linking error: Error LNK2001 unresolved external symbol "public: void __cdecl tesseract::TessBaseAPI::SetImage(struct Pix const *)" (?SetImage@TessBaseAPI@tesseract@@QEAAXPEBUPix@@@Z) I checked all the dependencies and what I added are: leptonica-1.74.2d

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread shree
I got the same error building 3.05.01 and have filed it as an issue - https://github.com/tesseract-ocr/tesseract/issues/1000 -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread ShreeDevi Kumar
Master branch currently includes the legacy engine. So you can easily build your custom traineddata using the following command (modify it for your fonts location, training text, font name, etc.):
training/tesstrain.sh \
  --fonts_dir ~/.fonts \
  --tessdata_dir ../tessdata \
  --training_text ../la

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread David Barishev
After several tests, I have found mixed results. If I download leptonica 1.74.4, build it, and then build the master branch, it works fine. With the same version of leptonica, the 3.05.01 release fails with the following error: libtool: link: g++ -g -O2 -std=c++11 -o .libs/tesseract tesseract-

Re: [tesseract-ocr] bad result on tesseract(4.0) with lstm

2017-06-20 Thread ShreeDevi Kumar
Your input image quality needs to be improved. Also test with --oem 1 alone. Please test with https://github.com/tesseract-ocr/tesseract/blob/master/testing/hebtypo.jpg and see if you get similar results. For hocr, just adding hocr to the command line should work - as long as you have the hocr c

[tesseract-ocr] bad result on tesseract(4.0) with lstm

2017-06-20 Thread לאה למד
Hi. * Attached: a line from the original image. Command: *tesseract file.tiff output --oem 2 -l heb --psm 6* Result: *"אומדן / שווי ההתקשרות: 6 ₪ לפני מע"מ. ₪"* Command: *tesseract file.tiff output --oem 0 -l heb --psm 6* Result: *"אןמדן ושווי ההתקשרות: 16,656 ₪ לפניימע"מ. ₪”"* So for people that
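The two commands above differ only in the --oem value. As a minimal sketch (not taken from the thread), the same run can be generated for several engine modes so the outputs are easy to compare side by side. The function name `oem_commands` and the `_oemN` output suffix are hypothetical; the flags are the ones already used in the message.

```python
def oem_commands(image_path, output_base, lang, psm, oems=(0, 1, 2)):
    """Build one tesseract command line per engine mode.

    In Tesseract 4, --oem 0 is the legacy engine, 1 is LSTM only,
    and 2 combines legacy and LSTM.
    """
    commands = []
    for oem in oems:
        commands.append([
            "tesseract", image_path,
            "%s_oem%d" % (output_base, oem),  # separate output per mode
            "--oem", str(oem),
            "-l", lang,
            "--psm", str(psm),
        ])
    return commands
```

Each returned list can be passed to `subprocess.run` unchanged; writing to a different output base per mode keeps one run from overwriting another.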

[tesseract-ocr] Re: How to make Web demo of Tesseract OCR USING vb.net or c# please assist

2017-06-20 Thread Memo Arfaa
On Friday, June 2, 2017 at 11:43:35 AM UTC+2, Mandeep Singh wrote:
> Hello Everyone,
> I want to make a web application using Visual Studio 2012 with VB.net or C#. Is there any example?
https://sourceforge.net/p/vietocr/code/HEAD/tree/

[tesseract-ocr] run multiple tesseract instances in parallel not returning any result

2017-06-20 Thread Mohamed KAFFEL
Hello, I'm writing a Python script where I use the multiprocessing library to launch multiple tesseract instances in parallel. When I make multiple calls to tesseract in sequence using a loop, it works. However, when I try to parallelize the code, everything looks fine but I'm not getting any results.
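The poster's script is not shown, so the cause cannot be confirmed here. One common reason for "no results" in parallel code is that workers never return their output to the parent. The sketch below is one way to structure it, under assumptions: it swaps in a thread pool instead of the multiprocessing library (each worker only waits on an external process), and it uses `stdout` as the output base so tesseract prints the text. The helper names `build_command`, `run_one`, and `run_all` are hypothetical.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def build_command(image_path, lang="eng"):
    # "stdout" as the output base makes tesseract print the recognized
    # text instead of writing <outputbase>.txt.
    return ["tesseract", image_path, "stdout", "-l", lang]


def run_one(command):
    # Capture the output and return it to the caller; a worker that
    # drops its result looks like "no results" in the parent.
    done = subprocess.run(command, stdout=subprocess.PIPE,
                          stderr=subprocess.PIPE, universal_newlines=True)
    return done.returncode, done.stdout, done.stderr


def run_all(commands, workers=4):
    # pool.map() yields results in the same order as the input commands.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_one, commands))
```

A call such as `run_all([build_command(p) for p in paths])` returns one `(returncode, stdout, stderr)` tuple per image; checking `returncode` and `stderr` shows whether a silent failure is hiding behind the empty result.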

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread ShreeDevi Kumar
> Do you know why my tesseract isn't compiling? I would really love an updated version on my Ubuntu.
Not sure. I haven't built the 3.05 branch. For master, I follow the usual autotools method. Have you also built leptonica? Make sure you don't have any old leptonica version already. Make sure you use

[tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread David Barishev
Thank you so much for your help. I found my error: I need to set the script dir to the langdata folder when running set_unicharset_properties. Do you know why my tesseract isn't compiling? I would really love an updated version on my Ubuntu. Thank you again. On Tuesday, June 20, 2017 at 6:59:58 AM

Re: [tesseract-ocr] Tesseract 4.00.00alpha Windows doesn't find image files

2017-06-20 Thread ShreeDevi Kumar
Please show the command line you used followed by the error. You may have to put the filename in quotes if there are spaces in it. On Mon, Jun 19, 2017 at 9:32 PM, J. Karjalaine
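To illustrate the quoting advice, here is a small sketch assuming the command is launched from Python on Windows. Passing arguments as a list lets `subprocess` add the quotes; `subprocess.list2cmdline` (a helper in the standard library, though not part of its documented public API) shows the command line that results. The file name is hypothetical.

```python
import subprocess

# Hypothetical file name containing a space.
args = ["tesseract", "my scan.tif", "out"]

# Shows how the list is joined into a Windows command line:
# the argument with a space is wrapped in double quotes.
command_line = subprocess.list2cmdline(args)
print(command_line)
```

With list arguments, `subprocess.run(args)` passes "my scan.tif" to tesseract as a single argument, so no manual quoting is needed.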

Re: [tesseract-ocr] How to improve the recognition of receipt (text not in words dictionary)

2017-06-20 Thread ShreeDevi Kumar
Please see https://github.com/tesseract-ocr/tesseract/issues/960#issuecomment-305966719. On stable 3.0x, you can try adding your product catalog to the eng.user-words file and check for improvement. In my unit test, it seemed to apply the words from the user dict. Alternatively, you can also try with the
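A rough sketch of turning a product catalog into a word list for eng.user-words, assuming the file is a plain text list with one word per line. The function names and the choice to split product names on whitespace are illustrative assumptions, not something stated in the message.

```python
import re


def catalog_to_user_words(product_names):
    # Split each product name on whitespace and keep unique tokens.
    words = set()
    for name in product_names:
        for token in re.split(r"\s+", name.strip()):
            if token:
                words.add(token)
    return sorted(words)


def write_user_words(words, path):
    # One word per line, UTF-8, with plain "\n" line endings.
    with open(path, "w", encoding="utf-8", newline="\n") as handle:
        for word in words:
            handle.write(word + "\n")
```

For example, `write_user_words(catalog_to_user_words(names), "eng.user-words")` writes the list; where the file has to be placed for tesseract to pick it up is covered in the linked issue comment rather than here.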