[tesseract-ocr] Re: Tesseract 4.0.0 released

2018-10-30 Thread DMGG
Great news, thank you all. On Monday, October 29, 2018 at 7:02:30 AM UTC-4, zdenop wrote: > > Hello all, > > I am proud to announce that tesseract OCR engine version 4.0.0 ( LSTMs > based) was released today. > See online Release notes [1]. > Source code can be downloaded from GitHub [2]. > Know

Re: [tesseract-ocr] Re: Heads up: release of tesseract 4.0

2018-10-30 Thread flaviumarc
I have now compiled the tesseract library with cppan, and I have found a test app with this source code: /* dependencies: pvt.cppan.demo.google.tesseract.libtesseract: master pvt.cppan.demo.danbloomberg.leptonica: 1 */ #include #include #include // leptonica main header for image i

Re: [tesseract-ocr] Re: Heads up: release of tesseract 4.0

2018-10-30 Thread Zdenko Podobny
First learn to write forum e-mails! Stop stealing email threads. Your questions/problems have nothing to do with the content of the original posting. Zdenko On Tue, 30 Oct 2018 at 9:12, wrote: > I have compiled now the tesseract library, with cppan. > > and I have found a test app, with this source code

Re: [tesseract-ocr] Re: Heads up: release of tesseract 4.0

2018-10-30 Thread flaviumarc
Ok, sorry. I will update the original post. On Tuesday, October 30, 2018 at 11:04:08 AM UTC+2, zdenop wrote: > > First learn to write forum e-mails! Stop stealing email threads. > Your questions/problems have nothing to do with the content of the original posting. > > Zdenko > > > Tue, 30 Oct 2018 at 9:12 >

Re: [tesseract-ocr] pixRead problem

2018-10-30 Thread flaviumarc
Thank you, zdenop. I have solved it after all. Flaviu. On Wednesday, October 17, 2018 at 12:38:26 PM UTC+3, flavi...@gmail.com wrote: > > Yes, could be simple, but perhaps you have something installed which I > have not ... I guess ... > > On Tuesday, October 16, 2018 at 7:30:13 PM UTC+3

Re: [tesseract-ocr] pixRead problem

2018-10-30 Thread flaviumarc
Thank you, Zdenop, for your support; I have solved it. On Tuesday, October 16, 2018 at 7:30:13 PM UTC+3, zdenop wrote: > > I do not use vcpkg. I suggest you to use cppan (you need to install it > and put to path). For me it stupidly easy and it takes cca 15 minutes on my > computer and internet netw

[tesseract-ocr] tesstrain.sh with hundreds of fonts

2018-10-30 Thread benda . krisztian
I would like to train Tesseract with hundreds of my own fonts. The font file names are numbers, in the form "22.ttf". To create the traineddata I run the tesstrain.sh script like this: tesseract-ocr/tesseract/src/training/tesstrain.sh \ --fonts_dir processed_fonts \ --lang eng \ --langdata_d

Re: [tesseract-ocr] tesstrain.sh with hundreds of fonts

2018-10-30 Thread Shree Devi Kumar
Please check the log file in the tmp directory. There might be some font-related errors there. A Pango-related change to font processing was made recently. Please check the change log. On Tue, 30 Oct 2018, 09:10 , wrote: > I would like to train the tesseract with hundreds of my fonts. My

Re: [tesseract-ocr] How to improve the quality of Training From Scratch

2018-10-30 Thread Shree Devi Kumar
Please read the wiki page regarding training 4.0 and the presentation files in docs by Ray Smith. On Tue, 30 Oct 2018, 02:32 bruce, wrote: > thank you for your reply ,shree. > I've seen the training_text and the list of fonts. > I will try again. > Before I start my next Scratch training,I want

[tesseract-ocr] How do I train tesseract 4 for the font Comic Sans MS?

2018-10-30 Thread 'rely LIVE' via tesseract-ocr
Hello, I want to train the default eng.traineddata for the font "Comic Sans MS". Is it possible at all? Which files do I need and where do I get them? I already installed tesseract 4 on Ubuntu 18.04 and can do simple OCR. What are the necessary commands to do training? I know from the basic tuto

Re: [tesseract-ocr] How do I train tesseract 4 for the font Comic Sans MS?

2018-10-30 Thread Shree Devi Kumar
See https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#fine-tuning-for-impact Use the Comic Sans font instead of Impact to fine-tune. On Tue, 30 Oct 2018, 12:32 'rely LIVE' via tesseract-ocr, < tesseract-ocr@googlegroups.com> wrote: > Hello, > > I want to train the default eng.tra

[tesseract-ocr] New rus/eng traineddata for tes4

2018-10-30 Thread vngorunov via tesseract-ocr
Hi all! We are building a Kofax-like system named soica, and we use Tesseract 4. It works well now, but there are still problems with Russian OCR and with mixed rus/eng text. Could you say whether new traineddata will be available soon?