Re: [tesseract-ocr] Re: Need Help with extracting info from Invoice

2018-01-10 Thread Afreen Ferdoash
I am trying to solve a similar problem, that of reading forms. Tesseract 4 is doing well but is DROPPING lots of words within boxes. I thought this problem of dropping words existed with Indic languages, but here I am having this issue for English too! I tried to fool around with some paramet

Re: [tesseract-ocr] Re: Need Help with extracting info from Invoice

2018-01-10 Thread Afreen Ferdoash
It is still not making any difference.
On Wednesday, January 10, 2018 at 9:27:20 PM UTC+5:30, shree wrote:
> On Wed, Jan 10, 2018 at 8:07 PM, Afreen Ferdoash wrote:
>> I am trying to solve a similar problem, that of reading forms. Tesseract 4 is doing we

[tesseract-ocr] Re: Failed to continue from: /home/robert/tesstutorial/trainplusminus/eng.lstm

2018-01-12 Thread Afreen Ferdoash
In case you are still stuck here, I fixed this issue by downloading the latest version of eng.traineddata. See https://github.com/tesseract-ocr/tesseract/issues/1069
On Monday, August 7, 2017 at 11:17:06 AM UTC+5:30, roberty...@gmail.com wrote:
> Hello,
>
> I'm trying to train the traineddata

Re: [tesseract-ocr] Re: Need Help with extracting info from Invoice

2018-01-12 Thread Afreen Ferdoash
The dropping-words issue was resolved by using the default psm mode. I had been using psm 6 earlier.
On Wednesday, January 10, 2018 at 9:27:20 PM UTC+5:30, shree wrote:
> On Wed, Jan 10, 2018 at 8:07 PM, Afreen Ferdoash wrote:
>> I am trying to solve a similar problem,
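
For anyone comparing the two page segmentation modes mentioned in this thread, here is a minimal sketch in Python that builds the tesseract command line with and without an explicit `--psm` value. It assumes the `tesseract` binary is on the PATH; the helper names and the image file name used below are hypothetical and not from the original messages. Leaving `--psm` out lets tesseract use its default mode (3, fully automatic page segmentation), while psm 6 tells it to assume a single uniform block of text.

```python
import subprocess


def build_tesseract_cmd(image_path, psm=None):
    """Build a tesseract CLI command that writes recognized text to stdout.

    If psm is None, no --psm flag is passed and tesseract uses its default.
    """
    cmd = ["tesseract", image_path, "stdout"]
    if psm is not None:
        cmd += ["--psm", str(psm)]
    return cmd


def run_ocr(image_path, psm=None):
    """Run tesseract on the image and return the recognized text."""
    result = subprocess.run(
        build_tesseract_cmd(image_path, psm),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def word_count(text):
    """Count whitespace-separated words, a rough proxy for dropped words."""
    return len(text.split())


def compare_modes(image_path):
    """Return word counts for the default mode and for psm 6."""
    return {
        "default": word_count(run_ocr(image_path)),
        "psm6": word_count(run_ocr(image_path, psm=6)),
    }
```

Calling `compare_modes("form.png")` on a scanned form (the file name is only a placeholder) gives a quick indication of whether one mode is dropping more words than the other. A word count is only a crude check, so the actual text should still be inspected.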