[tesseract-ocr] Generating traindata

2019-07-02 Thread Purushotham Rao Eravalli
I am getting the below error while trying to run the command *tesstrain.sh --fonts_dir ../../fonts --fontlist 'ABCDEFGHIJKLM' --lang eng --linedata_only --langdata_dir ../../langdata_lstm --tessdta_dir ../../tessdata --save_box_tiff --maxpages 10 --output_dir train* */usr/local/bin/tesstrain_ut

Re: [tesseract-ocr] Generating traindata

2019-07-02 Thread Purushotham Rao Eravalli
3.03 - 3.05 On Tuesday, July 2, 2019 at 8:16:44 PM UTC+5:30, zdenop wrote: > > Do you use the latest code? Or which version you used? > > Zdenko > > > ut 2. 7. 2019 o 16:44 Purushotham Rao Eravalli > napísal(a): > >> I am getting the below error while trying to

[tesseract-ocr] Re: Generating traindata

2019-07-02 Thread Purushotham Rao Eravalli
, 2 июля 2019 г., 17:44:09 UTC+3 пользователь Purushotham Rao > Eravalli написал: >> >> I am getting the below error while trying to run the command >> *tesstrain.sh --fonts_dir ../../fonts --fontlist 'ABCDEFGHIJKLM' --lang >> eng --linedata_only

[tesseract-ocr] retrained file after fine tuning the tesseract

2019-07-08 Thread Purushotham Rao Eravalli
I have retrained the the tesseract model for particular font. The base trained file "eng.traineddata" file was approx 5MB. But when I retrained the model it got increased to 15MB. Do anyone know the reason for this, Can anyone say where am i doing the mistake. -- You received this message beca

[tesseract-ocr] Re: retrained file after fine tuning the tesseract

2019-07-08 Thread Purushotham Rao Eravalli
Thank you got it. On Monday, July 8, 2019 at 2:43:16 PM UTC+5:30, Purushotham Rao Eravalli wrote: > > I have retrained the the tesseract model for particular font. The base > trained file "eng.traineddata" file was approx 5MB. But when I retrained > the model it got

[tesseract-ocr] Language detection

2019-07-16 Thread Purushotham Rao Eravalli
Hi, Is there a way where we can detect that the text is english or else of any other language using the detection box given by tesseract.? Can someone please help me with this. Thanks -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubsc

[tesseract-ocr] Re: Language detection

2019-07-16 Thread Purushotham Rao Eravalli
Thank you very much On Tuesday, July 16, 2019 at 5:42:24 PM UTC+5:30, Purushotham Rao Eravalli wrote: > > Hi, > Is there a way where we can detect that the text is english or else of any > other language using the detection box given by tesseract.? Can someone > please he

[tesseract-ocr] Tesseract with GPU

2019-07-24 Thread Purushotham Rao Eravalli
Does the response time gets reduced if we run tesseract on GPUs, If so can you share best process available as of now -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an ema

[tesseract-ocr] Re: GPU for Tesseract

2019-07-24 Thread Purushotham Rao Eravalli
Does response time of tesseract decreases if we run it on GPU? On Friday, June 28, 2019 at 11:24:07 AM UTC+5:30, Pooja Kamra wrote: > > On Tesseract site, it is mentioned that no GPU is needed (No support). > What does this statement means? > If i have a machine with GPU, does it improve training

[tesseract-ocr] How do we pass coordinate to tesseract so that we escape detection process and run only recognition using tesseract

2019-09-05 Thread Purushotham Rao Eravalli
How do we pass coordinate to tesseract so that we escape detection process and run only recognition using tesseract -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email

[tesseract-ocr] Re: How do we pass coordinate to tesseract so that we escape detection process and run only recognition using tesseract

2019-09-06 Thread Purushotham Rao Eravalli
Yes, basically I will have a single line of text like "Eravalli Purushotham Rao" in the cropped image. Can you provide any link for the solution On Friday, September 6, 2019 at 11:55:16 AM UTC+5:30, Purushotham Rao Eravalli wrote: > > How do we pass coordinate to tesseract s

Re: [tesseract-ocr] How do we pass coordinate to tesseract so that we escape detection process and run only recognition using tesseract

2019-09-06 Thread Purushotham Rao Eravalli
that is the case you can export panels into separate images and run > tesseract on them. > > Ravi > > Sent from my iPhone > > On Sep 6, 2019, at 2:25 AM, Purushotham Rao Eravalli < > purushot...@sukshi.com> wrote: > > How do we pass coordinate to tesseract so

Re: [tesseract-ocr] How do we pass coordinate to tesseract so that we escape detection process and run only recognition using tesseract

2019-09-06 Thread Purushotham Rao Eravalli
> On Fri, Sep 6, 2019 at 6:18 AM Purushotham Rao Eravalli < > purushot...@sukshi.com> wrote: > >> Will it still do detection for that passed seented image or does it run >> recognition directly? >> >> On Fri, Sep 6, 2019, 3:13 PM Ravi Annaswamy >> wrote:

Re: [tesseract-ocr] summarizing LSTM

2019-09-06 Thread Purushotham Rao Eravalli
It will be great if you provide any source where we can get detailed information about the architecture used for tesseract and it's loss functions or so. Thanks On Fri, Sep 6, 2019, 6:39 PM Timothy Snyder wrote: > Do you want to learn more about neural networks or specifically, a > "summarizing

[tesseract-ocr] Which mode is better if i crop the exact text with multiple words and pass to tesseract for accuracy?

2019-09-19 Thread Purushotham Rao Eravalli
I am using some other model for text detection, I get text boxes for each line in the image(basically some time identity cards). Now i need to pass them through the tesseract for recognition. Which psm mode do you think will the higher accuracy between psm13 and psm7. -- You received this mes

[tesseract-ocr] Tesseract Recognition using psm13 for charatcers like "t", "i", "j"

2019-09-30 Thread Purushotham Rao Eravalli
Hi, I retrained tesseract with Calibiri, arial. While testing on the cropped text images I am facing issues where the characters "t", "i", "j" are all recognised as "l" adn sometimes "e" as "a". Does someone have solution for this. Thanks, Purushotham -- You received this message because yo

Re: [tesseract-ocr] Tesseract Recognition using psm13 for charatcers like "t", "i", "j"

2019-09-30 Thread Purushotham Rao Eravalli
Hi, Please look at these images. Thanks On Mon, Sep 30, 2019 at 6:35 PM Zdenko Podobny wrote: > Can you provide testing images? > I do not think there is any need to retrain tesseract for common font > like Arial. > > Zdenko > > > po 30. 9. 2019 o 12:29 Purushotham R

[tesseract-ocr] Re: Tesseract Recognition using psm13 for charatcers like "t", "i", "j"

2019-09-30 Thread Purushotham Rao Eravalli
[image: 8aa8ea34feb16d5ee596e05fffe4c81f.jpg_front2-201-6.jpg] [image: 5e07a43c069f76fcb85505f8dcda1721.jpg_front2-476-4.jpg] On Monday, September 30, 2019 at 3:59:00 PM UTC+5:30, Purushotham Rao Eravalli wrote: > > Hi, > > I retrained tesseract with Calibiri, arial. While te

Re: [tesseract-ocr] Re: Tesseract Recognition using psm13 for charatcers like "t", "i", "j"

2019-09-30 Thread Purushotham Rao Eravalli
t; > IMO 4.1 should produce the same result. I use model from tessdata_best. > > Zdenko > > > po 30. 9. 2019 o 15:12 Purushotham Rao Eravalli > napísal(a): > >> [image: 8aa8ea34feb16d5ee596e05fffe4c81f.jpg_front2-201-6.jpg] >> >> [image: 5e07a43c069f76fcb85505f8dcda

Re: [tesseract-ocr] Training tesseract 4.00 by removing unwanted symbols

2019-10-31 Thread Purushotham Rao Eravalli
Hi, Can we retrain tesseract by removing all the unwanted symbols and characters for English language. If so can someone share how to do so please. Thanks, Purushotham On Thu, Oct 31, 2019, 5:57 PM 'Yuliana Zigangirova' via tesseract-ocr < tesseract-ocr@googlegroups.com> wrote: > Hi everyone,

[tesseract-ocr] Using tesseract on python

2019-11-01 Thread Purushotham Rao Eravalli
Hi I tried using pytessract for testing tesseract but it is giving null output for few images. Is there a different way to test tesseract -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving email

Re: [tesseract-ocr] Using tesseract on python

2019-11-01 Thread Purushotham Rao Eravalli
Yeah sorry for that Thanks On Friday, November 1, 2019 at 3:07:44 PM UTC+5:30, zdenop wrote: > > Yes, there is - tesseract executable. Please read first available docs > (wiki) before asking any other questions. > > Zdenko > > > pi 1. 11. 2019 o 10:29 Purushotham R

[tesseract-ocr] Calculation of confidence levels

2019-11-22 Thread Purushotham Rao Eravalli
Is there a way to figure out how does tesseract calculate confidence levels for characters or words. As of now if we observe we are getting higher confidence even though the text is wrong, lower confidence even though the text is correct. So it might be helpful if we know. And is there anyway to

[tesseract-ocr] Cost Function used in OCR

2019-12-18 Thread Purushotham Rao Eravalli
Is their any way to tweak the cost function or modify it. What is the cost function used by tesseract. Please can someone help with this. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails

[tesseract-ocr] Training data generation using text lines

2020-04-27 Thread Purushotham Rao Eravalli
I am facing issue with text image creation, I am unable to generate the noise or disturbances in the image, please can someone help me how to generate image files from text with different type of noise in the image -- You received this message because you are subscribed to the Google Groups "t