Re: [tesseract-ocr] Tesseract-OCR Training Arabic text & numbers

2020-07-14 Thread Eliyaz L
help me with the sample dataset and can i use this <https://github.com/tesseract-ocr/tesstrain> repo to train and if any apx count of dataset and iteration can be provide that will be helpful. On Monday, July 13, 2020 at 11:55:41 AM UTC+3, Eliyaz L wrote: > > Thanks for the support,

Re: [tesseract-ocr] Tesseract-OCR Training Arabic text & numbers

2020-07-13 Thread Eliyaz L
the latest traineddata and try. > > On Sun, Jul 12, 2020, 20:52 Eliyaz L > > wrote: > >> Hi Shree, >> >> i was using thie below version. I guess you are right its 2016 file. Let >> me test with latest traineddata. >> https://tesseract-ocr.github.io/t

Re: [tesseract-ocr] Tesseract-OCR Training Arabic text & numbers

2020-07-12 Thread Eliyaz L
Hi Shree, i was using thie below version. I guess you are right its 2016 file. Let me test with latest traineddata. https://tesseract-ocr.github.io/tessdoc/Data-Files https://github.com/tesseract-ocr/tessdata/raw/4.00/ara.traineddata Meanwhile can u pls help me with arabic number. i tried ara_

Re: [tesseract-ocr] Tesseract-OCR Training Arabic text & numbers

2020-07-12 Thread Eliyaz L
ser/.local/ share/fonts/ done Input Image: [image: firstName.jpg] On Sunday, July 12, 2020 at 2:00:40 PM UTC+3, shree wrote: > > What character are you trying to add? > Please share the training data to try and replicate the issue. > > > On Sun, Jul 12, 2020, 15:35 Eliyaz L &g

[tesseract-ocr] Tesseract-OCR Training Arabic text & numbers

2020-07-12 Thread Eliyaz L
Hi, My use case is on Arabic document, the pre retrained ara.traineddata are good but not perfect. so i wish to fine tune ara.traineddata, if the results are not satisfying then have train my own custom data. please suggest me for the following: 1. for my use case in Arabic text, proble