Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-07-12 Thread yajva
eng+iast-plus-3600 => no diacritics at all Latin+iast-plus-3600 => only macrons none other On Thursday, July 12, 2018 at 1:12:25 AM UTC+5:30, shree wrote: > > What about ocr with > > eng+iast > > > > On Wed 11 Jul, 2018, 7:44 PM yajva, > > wrote: > >&

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-07-11 Thread yajva
e better results. Is it possible to add diacritics to Latin? Can you help in any way? regards Venkatesh On Monday, July 2, 2018 at 2:05:47 PM UTC+5:30, yajva wrote: > > Many thanks. Downloaded and using. > Will wait for next ver. > > > On Sunday, July 1, 2018 at 12:21:19

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-07-02 Thread yajva
t; Attached is the OCRed output for pages 13-24 of dark pdf with it. > > I am still training a different variation. > > > > On Wed, Jun 27, 2018 at 6:46 PM Shree Devi Kumar > wrote: > >> ok. I will take a look. >> >> On Wed, Jun 27, 2018 at 5:04 PM yajva

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-06-27 Thread yajva
.exe > > I'd be also interested in testing of the tessdata manager, which should > now also properly handle script tessdatas > > On Tue 26 Jun, 2018, 10:59 PM yajva, > > wrote: > >> The doc is diff ver of the same text. Here's the doc used for the first. >&

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-06-21 Thread yajva
one more correction. On Thursday, June 21, 2018 at 11:34:00 PM UTC+5:30, yajva wrote: > > done > > On Wednesday, June 20, 2018 at 9:05:01 PM UTC+5:30, shree wrote: >> >> I am attaching the OCRed text. Please correct it so that I can use as >> groundtruth fo

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-06-21 Thread yajva
one a training for sanskrit for both devanagari and IAST but it >> does not include cedilla for Sh >> >> I will add it and let you know. >> >> On Wed 20 Jun, 2018, 1:17 AM yajva, > >> wrote: >> >>> I have tried Google OCR for recognizing Sansk