Re: [tesseract-ocr] Have problem with parsing bold white fonts

2020-08-05 Thread evgzak...@gmail.com
Thanks, I will try it. Evgeny On Tuesday, 4 August 2020 at 20:18:54 UTC+3, zdenop wrote: > First you need to remove the background. > Then > https://github.com/tesseract-ocr/tessdoc/blob/master/ImproveQuality.md > > Zdenko > > > On Tue, 4 Aug 2020 at 13:07, Евгений Захаров wrote: > >> Hi. It wold
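As a rough illustration of the "remove the background" advice for bold white text, here is a minimal sketch in plain Python. It assumes the image is already available as a grid of 8-bit grayscale values; the function name `isolate_white_text` and the threshold of 200 are hypothetical choices for illustration, not anything from Tesseract or the thread. Near-white pixels are mapped to black and everything else to white, so the result is dark text on a plain light background.

```python
def isolate_white_text(gray, threshold=200):
    """Map near-white pixels (the text) to black and everything else to white.

    `gray` is a list of rows of 8-bit grayscale values (0-255).
    `threshold` is an assumed cut-off; it would need tuning per image.
    """
    return [[0 if px >= threshold else 255 for px in row] for row in gray]


# Tiny hand-made example: bright values stand in for white glyph pixels,
# darker values for a textured background.
sample = [
    [30, 250, 120],
    [255, 90, 240],
]
binary = isolate_white_text(sample)
```

A fixed global threshold like this only works when the text is clearly brighter than everything behind it; for uneven backgrounds the page linked above covers other preprocessing options.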

Re: [tesseract-ocr] Train for big letters in the beginning of the sentences(pic)

2020-08-05 Thread tlit...@gmail.com
That's right, I mean that initial "TO", and this is just a fraction of the text; there are dozens of examples like "TO" on a single page. But since it spreads over two lines, I assume there's nothing I can do? On Tuesday, August 4, 2020 at 7:39:21 PM UTC+2 zdenop wrote: > Not sure what you mean... > >

[tesseract-ocr] arabic number traineddata

2020-08-05 Thread Ahmed Nageh
I have trained Tesseract on Arabic numbers, but it doesn't work properly, so I need traineddata to run it on Arabic number images. [image: image_id.jpg] That's the number image I need to extract from. Thanks in advance. -- You received this message because you are subscribed to the Goog

Re: [tesseract-ocr] Train for big letters in the beginning of the sentences(pic)

2020-08-05 Thread Tom Morris
The technical term for these is "drop-caps," which is useful to know if you want to Google for it. It's pretty dated now, but Ray's 2007 description of the line finding algorithm says: "Assumi

[tesseract-ocr] Getting started with contributions

2020-08-05 Thread Uddeshya Tyagi
Hello developers! I'm Uddeshya Tyagi, a computer science student from Jiit, Noida, India. I recently learnt the basics of the *tesseract* library. I now want to *contribute* to this project, so please help me get started with it.

Re: [tesseract-ocr] Getting started with contributions

2020-08-05 Thread Zdenko Podobny
Just send pull requests to the GitHub repository. Zdenko On Thu, 6 Aug 2020 at 7:34, Uddeshya Tyagi wrote: > Hello developers! I'm Uddeshya Tyagi,a computer science student from > Jiit,Noida,India.I recently learnt basics of *tesseract* library.I,now > want to *contribute* to this project,so please