date:20211228

[tesseract-ocr] getting started, but no results

2021-12-28 Thread michael c

I am just starting to use the tesseract package and am having no luck getting it to recognize anything. My environment is C#, using the package from NuGet. It runs fine, but no text is recognized in my sample image. It does work on the provided 'phototest.tif'. I have fiddled with many param

Re: [tesseract-ocr] getting started, but no results

2021-12-28 Thread Zdenko Podobny

Did you read the docs? https://github.com/tesseract-ocr/tessdoc/blob/main/ImproveQuality.md
Zdenko
On Tue, 28 Dec 2021 at 10:28, michael c wrote:
> I am just starting to use the tesseract package and having no luck getting
> it to recognize anything.
> My environment is C# using the package fro

Re: [tesseract-ocr] getting started, but no results

2021-12-28 Thread michael c

Thanks, yes, I had looked at that. I began by expanding the image by 5x to get the characters to about 50 pixels high (vs. about 8 initially). My initial tests generated a tessinput.tif that looked very good to my eye, but it did not work for the OCR. I ended up also doing:
- posterize to level 2 to
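
For anyone following along, here is a rough sketch of the two preprocessing steps mentioned above (enlarging by an integer factor, then reducing to two gray levels). It is plain Python on a list-of-rows grayscale image, not the C# code from this thread. The function names are hypothetical, and reading "posterize to level 2" as a simple two-level threshold at 128 is an assumption.

```python
def scale_factor(current_px, target_px):
    """Factor needed to bring glyphs from current_px to target_px in height."""
    return target_px / current_px


def upscale_nearest(pixels, factor):
    """Enlarge a grayscale image (list of rows) by an integer factor.

    Nearest-neighbour only: each pixel becomes a factor x factor block.
    """
    out = []
    for row in pixels:
        wide = [value for value in row for _ in range(factor)]
        # Copy the widened row so the output rows are independent lists.
        out.extend(list(wide) for _ in range(factor))
    return out


def posterize_two_levels(pixels, threshold=128):
    """Assumed meaning of 'posterize to level 2': keep only black and white."""
    return [[255 if value >= threshold else 0 for value in row] for row in pixels]


# Tiny 2x2 stand-in for a real image.
tiny = [[10, 200], [130, 90]]
prepared = posterize_two_levels(upscale_nearest(tiny, 2))
```

With glyphs about 8 pixels high, `scale_factor(8, 40)` gives 5.0, so a 5x enlargement lands near 40 pixels. A real pipeline would more likely use an image library with smoother interpolation than nearest-neighbour.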

Re: [tesseract-ocr] getting started, but no results

2021-12-28 Thread Yesbird

I believe the key is the page segmentation mode; try experimenting with it: https://www.pyimagesearch.com/2021/11/15/tesseract-page-segmentation-modes-psms-explained-how-to-improve-your-ocr-accuracy/
On Tuesday, December 28, 2021 at 11:45:54 PM UTC+3 thisism...@gmail.com wrote:
> Thanks, yes i had look
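
One way to experiment with the page segmentation mode is to run the same image through several modes and compare the output. Below is a minimal sketch that drives the `tesseract` command-line tool from Python, rather than the C# wrapper used in this thread. It assumes `tesseract` is on the PATH; the helper names and the shortlist of modes are illustrative only.

```python
import subprocess

# A few of the modes listed by `tesseract --help-psm`:
# 3 = fully automatic (default), 4 = single column, 6 = single uniform block,
# 7 = single text line, 11 = sparse text, 13 = raw line.
PSM_CANDIDATES = [3, 4, 6, 7, 11, 13]


def build_command(image_path, psm, lang="eng"):
    """Build the argv list for one tesseract run that prints text to stdout."""
    return ["tesseract", image_path, "stdout", "--psm", str(psm), "-l", lang]


def try_psms(image_path, psms=PSM_CANDIDATES):
    """Run tesseract once per mode and return {psm: recognized text}."""
    results = {}
    for psm in psms:
        proc = subprocess.run(
            build_command(image_path, psm),
            capture_output=True,
            text=True,
        )
        results[psm] = proc.stdout.strip()
    return results
```

Calling `try_psms("sample.png")` (a hypothetical file name) returns a dictionary keyed by mode, which makes it easy to see which modes produce any text at all for a given image.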