[tesseract-ocr] Re: Text extraction failure after preprocessing.

'uday kaipa' via tesseract-ocr Fri, 28 Jun 2024 05:09:15 -0700

I have resized the image so that the text height is around 30 px, and I 
have tried a 10 px border, as recommended in some threads here.
I converted the image to binary and tried all PSM modes.
I am not sure why it is still not OCR'ed properly.
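
For reference, the steps described above (scale to a text height of about 30 px, binarize, add a 10 px border, then try several PSM modes) can be sketched roughly as follows. This is only a dependency-free illustration, not the actual pipeline: the image is assumed to be a plain list of grayscale rows, the fixed threshold of 128 is an assumption, and all function names are hypothetical. A real script would do the resizing and thresholding with an imaging library.

```python
from typing import List

# Assumption: a grayscale image is a list of rows, 0 = black, 255 = white.
Image = List[List[int]]


def scale_factor(text_height_px: float, target_height_px: float = 30.0) -> float:
    """Resize factor that brings the measured text height to the target height."""
    if text_height_px <= 0:
        raise ValueError("text height must be positive")
    return target_height_px / text_height_px


def binarize(img: Image, threshold: int = 128) -> Image:
    """Map every pixel to 0 or 255 using a fixed global threshold (assumed value)."""
    return [[255 if p >= threshold else 0 for p in row] for row in img]


def add_border(img: Image, border: int = 10, fill: int = 255) -> Image:
    """Pad the image with a uniform border on all four sides."""
    width = len(img[0]) if img else 0
    blank = [fill] * (width + 2 * border)
    padded = [blank[:] for _ in range(border)]
    padded += [[fill] * border + list(row) + [fill] * border for row in img]
    padded += [blank[:] for _ in range(border)]
    return padded


def tesseract_commands(image_path: str, out_base: str = "new",
                       psm_modes=range(6, 14)) -> List[List[str]]:
    """Build one tesseract argument list per PSM mode (6 to 13 by default)."""
    return [
        ["tesseract", image_path, out_base,
         "--oem", "1", "--psm", str(psm),
         "-c", "tessedit_char_whitelist=0123456789.:",
         "hocr"]
        for psm in psm_modes
    ]


if __name__ == "__main__":
    # Tiny synthetic example: a 2x3 "image" with mixed gray values.
    sample = [[10, 200, 130], [255, 90, 0]]
    prepared = add_border(binarize(sample), border=1)
    for row in prepared:
        print(row)
    print(scale_factor(15))  # text measured at 15 px needs 2x upscaling
```

Each argument list returned by tesseract_commands could be passed to subprocess.run to compare the results across PSM modes for the same preprocessed image.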


Any help is appreciated. :) 





On Thursday, June 27, 2024 at 6:24:36 PM UTC+2 uday kaipa wrote:

> Hi, 
>
> I have an image containing the number 96 (in general, an image may contain 
> any number between 0 and 100). Please find it attached.
> I have tried Tesseract PSM modes 6 to 13, and the image size, font, and 
> everything else look good to me, yet the text is recognized as 36.
> When I adjust the padding or other pre-processing, it works for 
> this image, but some other images are then recognized incorrectly.
>
> Can anyone recommend any other pre-processing that might improve the 
> recognition?
>
> tesseract --oem 1 --psm 7 -c tessedit_char_whitelist=0123456789.: 
> C:/Users/xxx/Desktop/test_folder/IMG_2303_2cfac/subboxes/Image_BHU32_1_PREPROCESSED_27-06-2024_17h39m53s.JPG
> new hocr
>
>
> Many thanks in advance.
>
> Regards,
> Uday
>
>
>
