Fixed! Thank you, your suggestion worked.

On Tuesday, October 6, 2020 at 6:36:39 PM UTC+10:30 shree wrote:

> Have you tried cropping the image to remove the arrowhead to see if that 
> improves the result?
>
> On Tue, Oct 6, 2020 at 9:42 AM Andrew <aecb...@gmail.com> wrote:
>
>> As per my question on StackOverflow:  PyTesseract not recognizing 
>> decimals 
>> <https://stackoverflow.com/questions/64203559/pytesseract-not-recognizing-decimals>
>>
>> I'm using PyTesseract to recognise text in table cells. When it comes to 
>> recognising drug doses with decimal points, the OCR fails to recognise the 
>> period character ( . ) , though is accurate for everything else. I'm 
>> using tesseract v5.0.0-alpha.20200328 on Windows 10.
>>
>> My pre-processing consists of upscaling by 400% using cubic, conversion 
>> to black and white, dilation and erosion, morphology, and blurring. I've 
>> tried a decent combination of all of these (as well as each on their own), 
>> and nothing has recognized the .
>>
>> I've tried --psm of various values as well as a character whitelist. I 
>> believe the font is Sergoe UI.
>>
>> Before processing:  [image: S87rd.png] 
>> <https://i.stack.imgur.com/S87rd.png>
>>
>> After processing:  [image: OFjoL.png] 
>> <https://i.stack.imgur.com/OFjoL.png>
>>
>> PyTesseract output: 25mg »p
>>
>> Processing code attached
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/5c754a36-a0e4-427f-9650-f41200a1cda5n%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/5c754a36-a0e4-427f-9650-f41200a1cda5n%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>
>
> -- 
>
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/0f4751a3-a6f6-4dac-94fe-b555fab02248n%40googlegroups.com.

Reply via email to