Have you tried cropping the image to remove the arrowhead to see if that
improves the result?

On Tue, Oct 6, 2020 at 9:42 AM Andrew <aecbo...@gmail.com> wrote:

> As per my question on StackOverflow:  PyTesseract not recognizing decimals
> <https://stackoverflow.com/questions/64203559/pytesseract-not-recognizing-decimals>
>
> I'm using PyTesseract to recognise text in table cells. When it comes to
> recognising drug doses with decimal points, the OCR fails to recognise the
> period character ( . ) , though is accurate for everything else. I'm
> using tesseract v5.0.0-alpha.20200328 on Windows 10.
>
> My pre-processing consists of upscaling by 400% using cubic, conversion to
> black and white, dilation and erosion, morphology, and blurring. I've tried
> a decent combination of all of these (as well as each on their own), and
> nothing has recognized the .
>
> I've tried --psm of various values as well as a character whitelist. I
> believe the font is Sergoe UI.
>
> Before processing:  [image: S87rd.png]
> <https://i.stack.imgur.com/S87rd.png>
>
> After processing:  [image: OFjoL.png]
> <https://i.stack.imgur.com/OFjoL.png>
>
> PyTesseract output: 25mg »p
>
> Processing code attached
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/5c754a36-a0e4-427f-9650-f41200a1cda5n%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/5c754a36-a0e4-427f-9650-f41200a1cda5n%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXDOzEmLdUCBD3PH16e-scc7-izoXnmfPZQHpU%2BUC-%3DtA%40mail.gmail.com.

Reply via email to