Fixed! Thank you, your suggestion worked.

On Tuesday, October 6, 2020 at 6:36:39 PM UTC+10:30 shree wrote:
> Have you tried cropping the image to remove the arrowhead to see if that
> improves the result?
>
> On Tue, Oct 6, 2020 at 9:42 AM Andrew <aecb...@gmail.com> wrote:
>
>> As per my question on StackOverflow: PyTesseract not recognizing decimals
>> <https://stackoverflow.com/questions/64203559/pytesseract-not-recognizing-decimals>
>>
>> I'm using PyTesseract to recognise text in table cells. When it comes to
>> recognising drug doses with decimal points, the OCR fails to recognise the
>> period character ( . ), though it is accurate for everything else. I'm
>> using tesseract v5.0.0-alpha.20200328 on Windows 10.
>>
>> My pre-processing consists of upscaling by 400% using cubic interpolation,
>> conversion to black and white, dilation and erosion, morphology, and
>> blurring. I've tried a decent number of combinations of these (as well as
>> each on its own), and none has made the period recognisable.
>>
>> I've tried various --psm values as well as a character whitelist. I
>> believe the font is Segoe UI.
>>
>> Before processing: [image: S87rd.png]
>> <https://i.stack.imgur.com/S87rd.png>
>>
>> After processing: [image: OFjoL.png]
>> <https://i.stack.imgur.com/OFjoL.png>
>>
>> PyTesseract output: 25mg »p
>>
>> Processing code attached
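
For anyone landing on this thread later, the cropping suggestion above could be sketched roughly as follows. This is not the attached processing code. It assumes the arrowhead sits at the right edge of the cell image, and the names `crop_box`, `ocr_without_arrowhead`, and `right_fraction`, along with the 20% default, are hypothetical and would need tuning for your own images.

```python
def crop_box(width, height, right_fraction=0.2):
    """Return a (left, upper, right, lower) box that drops the rightmost
    `right_fraction` of an image of the given size.

    Assumption: the dropdown arrowhead is at the right edge of the cell.
    """
    if not 0 <= right_fraction < 1:
        raise ValueError("right_fraction must be in [0, 1)")
    keep_width = width - int(width * right_fraction)
    return (0, 0, keep_width, height)


def ocr_without_arrowhead(path, right_fraction=0.2):
    """Crop the assumed arrowhead region off the image, then run OCR."""
    # Imported lazily so crop_box can be used without these packages installed.
    from PIL import Image
    import pytesseract

    image = Image.open(path)
    width, height = image.size
    cropped = image.crop(crop_box(width, height, right_fraction))
    # --psm 7 asks Tesseract to treat the image as a single line of text.
    return pytesseract.image_to_string(cropped, config="--psm 7").strip()
```

Calling `ocr_without_arrowhead("cell.png")` (a placeholder file name) would run OCR on the cropped cell; any other pre-processing such as upscaling can still be applied to the cropped image before the Tesseract call.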