Have you tried cropping the image to remove the arrowhead to see if that improves the result?
On Tue, Oct 6, 2020 at 9:42 AM Andrew <aecbo...@gmail.com> wrote: > As per my question on StackOverflow: PyTesseract not recognizing decimals > <https://stackoverflow.com/questions/64203559/pytesseract-not-recognizing-decimals> > > I'm using PyTesseract to recognise text in table cells. When it comes to > recognising drug doses with decimal points, the OCR fails to recognise the > period character ( . ) , though is accurate for everything else. I'm > using tesseract v5.0.0-alpha.20200328 on Windows 10. > > My pre-processing consists of upscaling by 400% using cubic, conversion to > black and white, dilation and erosion, morphology, and blurring. I've tried > a decent combination of all of these (as well as each on their own), and > nothing has recognized the . > > I've tried --psm of various values as well as a character whitelist. I > believe the font is Sergoe UI. > > Before processing: [image: S87rd.png] > <https://i.stack.imgur.com/S87rd.png> > > After processing: [image: OFjoL.png] > <https://i.stack.imgur.com/OFjoL.png> > > PyTesseract output: 25mg »p > > Processing code attached > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/5c754a36-a0e4-427f-9650-f41200a1cda5n%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/5c754a36-a0e4-427f-9650-f41200a1cda5n%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXDOzEmLdUCBD3PH16e-scc7-izoXnmfPZQHpU%2BUC-%3DtA%40mail.gmail.com.