Would it be easier to remove these characters from the output using editing tools?
On Tue, Mar 9, 2021, 2:30 AM Murtuza Dahodwala <murtuzamda...@gmail.com> wrote: > Hello, > Currently, my OCR model detects certain characters like *₹ *& *|.* > Is it possible that I can remove these characters by correcting my lstm > bounding box dataset and then fine-tuning it so that it does not detect > these symbols in my test images ?? > > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/ecd726d5-8ab0-4986-87b0-7ff344d3271cn%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/ecd726d5-8ab0-4986-87b0-7ff344d3271cn%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CA%2BOX7toPXQHf%3DFhyHtXg%2B9ziY_ti%3Darq0ewLrLh%3DyYPNWj--cQ%40mail.gmail.com.