Re: [tesseract-ocr] Remove certain characters while fine tuning (training) tesseract

Greg Dunkel Wed, 10 Mar 2021 09:50:30 -0800

Would it be easier to remove these characters from the output using editing
tools?


On Tue, Mar 9, 2021, 2:30 AM Murtuza Dahodwala <murtuzamda...@gmail.com>
wrote:

> Hello,
> Currently, my OCR model detects certain characters like *₹ *& *|.*
> Is it possible that I can remove these characters by correcting my lstm
> bounding box dataset and then fine-tuning it so that it does not detect
> these symbols in my test images ??
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/ecd726d5-8ab0-4986-87b0-7ff344d3271cn%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/ecd726d5-8ab0-4986-87b0-7ff344d3271cn%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CA%2BOX7toPXQHf%3DFhyHtXg%2B9ziY_ti%3Darq0ewLrLh%3DyYPNWj--cQ%40mail.gmail.com.

Re: [tesseract-ocr] Remove certain characters while fine tuning (training) tesseract

Reply via email to