Have a look at http://www.leptonica.org/line-removal.html
The source code is here: https://github.com/DanBloomberg/leptonica/blob/master/prog/lineremoval_reg.c
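To illustrate the general idea behind this kind of line removal, here is a minimal sketch. It is not the Leptonica code from the links above; it is a pure-Python illustration that assumes the image has already been binarized into rows of 0/1 pixels (1 = ink). The function name `remove_horizontal_lines` and the `min_run` parameter are hypothetical, chosen for this example. Clearing every horizontal run of ink at least `min_run` pixels long is the same as subtracting a morphological opening with a 1 x `min_run` horizontal structuring element.

```python
def remove_horizontal_lines(image, min_run=20):
    """Return a copy of a binary image with long horizontal ink runs cleared.

    image   -- list of rows, each a list of 0/1 pixels (1 = ink); assumed format
    min_run -- hypothetical threshold: runs at least this long count as a line
    """
    result = [row[:] for row in image]  # copy so the input is not modified
    for y, row in enumerate(image):
        width = len(row)
        x = 0
        while x < width:
            if row[x] != 1:
                x += 1
                continue
            # Measure the run of consecutive ink pixels starting at x.
            start = x
            while x < width and row[x] == 1:
                x += 1
            # Short runs are text strokes; long runs are treated as a ruled line.
            if x - start >= min_run:
                for i in range(start, x):
                    result[y][i] = 0
    return result


# Tiny demo: row 1 is a full-width line, rows 0 and 2 are short "text" strokes.
page = [
    [0, 1, 1, 0, 0, 1, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 1, 1, 0, 0],
]
cleaned = remove_horizontal_lines(page, min_run=6)
for row in cleaned:
    print("".join("#" if p else "." for p in row))
```

Note that this simple version also erases any text pixels that lie on the line itself, so characters touching the line can be damaged. `min_run` should be set well above the widest horizontal stroke in the text, and the page should be deskewed first, since a tilted line breaks into short runs.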
Zdenko

On Fri, 6 Sep 2024 at 11:08, Sundar Andaperumal <sundar2...@gmail.com> wrote:
> Hi,
>
> I am trying to remove the thin horizontal line; when doing so, the text in
> the SUBTOTAL section gets disturbed and produces special characters such as
> `°`, `—`, `~`, `*`, etc.
>
> How can I ignore or remove this horizontal line and extract the proper text
> in the SUBTOTAL section? Image attached.
>
> Thanks!

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8x_qaX2u8yg-0ay0B0J3hC5Jk3LbmS8S0QsW3mbHMTU2g%40mail.gmail.com.