Have a look at http://www.leptonica.org/line-removal.html
The source code is here:
https://github.com/DanBloomberg/leptonica/blob/master/prog/lineremoval_reg.c
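
For a rough idea of what is involved, here is a minimal sketch in plain Python. It is not the Leptonica code and does not reproduce the steps on the page above. It only illustrates the general idea of a morphological opening with a long horizontal structuring element: on a binary image, any horizontal run of foreground pixels at least `min_len` wide is treated as part of a line and erased. The function names, the `min_len` parameter, and the list-of-lists image format (1 = ink, 0 = background) are assumptions made for illustration.

```python
def find_horizontal_runs(image, min_len):
    """Mark foreground pixels that belong to a horizontal run of at
    least min_len pixels. This is equivalent to a morphological opening
    with a 1 x min_len horizontal structuring element."""
    mask = [[0] * len(row) for row in image]
    for y, row in enumerate(image):
        x = 0
        while x < len(row):
            if row[x]:
                start = x
                # Walk to the end of this run of foreground pixels.
                while x < len(row) and row[x]:
                    x += 1
                if x - start >= min_len:
                    for i in range(start, x):
                        mask[y][i] = 1
            else:
                x += 1
    return mask


def remove_horizontal_lines(image, min_len):
    """Return a copy of the binary image with long horizontal runs erased."""
    mask = find_horizontal_runs(image, min_len)
    return [
        [0 if m else p for p, m in zip(row, mask_row)]
        for row, mask_row in zip(image, mask)
    ]


if __name__ == "__main__":
    # Hypothetical 3-row image: short "text" strokes above and below a rule.
    image = [
        [0, 1, 1, 0, 0, 1, 0, 0],
        [1, 1, 1, 1, 1, 1, 1, 1],
        [0, 1, 0, 0, 1, 1, 0, 0],
    ]
    for row in remove_horizontal_lines(image, 6):
        print(row)
```

`min_len` has to be wider than the widest horizontal stroke of any character, otherwise parts of the text are erased too. This sketch also erases text pixels that sit directly on the line, so the result usually needs some repair before OCR. It also assumes the image is already deskewed and binarized.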

Zdenko


On Fri, Sep 6, 2024 at 11:08, Sundar Andaperumal <sundar2...@gmail.com> wrote:

> Hi,
>
> I am trying to remove the thin horizontal line, but when I do, the text in
> the SUBTOTAL section gets disturbed and the output contains special
> characters such as `°`, `—`, `~`, `*`, etc.
>
> How can I ignore or remove this horizontal line and extract the proper
> text in the SUBTOTAL section? Image attached.
>
> Thanks!
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/6ca4d72e-6dac-4db9-8d25-abbe20e5ffd3n%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/6ca4d72e-6dac-4db9-8d25-abbe20e5ffd3n%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

