"tables" are know issues. See e.g. https://github.com/tesseract-ocr/tesseract/issues/1979 https://github.com/tesseract-ocr/tesseract/issues/1714
Zdenko št 30. 5. 2019 o 11:56 Manasi sarode <manasi.sarode...@gmail.com> napísal(a): > I'm trying to solve the query of table content detection by using > Tesseract, but its not giving accurate results in that is Some of the > contents are missing. Also, if I can get any function/api for table > content extraction, > Observations in attached screenshots:- > > 1)Mombai is detected as Mombal > 2)Rawalpindi is detected as Rawalpind! > 3)There are spaces before and after spaces. > 4)There are underscore(_) after the numbers of second column. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To post to this group, send email to tesseract-ocr@googlegroups.com. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/928c8f45-7759-43f6-950f-ffeecd065d6f%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/928c8f45-7759-43f6-950f-ffeecd065d6f%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8xLW5O%3DTrbieyceRh%2B%2BB_7z%2Bm9hZrRTjycqMX0Z3-_2Tw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.