I forgot to mention that I am using the Tesseract C++ API:

    tesseract::ResultIterator* res_it = api->GetIterator();
    tesseract::PageIteratorLevel level = tesseract::RIL_SYMBOL;
    // Choices for the symbol the result iterator currently points at
    tesseract::ChoiceIterator ci(*res_it);
    do {
        if (ci.Confidence() >= 0) {
            Choice* c = new Choice();  // Choice is my own class
            const char* ch = ci.GetUTF8Text();
        }
    } while (ci.Next());

jochen....@gmail.com wrote on Thursday, 3 September 2020 at 08:10:53 UTC+2:
> Hi all,
> I am using the new choice iterator in Tesseract 5 to get the confidences
> for all choices for each symbol of my text. But spaces (word boundaries)
> are not shown, so I have no way to know when a space falls between symbols.
> Is there a way, for example, to combine the word iterator with the choice
> iterator, or any other way to know when a new word starts?
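
One possible direction, sketched here as an assumption and not as a confirmed answer from this thread: while stepping the result iterator at RIL_SYMBOL, ask it via IsAtBeginningOf(RIL_WORD) whether the current symbol opens a new word, and build a fresh ChoiceIterator for each symbol. The names SymbolChoices and CollectChoices are hypothetical. The iterator types are template parameters only so that the sketch compiles without the Tesseract headers; with Tesseract they would be tesseract::ResultIterator and tesseract::ChoiceIterator.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical container: all choices for one symbol, plus a word-start flag.
struct SymbolChoices {
  bool starts_word = false;
  std::vector<std::pair<std::string, float>> choices;  // (text, confidence)
};

// Walks the result iterator symbol by symbol. A symbol that starts a word
// implies a word boundary (for example a space) in front of it, except at
// the very first symbol of the page.
template <typename ChoiceIt, typename ResultIt, typename Level>
std::vector<SymbolChoices> CollectChoices(ResultIt& it, Level symbol_level,
                                          Level word_level) {
  std::vector<SymbolChoices> out;
  do {
    SymbolChoices s;
    s.starts_word = it.IsAtBeginningOf(word_level);
    ChoiceIt ci(it);  // choices for the symbol the iterator points at now
    do {
      const char* text = ci.GetUTF8Text();
      if (text != nullptr) {
        s.choices.emplace_back(text, ci.Confidence());
      }
    } while (ci.Next());
    out.push_back(std::move(s));
  } while (it.Next(symbol_level));
  return out;
}
```

With Tesseract this would presumably be called as CollectChoices<tesseract::ChoiceIterator>(*res_it, tesseract::RIL_SYMBOL, tesseract::RIL_WORD), after checking that res_it is not null. I have not verified it against every Tesseract 5 build, so treat it as a starting point.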