I forgot to mention that I am using the Tesseract C++ API:

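        tesseract::ResultIterator* res_it = api->GetIterator();
        tesseract::PageIteratorLevel level = tesseract::RIL_SYMBOL;

        if (res_it != nullptr) {
          do {
            // Iterate over all choices for the current symbol.
            tesseract::ChoiceIterator ci(*res_it);
            do {
              if (ci.Confidence() >= 0) {
                Choice* c = new Choice();  // application-side object, not a Tesseract type
                // The text is owned by the iterator and must not be deleted.
                const char* ch = ci.GetUTF8Text();
              }
            } while (ci.Next());
          } while (res_it->Next(level));  // advance to the next symbol
        }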
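
One possible way to recover the word boundaries, offered as a sketch rather than a confirmed answer: while stepping through symbols, ask the result iterator whether the current symbol is the first one of a word via `IsAtBeginningOf(tesseract::RIL_WORD)`, and record that flag next to the choices. The names `SymbolChoices`, `CollectChoices` and `JoinFirstChoices` below are hypothetical helpers, not part of Tesseract. The collector is written as a template so the logic compiles without the Tesseract headers; it only assumes the iterator types offer `IsAtBeginningOf`, `Next`, `GetUTF8Text` and `Confidence` as the Tesseract ones do.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical container: one recognized symbol with all of its choices.
struct SymbolChoices {
  bool starts_word = false;                            // true if a new word begins here
  std::vector<std::pair<std::string, float>> choices;  // (text, confidence)
};

// Walks a symbol-level result iterator and collects the choices of every
// symbol, noting where each word begins. ChoiceIt is given explicitly,
// e.g. tesseract::ChoiceIterator; ResIt and Level are deduced.
template <typename ChoiceIt, typename ResIt, typename Level>
std::vector<SymbolChoices> CollectChoices(ResIt& res_it, Level symbol_level,
                                          Level word_level) {
  std::vector<SymbolChoices> out;
  do {
    SymbolChoices sym;
    sym.starts_word = res_it.IsAtBeginningOf(word_level);
    ChoiceIt ci(res_it);
    do {
      const char* text = ci.GetUTF8Text();  // owned by the iterator
      if (text != nullptr) sym.choices.emplace_back(text, ci.Confidence());
    } while (ci.Next());
    out.push_back(std::move(sym));
  } while (res_it.Next(symbol_level));
  return out;
}

// Joins the first choice of every symbol, inserting a space at word starts.
inline std::string JoinFirstChoices(const std::vector<SymbolChoices>& syms) {
  std::string text;
  for (const auto& s : syms) {
    if (s.starts_word && !text.empty()) text += ' ';
    if (!s.choices.empty()) text += s.choices.front().first;
  }
  return text;
}
```

With Tesseract this would presumably be called as `CollectChoices<tesseract::ChoiceIterator>(*res_it, tesseract::RIL_SYMBOL, tesseract::RIL_WORD)` after checking that `res_it` is not null. Line breaks could be detected the same way with `tesseract::RIL_TEXTLINE`.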
jochen....@gmail.com wrote on Thursday, 3 September 2020 at 08:10:53 UTC+2:

> Hi all,
> I am using the new choice iterator in Tesseract 5 to get the confidences 
> of all choices for each symbol of my text. However, spaces (word boundaries) 
> are not reported, so I have no way of knowing when a space falls between 
> symbols. Is there a way to, for example, combine the word iterator with the 
> choice iterator, or any other way to know when a new word starts?
>
