/// <summary>
/// Gets the confidence percentile for the current element at the specified level.
/// </summary>
public float GetConfidence(PageIteratorLevel level)
{
    return TessApi.TessResultIteratorConfidence(Handle, level);
}

I use a custom .NET layer somewhat similar to Tesseract.net. The Handle is returned from TessBaseAPIGetIterator() after a page is recognized, and level defines the element type (PageIteratorLevel.Symbol for characters, PageIteratorLevel.Word for words).

On Friday, October 12, 2018 at 1:20:51 AM UTC-6, Soumik Ranjan Dasgupta wrote:
>
> Could you tell how did you get the confidence percentiles? I would like to
> know that. :)
>
> On Fri, Oct 12, 2018 at 10:10 AM farhad khalafi <farhad....@gmail.com>
> wrote:
>
>> I am totally puzzled with how the confidence reported at Word level
>> relates to the confidences assigned to the characters of the same word.
>>
>> I used the attached TIFF image to recognize a simple MICR line of a
>> check.
>>
>> The recognized text had two words:
>>
>> 495096 700000b01b205xX0eL00007010717
>>
>> The confidence percentiles for the words were 59% and 38% respectively.
>>
>> The confidence percentiles for the characters of the first word were
>> (rounded):
>>
>> 4 97%
>> 9 99%
>> 5 100%
>> 0 100%
>> 9 99%
>> 6 96%
>>
>> I would like to know how with such high confidence scores for individual
>> characters, one can compute the word level confidence at 59%.
>>
>> I ran this test using fast training data for English with no training of
>> my own. I am not worried about the accuracy, just curious about how to
>> interpret confidence scores.
>>
>> Thanks!
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to tesseract-oc...@googlegroups.com.
>> To post to this group, send email to tesser...@googlegroups.com.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/1a83aa4d-5961-4265-9871-1bcac85e73e8%40googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.
>
> --
> Regards,
> Soumik Ranjan Dasgupta
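
A quick sanity check on the numbers quoted in this thread, as a minimal C# sketch. It tries three obvious ways of combining the six character confidences reported for 495096: the mean, the minimum, and the product of the values treated as probabilities. These three formulas are only guesses for illustration, not Tesseract's documented method, and the names ConfidenceCheck, Mean, Min and Product are hypothetical helpers that are not part of any wrapper mentioned here.

```csharp
using System;
using System.Linq;

public static class ConfidenceCheck
{
    // Rounded character confidences (percent) quoted in the thread for "495096".
    public static readonly double[] CharConfidences = { 97, 99, 100, 100, 99, 96 };

    // Candidate 1 (assumption): arithmetic mean of the character confidences.
    public static double Mean(double[] c) => c.Average();

    // Candidate 2 (assumption): the weakest character decides the word score.
    public static double Min(double[] c) => c.Min();

    // Candidate 3 (assumption): treat each value as an independent probability
    // and multiply them, then scale back to a percentage.
    public static double Product(double[] c) =>
        c.Aggregate(1.0, (acc, x) => acc * (x / 100.0)) * 100.0;

    public static void Main()
    {
        Console.WriteLine($"mean    = {Mean(CharConfidences):F1}%");
        Console.WriteLine($"min     = {Min(CharConfidences):F1}%");
        Console.WriteLine($"product = {Product(CharConfidences):F1}%");
        Console.WriteLine("reported word confidence = 59%");
    }
}
```

The three candidates land at roughly 98.5%, 96% and 91%, none of them close to the reported 59%. So the word-level value does not appear to be any of these simple aggregates of the symbol-level values; the thread itself does not say how it is computed.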