On 23.08.2012 13:08, Nick White wrote:
> A great addition to training would be if one dictionary file was
> used, combining freq-words and all-words, and a relative frequency
> probability score was given to each word. This would allow more
> fine-grained scoring based on exactly how likely the word is to
> appear, which would be a win.
>
> Obviously for many cases such word frequency scores might be hard to
> generate, but for others (such as mine) it isn't at all, if the word
> list is generated from a large corpus of existing text.
>
> Would others find such a feature useful? Also, would I be better off
> posting this to the bug tracker?
>

Please post it as an issue (Feature request)[1].

[1] http://code.google.com/p/tesseract-ocr/issues/entry

--
Zdenko
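
For anyone writing up the feature request, here is a rough sketch of how a single word list with relative frequency scores could be generated from a corpus, as the quoted proposal describes. The function names and the "word<TAB>probability" output layout are assumptions made up for illustration; this is not an existing Tesseract dictionary format, and the tokenisation is deliberately naive.

```python
import re
from collections import Counter


def relative_frequencies(text):
    """Return {word: count / total_words} for the words found in text."""
    # Naive tokenisation (assumption): lowercase runs of letters only.
    words = re.findall(r"[^\W\d_]+", text.lower())
    counts = Counter(words)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def format_word_list(freqs):
    """Hypothetical layout: one 'word<TAB>probability' line per word,
    most frequent first, ties broken alphabetically."""
    ordered = sorted(freqs.items(), key=lambda item: (-item[1], item[0]))
    return "\n".join(f"{word}\t{prob:.6f}" for word, prob in ordered)


# Tiny stand-in corpus; a real one would be a large body of text.
corpus = "the cat sat on the mat the end"
freqs = relative_frequencies(corpus)
print(format_word_list(freqs))
```

A list like this would cover what freq-words and all-words do today in one file: every word is present, and the score says how common it is.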