Marc Perkel wrote the following on 11/18/2006 8:24 AM -0800:
>
> decoder wrote:
>> Marc Perkel wrote:
>>> The words file needs a little documentation. Is it limited to single
>>> words or phrases too? What's with the colon and the numbers after
>>> the word?
>>>
>> Phrases are possible too, spaces and numbers are stripped out in both
>> the wordlist and the OCR output before matching :)
>>
>> The colon + the number after it indicates a custom matching threshold
>> for this word. The default threshold is defined in the FuzzyOcr.cf,
>> but it makes sense to override this setting for some specific words
>> which often trigger FPs with the default threshold.
>>
>>
>
> So I should use "strongbuy" in the word list instead of "strong buy" ?

It doesn't matter; either will work exactly the same, because all spaces are stripped out of the word list files and the OCR output before parsing is done (and parsing is case-insensitive as well).

Bill
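
For anyone who wants to see the idea spelled out, here is a rough Python sketch of the normalization described in this thread. It is not FuzzyOcr's actual code (the plugin is written in Perl), and the names normalize, parse_entry and DEFAULT_THRESHOLD are made up for illustration. The default value and the assumption that the threshold is a plain number after the last colon are guesses too; the real default is whatever FuzzyOcr.cf defines.

```python
import re

# Hypothetical placeholder; the real default is set in FuzzyOcr.cf.
DEFAULT_THRESHOLD = 0.25


def normalize(text):
    """Lowercase the text and drop all whitespace and digits."""
    return re.sub(r"[\s\d]+", "", text).lower()


def parse_entry(line):
    """Split a word-list line into (normalized word, threshold).

    Assumes the optional threshold follows the last colon. It is split
    off BEFORE normalizing, because normalizing would strip its digits.
    """
    word, sep, threshold = line.strip().rpartition(":")
    if sep:
        try:
            return normalize(word), float(threshold)
        except ValueError:
            pass  # text after the colon is not a number: treat line as a word
    return normalize(line), DEFAULT_THRESHOLD


# Both spellings collapse to the same key, so either can go in the list.
print(normalize("strong buy") == normalize("strongbuy"))
print(parse_entry("Strong Buy:0.2"))
```

With this, "strong buy", "strongbuy" and "Strong Buy" all end up as the same entry, which is why the spelling used in the word list makes no difference.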