On Wed, 27 May 2009 21:19:58 -0500 René Berber <r.ber...@computer.org> wrote:
> RW wrote: > > > AFAIK though it isn't possible to place a cap on the FuzzyOCR > > score. I don't want to, but I detune it purely to reduce the > > likelyhood of something hitting my discard threshold by OCR alone. > > Isn't that done by setting focr_add_score to 0.0? The total score in > this case should be always focr_base_score, unless other of the rules > gets a hit (wrong content-type wrong file extension, etc.) No, I want the score to increase for each extra word, I just don't want it to rise to a huge score like 25 where it might go over a second threshold for outright discarding. e.g. for a threshold of 5 and a discard level of 20 you might have focr_base_score 4.5 focr_add_score 0.5 focr_max_score 14 focr_autodisable_score 20