On Friday 07 November 2003 06:24 pm, Robert Menschel wrote:
> Or better: what if we specified in the rule a maximum score to accumulate
> to? Maybe something like:
>
> accumbody T_SAMPLE /(?:word1|word2|word3|word4|word5)/i,max=2.5
> describe T_SAMPLE Message has medical words frequently used in spam
> score T_SAMPLE 0.5
[snip]
> I can see that it would be a challenge adding this capability into a GA
> run, but if you can manage it, this would certainly lessen the FP risk.
> Perhaps the GA run could even calculate what the max should be in order
> to avoid FPs according to the corpus.

Or we could have "max=5", and have SA automatically generate rules
"T_SAMPLE_1" through "T_SAMPLE_5"; the GA would then assign their scores,
in an optimal, non-linear fashion.

--
Give a man a match, and he'll be warm for a minute, but set him on fire,
and he'll be warm for the rest of his life.

Advanced SPAM filtering software: http://spamassassin.org

_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
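
To make the difference between the two proposals concrete, here is a rough Python sketch, not SpamAssassin code. It assumes that "accumbody" (a proposed directive in this thread, not an existing one) would add the rule's score once per match up to "max", and that a generated rule T_SAMPLE_n would fire when the body contains at least n matches, with the scores of all fired rules summed. Neither reading is spelled out in the thread. The helper names and the per-rule scores are made up for illustration; real scores would come from the GA run.

```python
import re

# Pattern taken from the quoted rule; word1..word5 are the thread's placeholders.
PATTERN = re.compile(r"(?:word1|word2|word3|word4|word5)", re.IGNORECASE)


def count_hits(body: str) -> int:
    """Count non-overlapping matches of the rule pattern in the message body."""
    return len(PATTERN.findall(body))


def capped_score(hits: int, per_hit: float = 0.5, max_score: float = 2.5) -> float:
    """Proposal 1 (assumed semantics): score grows linearly per hit, capped at max."""
    return min(hits * per_hit, max_score)


def expand_rule_names(name: str, max_hits: int) -> list[str]:
    """Proposal 2: names of the rules SA would generate for max=N."""
    return [f"{name}_{n}" for n in range(1, max_hits + 1)]


def tiered_score(hits: int, scores: dict[str, float], name: str = "T_SAMPLE") -> float:
    """Proposal 2 (assumed semantics): rule NAME_n fires when hits >= n.

    The total is the sum of the scores of all fired rules, so each extra
    hit can add a different amount.
    """
    total = 0.0
    for n, rule in enumerate(expand_rule_names(name, len(scores)), start=1):
        if hits >= n:
            total += scores[rule]
    return total


# Hypothetical per-rule scores standing in for what a GA run might produce.
HYPOTHETICAL_SCORES = {
    "T_SAMPLE_1": 0.25,
    "T_SAMPLE_2": 0.5,
    "T_SAMPLE_3": 1.0,
    "T_SAMPLE_4": 0.5,
    "T_SAMPLE_5": 0.25,
}

if __name__ == "__main__":
    body = "word1 WORD2 and word3 here"
    hits = count_hits(body)
    print(hits, capped_score(hits), tiered_score(hits, HYPOTHETICAL_SCORES))
```

With the capped rule every hit is worth the same 0.5 until the limit is reached. With the generated rules the first hit can be nearly free and the third can carry most of the weight, which is the kind of non-linear shape a single score plus a cap cannot express.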