On Friday 07 November 2003 06:24 pm, Robert Menschel wrote:

> Or better: what if we specified in the rule a maximum score to accumulate
> to? Maybe something like:
>
> accumbody  T_SAMPLE  /(?:word1|word2|word3|word4|word5)/i,max=2.5
> describe   T_SAMPLE  Message has medical words frequently used in spam
> score      T_SAMPLE  0.5

[snip]

> I can see that it would be a challenge adding this capability into a GA
> run, but if you can manage it, this would certainly lessen the FP risk.
> Perhaps the GA run could even calculate what the max should be in order
> to avoid FPs according to the corpus.

Or we could have "max=5" and have SA automatically generate rules
"T_SAMPLE_1" through "T_SAMPLE_5"; the GA would then score each generated
rule separately, so the total could grow in an optimal, non-linear fashion.
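
To make the idea concrete, here is a rough Python sketch of how such an
expansion might behave. Nothing here is existing SA code: the helper names
are made up for illustration, the per-rule scores are invented placeholders
standing in for whatever the GA would assign, and I am assuming that
sub-rule "T_SAMPLE_k" fires when the body contains at least k matches.

```python
import re

# Pattern taken from the proposed T_SAMPLE rule; matching is case-insensitive.
PATTERN = r"(?:word1|word2|word3|word4|word5)"

# Hypothetical scores, standing in for values a GA run might assign.
# They need not grow linearly with the number of matches.
SCORES = {
    "T_SAMPLE_1": 0.1,
    "T_SAMPLE_2": 0.2,
    "T_SAMPLE_3": 0.5,
    "T_SAMPLE_4": 0.8,
    "T_SAMPLE_5": 0.9,
}


def count_hits(pattern, body):
    """Count non-overlapping matches of the rule's regex in the body."""
    return len(re.findall(pattern, body, flags=re.IGNORECASE))


def expand_rule(name, max_hits):
    """Generate the sub-rule names NAME_1 .. NAME_<max_hits>."""
    return [f"{name}_{k}" for k in range(1, max_hits + 1)]


def fired_rules(name, max_hits, hits):
    """Assumption: sub-rule k fires when there are at least k matches."""
    return [rule for k, rule in enumerate(expand_rule(name, max_hits), start=1)
            if hits >= k]


def total_score(fired, scores):
    """Sum the individually assigned scores of the sub-rules that fired."""
    return sum(scores[rule] for rule in fired)


if __name__ == "__main__":
    body = "word1 and word3, then WORD1 again"
    hits = count_hits(PATTERN, body)
    fired = fired_rules("T_SAMPLE", 5, hits)
    print(hits, fired, round(total_score(fired, SCORES), 2))
```

Because matches beyond the fifth fire no further sub-rule, "max=5" still
caps the total contribution, but each step up to the cap can carry its own
weight rather than a fixed 0.5 per match.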

-- 
Give a man a match, and he'll be warm for a minute, but set him on
fire, and he'll be warm for the rest of his life.

Advanced SPAM filtering software: http://spamassassin.org



_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
