Re: my girlfriend is getting ticked :)

Nels Lindquist 30 Mar 2005 22:30:53 -0000

On 30 Mar 2005 at 15:27, Matthew Lenz wrote:

<snip>
 
> here is an example of the headers from an spam that wasn't caught


> X-Spam-Status: No, score=4.1 required=5.0 tests=BAYES_99,HTML_80_90,
>         HTML_FONT_BIG,HTML_MESSAGE,HTML_TITLE_EMPTY,MIME_HTML_ONLY,
>         MSGID_FROM_MTA_ID autolearn=no version=3.0.2

> Ideas where to start (other than having her change her email address
> hehe)

The first thing I did upon installing SA 3.x and running it for a few 
days was to restore some sanity to the BAYES_* scores.

The GA has a tendency to tune down the scores assigned for extreme 
bayes results because they tend to cluster with other positive tests 
(like SURBLs).

That has the unfortunate side effect that when a message comes 
through which for whatever reason fails to trigger much besides 
BAYES_99 (as your example false-positive did), then the assigned 
score will be lower than it should be if you trust bayes, which you 
should be *more* inclined to do for the extreme cases than not.

The default 3.x scores are as follows:

score BAYES_00 0 0 -1.665 -2.599
score BAYES_05 0 0 -0.925 -0.413
score BAYES_20 0 0 -0.730 -1.951
score BAYES_40 0 0 -0.276 -1.096
score BAYES_50 0 0 1.567 0.001
score BAYES_60 0 0 3.515 0.372
score BAYES_80 0 0 3.608 2.087
score BAYES_95 0 0 3.514 2.063
score BAYES_99 0 0 4.070 1.886

Notice that for the fourth column (bayes + network tests enabled) 
BAYES_99 actually scores *lower* than BAYES_80!

I've added the following lines into my local spamassassin 
configuration, based on the scores from SA 2.6x and my own 
experience:

score        BAYES_00 0 0 -4.901 -4.900
score        BAYES_05 0 0 -0.925 -2.599
score        BAYES_20 0 0 -0.730 -1.951
score        BAYES_40 0 0 -0.276 -1.096
score        BAYES_50 0 0 1.567 0.001
score        BAYES_60 0 0 3.515 1.592
score        BAYES_80 0 0 3.608 2.087
score        BAYES_95 0 0 3.514 3.514
score        BAYES_99 0 0 4.070 5.400

Making this single change would have caught your sample false 
positive based solely on the BAYES_99 result.

----
Nels Lindquist <*>
Information Systems Manager
Morningstar Air Express Inc.

Re: my girlfriend is getting ticked :)

Reply via email to