On Mittwoch, 10. Mai 2006 23:41 Matt Kettler wrote:
> Particularly on servers with a site-wide DB used against broadly
> diverse spread of mail, increasing the token limit will improve
> accuracy.
>
> However, this comes at the expense of increased storage needs and
> slower performance. (In particular, expiry takes a LOT longer with
> larger DBs)

DB Files are about 60MB together, so not really big (I just got a 
pricelist with the new 750GB SATA drive from Seagate *g*).

And tonights expiry for server #1:
bayes: synced databases from journal in 11 seconds: 1968 unique entries 
(3059 total entries)

So it's not too long also. Could possibly be longer on a server that 
gets some million mails per day, of course.

> score used is the score the message would have got if:
>       bayes was disabled
>       the AWL was disabled
>       no userconf (ie:black/whitelists) rules were enabled.

Thats good info which should be in the man page.

> Since that message scored 8.7, and derives 3.5 of it's points from
> BAYES_99, it does not surprise me at all the message was not learned.
>
> Also, EVEN if the learning score is over the threshold, SA will not
> learn a message as spam unless:
>       there are at least 3.0 points of header rules
>       there are at least 3.0 points of body rules
>       Existing learning would not place the message in a low bayes
> category (ie: don't learn as spam if the message would have hit
> BAYES_00 otherwise)

This is written in the man page, except the last line with the BAYES_00 
wasn't clear to me from there. Is this valid just for BAYES_00 and 
BAYES_99, or also BAYES_05 and BAYES_95? 

> > Since it's already BAYES_99, you could say
> > "don't bother, I'll be fine" *g* but bayes needs to be trained
> > permanently, because tokens time out...
>
> Also realize that just because the message got BAYES_99 doesn't mean
> there are no tokens in it that can be learned from. Spam mutates. New
> phrases and words creep in. These need to be learned from, even if
> the current message is already BAYES_99.

Yes, this is very valuable info for others also I believe.

Thanks for your help on this,
mfg zmi
-- 
// Michael Monnerie, Ing.BSc    -----      http://it-management.at
// Tel: 0660/4156531                          .network.your.ideas.
// PGP Key:   "lynx -source http://zmi.at/zmi3.asc | gpg --import"
// Fingerprint: 44A3 C1EC B71E C71A B4C2  9AA6 C818 847C 55CB A4EE
// Keyserver: www.keyserver.net                 Key-ID: 0x55CBA4EE

Attachment: pgpe1NOeeBvm0.pgp
Description: PGP signature

Reply via email to