On Mittwoch, 10. Mai 2006 23:41 Matt Kettler wrote: > Particularly on servers with a site-wide DB used against broadly > diverse spread of mail, increasing the token limit will improve > accuracy. > > However, this comes at the expense of increased storage needs and > slower performance. (In particular, expiry takes a LOT longer with > larger DBs)
DB Files are about 60MB together, so not really big (I just got a pricelist with the new 750GB SATA drive from Seagate *g*). And tonights expiry for server #1: bayes: synced databases from journal in 11 seconds: 1968 unique entries (3059 total entries) So it's not too long also. Could possibly be longer on a server that gets some million mails per day, of course. > score used is the score the message would have got if: > bayes was disabled > the AWL was disabled > no userconf (ie:black/whitelists) rules were enabled. Thats good info which should be in the man page. > Since that message scored 8.7, and derives 3.5 of it's points from > BAYES_99, it does not surprise me at all the message was not learned. > > Also, EVEN if the learning score is over the threshold, SA will not > learn a message as spam unless: > there are at least 3.0 points of header rules > there are at least 3.0 points of body rules > Existing learning would not place the message in a low bayes > category (ie: don't learn as spam if the message would have hit > BAYES_00 otherwise) This is written in the man page, except the last line with the BAYES_00 wasn't clear to me from there. Is this valid just for BAYES_00 and BAYES_99, or also BAYES_05 and BAYES_95? > > Since it's already BAYES_99, you could say > > "don't bother, I'll be fine" *g* but bayes needs to be trained > > permanently, because tokens time out... > > Also realize that just because the message got BAYES_99 doesn't mean > there are no tokens in it that can be learned from. Spam mutates. New > phrases and words creep in. These need to be learned from, even if > the current message is already BAYES_99. Yes, this is very valuable info for others also I believe. Thanks for your help on this, mfg zmi -- // Michael Monnerie, Ing.BSc ----- http://it-management.at // Tel: 0660/4156531 .network.your.ideas. // PGP Key: "lynx -source http://zmi.at/zmi3.asc | gpg --import" // Fingerprint: 44A3 C1EC B71E C71A B4C2 9AA6 C818 847C 55CB A4EE // Keyserver: www.keyserver.net Key-ID: 0x55CBA4EE
pgpe1NOeeBvm0.pgp
Description: PGP signature