update Re: quirks with bayes ?

Lucio Chiappetti Thu, 02 Apr 2009 10:18:45 -0700

We did something yesterday in the lines I described, which sort-ofimproved the situation. Also there was a mistake in one of the things Isaid:

When I reported sa-learn -magic saying 30,000 spam 300,000 ham, itoccurred that I was quoting the value for our secondary MX. The primary MXhas a 1:1 ratio, sort of 250,000 to 250,000 !

In fact the daily traffic ratio is 2000:1000 mail through the two servers,while the rejected spam is sort of 1000:150.


Anyhow what we did yesterday ON THE PRIMARY MX was :

 - clean the AWL of all entries with a single occurrence (including
   those  u...@ourdomain|x.y where x.y is NOT our IP)
 - remove from AWL all entries for  u...@ourdomain
 - change whitelist_from to whitelist_from_rcvd
 - sa-learn all the quarantined spam of the last 10 days
 - lower the ham learn threshold to -2

This seemed to reject more spam (all the "casino" one, and most of the"advertising job in bad italian" [I've been told this is a scam actually]... in particular the latter is now getting no longer BAYES_00 but higherprobability ranges)

Today we applied the same to the secondary MX, with the variant that itsa-learned the last 10 days of spam from BOTH servers.

And although the traffic is still in the same ratio, the secondary is nowrejected more of the "advertising job in bad italian" and with even higherbayes ranges (BAYES_80 or higher) than the primary.

We hope this will settle in a short time (my colleague will change thecrontab so that both servers sa-learn the daily quarantine of both)


Thanks to all for the hints.

--
Lucio Chiappetti - INAF/IASF - via Bassini 15 - I-20133 Milano (Italy)
For more info : http://www.iasf-milano.inaf.it/~lucio/personal.html
-----------------------------------------------------------------------
"Nature" on government cuts to research       http://snipurl.com/4erid
"Nature" e i tagli del governo alla ricerca   http://snipurl.com/4erko

update Re: quirks with bayes ?

Reply via email to