We did something yesterday in the lines I described, which sort-of improved the situation. Also there was a mistake in one of the things I said:

When I reported sa-learn -magic saying 30,000 spam 300,000 ham, it occurred that I was quoting the value for our secondary MX. The primary MX has a 1:1 ratio, sort of 250,000 to 250,000 !

In fact the daily traffic ratio is 2000:1000 mail through the two servers, while the rejected spam is sort of 1000:150.

Anyhow what we did yesterday ON THE PRIMARY MX was :

 - clean the AWL of all entries with a single occurrence (including
   those  u...@ourdomain|x.y where x.y is NOT our IP)
 - remove from AWL all entries for  u...@ourdomain
 - change whitelist_from to whitelist_from_rcvd
 - sa-learn all the quarantined spam of the last 10 days
 - lower the ham learn threshold to -2

This seemed to reject more spam (all the "casino" one, and most of the "advertising job in bad italian" [I've been told this is a scam actually] ... in particular the latter is now getting no longer BAYES_00 but higher probability ranges)

Today we applied the same to the secondary MX, with the variant that it sa-learned the last 10 days of spam from BOTH servers.

And although the traffic is still in the same ratio, the secondary is now rejected more of the "advertising job in bad italian" and with even higher bayes ranges (BAYES_80 or higher) than the primary.

We hope this will settle in a short time (my colleague will change the crontab so that both servers sa-learn the daily quarantine of both)

Thanks to all for the hints.

--
Lucio Chiappetti - INAF/IASF - via Bassini 15 - I-20133 Milano (Italy)
For more info : http://www.iasf-milano.inaf.it/~lucio/personal.html
-----------------------------------------------------------------------
"Nature" on government cuts to research       http://snipurl.com/4erid
"Nature" e i tagli del governo alla ricerca   http://snipurl.com/4erko

Reply via email to