We did something yesterday in the lines I described, which sort-of
improved the situation. Also there was a mistake in one of the things I
said:
When I reported sa-learn -magic saying 30,000 spam 300,000 ham, it
occurred that I was quoting the value for our secondary MX. The primary MX
has a 1:1 ratio, sort of 250,000 to 250,000 !
In fact the daily traffic ratio is 2000:1000 mail through the two servers,
while the rejected spam is sort of 1000:150.
Anyhow what we did yesterday ON THE PRIMARY MX was :
- clean the AWL of all entries with a single occurrence (including
those u...@ourdomain|x.y where x.y is NOT our IP)
- remove from AWL all entries for u...@ourdomain
- change whitelist_from to whitelist_from_rcvd
- sa-learn all the quarantined spam of the last 10 days
- lower the ham learn threshold to -2
This seemed to reject more spam (all the "casino" one, and most of the
"advertising job in bad italian" [I've been told this is a scam actually]
... in particular the latter is now getting no longer BAYES_00 but higher
probability ranges)
Today we applied the same to the secondary MX, with the variant that it
sa-learned the last 10 days of spam from BOTH servers.
And although the traffic is still in the same ratio, the secondary is now
rejected more of the "advertising job in bad italian" and with even higher
bayes ranges (BAYES_80 or higher) than the primary.
We hope this will settle in a short time (my colleague will change the
crontab so that both servers sa-learn the daily quarantine of both)
Thanks to all for the hints.
--
Lucio Chiappetti - INAF/IASF - via Bassini 15 - I-20133 Milano (Italy)
For more info : http://www.iasf-milano.inaf.it/~lucio/personal.html
-----------------------------------------------------------------------
"Nature" on government cuts to research http://snipurl.com/4erid
"Nature" e i tagli del governo alla ricerca http://snipurl.com/4erko