Michael Scheidell wrote:
Bayes on cluster begs the question: what if you didn't replicate the bayes tables, and left them server specific?
It may yet take that. :( (If only for overall cluster reliability - any one of the current three machines could handle the current load without any trouble, but we're likely going to stuff ClamAV on them as well.) Unfortunately that means doing mistake-training on *each* machine - autolearn on it's own just doesn't cut it.
I'm dogfooding pretty much that exact scenario on one machine; it's got its own local Bayes DB that I'm hand-training with my own mail.
Since (depending on configurations) some of the servers might get 'spam only' (higher mx records), maybe just take one of the 'valid' bayes tables and manually copy it (sa-learn backup, sa-learn clear, restore) every week or so.
Mmmh. Access is for both inbound and outbound mail, through a load-balancer; the type of mail seen on any one system is pretty much identical over time.