From: Arthur Dent <misc.li...@blueyonder.co.uk>
Date: Sat, 06 Oct 2012 11:03:18 +0100

Hello all,

Following a hard drive crash I am rebuilding my small home server on a Fedora17 platform. One of the casualties of the HD crash was my spam corpus. I had a (very old) backup which happened to include a previous spam corpus so I used that to sa-learn.

All my messages hit BAYES_00.

I don't have many "fresh" spams. I do not run a SMTP server, I simply collect mail for my family and myself from my ISP and other sources using fetchmail. My ISP seem to filter most of the really bad stuff so I get just a trickle of spams (about 1 per day - if that) but even those hit BAYES_00 despite sometimes being identical to a previous FN that had already been learned with sa-learn.

Here is my --dump magic:

...

What - if anything - can I do to improve bayes performance?
Get more spam?

Bayes really isn't going to do well with a limited amount of spam. It does great when correctly trained using lots of spam. But with limited data, not so much.

You could try starting over. It will take 6 months or so to get to 200 spam messages if you are really getting about 1 per day.

You could just turn off Bayes. Or you could just turn Bayes off. I'm almost at the same point with my home email, for the same reason.

-jeff
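
For reference, the two options above (starting over, or turning Bayes off) might look roughly like this in a SpamAssassin config file. This is only a sketch: the file path is an assumption and varies by distribution, and the 200-message figure corresponds to SpamAssassin's default learning thresholds rather than anything specific to this setup.

```
# /etc/mail/spamassassin/local.cf   (path is an assumption; check your install)

# Option 1: turn Bayes off entirely, so no BAYES_* rules are scored.
use_bayes 0

# Option 2: keep Bayes on and start over with a fresh corpus.
# By default Bayes is not used until at least this many messages
# of each kind have been learned (these are the default values):
# use_bayes 1
# bayes_min_spam_num 200
# bayes_min_ham_num 200
```

If starting over, `sa-learn --clear` wipes the existing Bayes database, and `sa-learn --dump magic` shows the current nspam and nham counts so you can see how far the retraining has progressed. At about 1 spam per day, reaching 200 takes roughly 200 days.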