From: Arthur Dent <misc.li...@blueyonder.co.uk>
   Date: Sat, 06 Oct 2012 11:03:18 +0100
   
   Hello all,
   
   Following a hard drive crash I am rebuilding my small home server on a
   Fedora17 platform.
   
   One of the casualties of the HD crash was my spam corpus. I had a (very
   old) backup which happened to include a previous spam corpus so I used
   that to sa-learn.
   
   All my messages hit BAYES_00. 
   
   I don't have many "fresh" spams. I do not run a SMTP server, I simply
   collect mail for my family and myself from my ISP and other sources
   using fetchmail. My ISP seem to filter most of the really bad stuff so I
   get just a trickle of spams (about 1 per day - if that) but even those
   hit BAYES_00 despite sometimes being identical to a previous FN that had
   already been learned with sa-learn.
   
   Here is my --dump magic: ...
   
   What - if anything - can I do to improve bayes performance?

Get more spam?  Bayes really isn't going to do well with limited
amount of spam.  It does great when correctly trained using lots of
spam.  But with limited data, not so much.

You could try starting over.  It will take 6 months or so to get to
200 spam messages if you are really getting about 1 per day.  You
could just turn off Bayes.  Or you could just turn Bayes off.  I'm
almost at the same point with my home email, for the same reason.

-jeff

Reply via email to