On Tue, 31 Mar 2009 06:20:29 -0700 (PDT) John Hardin <jhar...@impsec.org> wrote:
> Your bayes is trained with a strong bias towards ham. It should be > more the other way, since the raw volume of email is biased towards > spam. > ... > I suggest you also consider either disabling autolearn, or push the > learn-as-ham threshold lower. I think it would be nice if SA could handle this automatically e.g. if ham is over-represented then only autolearn ham where p>0.001, and vice versa. At the moment the only way of tweaking this is to vary the thresholds, which is about the worst possible way of doing it.