On 5/5/22 14:28, Dave Wreski wrote:
No, that's how you train your corpora. If you manually look through the headers of mail that's already been processed by your mail system, the ham should be as close to BAYES_00 as possible, and spam should be at BAYES_99. If that's not the case, then it's been trained incorrectly.

/etc/mail/spamassassin/local.cf:
bayes_auto_learnĀ  0
bayes_auto_expire 0

I'd also recommend disabling auto-learn, if you have that enabled.

If you've gone through your corpus manually, and are certain the ham is all good mail and the spam emails are all bad mail, then it might be worth it to dump the existing bayes database and just retrain it with the corresponding mboxes.

I also typically add --progress to sa-learn.

Best,
Dave


Thanks, I appreciate it. I'll tune it a bit.

Thomas

Reply via email to