On 5/5/22 14:28, Dave Wreski wrote:
No, that's how you train your corpora. If you manually look through
the headers of mail that's already been processed by your mail system,
the ham should be as close to BAYES_00 as possible, and spam should be
at BAYES_99. If that's not the case, then it's been trained incorrectly.
/etc/mail/spamassassin/local.cf:
bayes_auto_learnĀ 0
bayes_auto_expire 0
I'd also recommend disabling auto-learn, if you have that enabled.
If you've gone through your corpus manually, and are certain the ham
is all good mail and the spam emails are all bad mail, then it might
be worth it to dump the existing bayes database and just retrain it
with the corresponding mboxes.
I also typically add --progress to sa-learn.
Best,
Dave
Thanks, I appreciate it. I'll tune it a bit.
Thomas