autolearn vs sa-learn / Bayes

Diego Pomatta Thu, 21 Feb 2008 05:33:10 -0800

Hello list.

Does the bayes system use a separate db for the "autolearn" mode?

Today I noticed that my SA bayes has 50 spam and 45 ham mails learned,when I thought the db had a lot more, because bayes IS being used.


# sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
*0.000          0         50          0  non-token data: nspam
0.000          0         45          0  non-token data: nham*

# spamassassin -D --lint
...
[7896] dbg: bayes: found bayes db version 3
[7896] dbg: bayes: DB journal sync: last sync: 0

*[7896] dbg: bayes: not available for scanning, only 50 spam(s) in bayesDB < 200*

...

In the beginning , after setting up SA, bayes was not being used.
I had not trained it with anything yet, but my local.cf had:
*use_bayes 1
use_bayes_rules 1
bayes_auto_learn 1*

Reading the logs I noticed that it was only autolearning spam, not ham.
So I added
*bayes_auto_learn_threshold_nonspam 0.5*
and it started learning ham.

I monitored the logs and at some point incoming mails started triggeringthe BAYES_20, BAYES_50, BAYES_00, BAYES_95, BAYES_99, rules.So I figured it had autlearned the minimum needed amount of ham and spam(200) to start working.Every now and then I use sa-learn to feed some spam and ham to bayes,and I thought I was contributing to the same db. Those must be the 50spam and 45 ham mails.


So what's the deal? :)
/Regards

autolearn vs sa-learn / Bayes

Reply via email to