Jason Carson wrote:
> Hello everyone,
>
> Is it necessary to clear the database...
>
> sa-learn --clear
>
> ...before I run the following to train SpamAssassin's bayesian classifier...
>
> sa-learn --spam /home/jason/.maildir/.Spam/cur/
>

No. That would be ill advised.
Running --clear deletes your entire Bayes database, and rebuilding it can take a long time. I would only advise using it if you've decided all your previous training is worthless, or if your database has become corrupted. Also keep in mind that once you clear the database, SA will deactivate Bayes until at least 200 spam and 200 nonspam messages have been trained (the default minimums).

You don't need to clear anything to make room, either. SpamAssassin makes room automatically when it needs to, by pushing out the least recently used tokens through the expire process. You can trigger that manually with sa-learn --force-expire, but SA normally checks during message processing, about twice a day.
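If you want to see whether Bayes is past the 200/200 threshold mentioned above, something like the following rough sketch may help. The helper names magic_count and bayes_ready are made up for illustration, and the script assumes that `sa-learn --dump magic` prints each counter's value in the third column of lines ending in "nspam" or "nham". Check that against the output on your own system before relying on it.

```shell
#!/bin/sh
# Sketch only: the helpers below are hypothetical, not part of SpamAssassin.

# Read `sa-learn --dump magic` output on stdin and print one counter.
# Assumption: the value is the third whitespace-separated column and
# the counter name (nspam, nham, ntokens) is the last field on the line.
magic_count() {
    awk -v key="$1" '$NF == key { print $3 }'
}

# Succeed only when both counts reach the minimum.
# Assumption: the default minimum of 200 spam and 200 ham is in effect.
bayes_ready() {
    [ "$1" -ge 200 ] && [ "$2" -ge 200 ]
}

# Typical use, without ever running --clear:
#   sa-learn --spam /home/jason/.maildir/.Spam/cur/
#   nspam=$(sa-learn --dump magic | magic_count nspam)
#   nham=$(sa-learn --dump magic | magic_count nham)
#   bayes_ready "${nspam:-0}" "${nham:-0}" && echo "Bayes is active"
```

If the nham count is short, train some nonspam with sa-learn --ham as well. Training spam alone will not get Bayes going.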