On Fri, 8 Sep 2006, Bo Mellberg wrote:
It seems like the exim-users database is being touched regularly, so I'm
guessing that it has been set up by apt-get in some "auto-learning" state.
Yes, you might want to check whatever's running SpamAssassin and
see what user it's running as and also check the configuration
files (probably in /etc/mail/spamassassin) to see where it's
storing the database.
I have earlier trained spam and ham as user "bosse", which is why there is a
working db there as well.
As I am the only user on my system, it really doesn't matter if I use
site-wide or not, but rather how I invoke sa-learn.
Lets say I remove the databases for "bosse" and "root". Is this the proper
way to invoke sa-learn:
1. Log on as user "bosse"
2. sa-learn --showdots --sync --dbpath /var/spool/exim4/.spamassassin --spam
/home/bosse/Maildir/.MissedSpam/cur
Probably not, or at least not the best way.
First of all, you need to run sa-learn as the same user that
runs the filtering. Since you haven't said what user that it
is (whether it's "bosse" or some other user), it's impossible
to say whether that's the correct user to run sa-learn as.
Second, once you determine the correct user, in most cases
sa-learn should consult the same configuration file that
the learning process does, so there shouldn't be a reason to
give --dbpath.
And finally, you don't really need to run --sync every time
you train the Bayes database, although I guess it wouldn't hurt.
If I set up a cron job to do the above I could just toss missed spam into the
"MissedSpam"-folder right?
Yeah, but for efficiency reasons, you'd probably not want
messages in that folder to keep accumulating forever, so you'd
probably want a way to purge them after some period of time.
sa-learn can cope with a situation where you feed it the same
message repeatedly with no harm, but it's still a waste of
CPU cycles.
- Logan