AHA! That brings me to my other question on this mailing list "Which DB is actually used?".

General rule for learning: learn as the user you use to filter mail. Corollary: don't learn as root, since SA never runs as root.
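One rough way to find out which user that is: look at who owns the Bayes directory. This is only a sketch. The function name bayes_owner is made up here, and the sa-learn line in the comment uses a placeholder mail path, not one taken from your system.

```shell
# Print the owner of a Bayes database directory, so you know which
# user to run sa-learn as. Uses only POSIX ls and awk.
bayes_owner() {
    ls -ld -- "$1" | awk '{ print $3 }'
}

# Hypothetical usage; the spam folder path is a placeholder:
#   owner=$(bayes_owner /var/spool/exim4/.spamassassin)
#   sudo -H -u "$owner" sa-learn --spam /path/to/spam-folder
```

The -H on sudo is there so that HOME points at that user's home directory, which is where sa-learn looks for .spamassassin unless told otherwise.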

You can generally set up SA two different ways: site-wide bayes, or individual user bayes. How you do either probably depends in part on what the rest of your mail system is.
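For the site-wide case, the usual approach is to point every user at one shared database in local.cf. A minimal sketch, assuming the exim database from this thread is the one to share; I have not tested it against your setup. Note that bayes_path is a file name prefix (SA appends _toks, _seen and so on), not a directory.

```
# local.cf fragment (assumed location: /etc/spamassassin/local.cf)
# Shared Bayes database; the prefix "bayes" is the SA default file name.
bayes_path /var/spool/exim4/.spamassassin/bayes
# Let the group read and write the files; keep the x bits, since the
# mode may also be used when SA creates directories.
bayes_file_mode 0770
```

Every user that scans or learns then needs read and write access to those files, typically through a common group.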

It isn't quite clear to me how you have your system set up. In any case I don't know enough to tell you exactly what you need to do to get either effect; I could only suggest RTFM, which doesn't always help much.

It looks to me like the exim user is probably the user that your main mail processing is running under. I'd say that was the main database and you were set up site-wide, but that doesn't explain why there is a second database under your usercode.

Perhaps the exim user is the main filter and has been doing auto-learning, and you have also been manually learning mails under your own usercode? But the database sizes are relatively even, so maybe you somehow are processing mail under two different usercodes for different recipients?

       Loren


I did sa-learn --dump magic as root, but root has never been taught any spam or ham.

I did "sa-learn --dump magic --dbpath ...":

max:~# sa-learn --dump magic --dbpath /home/bosse/.spamassassin/
0.000          0          3          0  non-token data: bayes db version
0.000          0       2447          0  non-token data: nspam
0.000          0       1320          0  non-token data: nham
0.000          0     152520          0  non-token data: ntokens
0.000          0 1145221056          0  non-token data: oldest atime
0.000          0 1157497564          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal sync atime
0.000          0 1157464926          0  non-token data: last expiry atime
0.000          0   11059200          0  non-token data: last expire atime delta
0.000          0      21382          0  non-token data: last expire reduction count

max:~# sa-learn --dump magic --dbpath /var/spool/exim4/.spamassassin/
0.000          0          3          0  non-token data: bayes db version
0.000          0        477          0  non-token data: nspam
0.000          0       1966          0  non-token data: nham
0.000          0      69851          0  non-token data: ntokens
0.000          0 1130849621          0  non-token data: oldest atime
0.000          0 1157701646          0  non-token data: newest atime
0.000          0 1157687964          0  non-token data: last journal sync atime
0.000          0          0          0  non-token data: last expiry atime
0.000          0          0          0  non-token data: last expire atime delta
0.000          0          0          0  non-token data: last expire reduction count
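To compare the two dumps side by side, the counts can be pulled out with a small awk filter. A sketch only: magic_summary is a name invented here, and it assumes the column layout shown above, where the third field holds the value.

```shell
# Read "sa-learn --dump magic" output on stdin and print the
# spam, ham and token counts on one line.
magic_summary() {
    awk '
        /non-token data: nspam/   { nspam = $3 }
        /non-token data: nham/    { nham = $3 }
        /non-token data: ntokens/ { ntokens = $3 }
        END { printf "nspam=%s nham=%s ntokens=%s\n", nspam, nham, ntokens }
    '
}

# Usage with the two paths from this thread:
#   sa-learn --dump magic --dbpath /home/bosse/.spamassassin/ | magic_summary
#   sa-learn --dump magic --dbpath /var/spool/exim4/.spamassassin/ | magic_summary
```

Running it again after some mail has arrived shows which database is still growing, which is a reasonable hint as to which one the filter is actually writing to.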

Since the filtering is working quite well, I guess one of these two databases is being used. The user "bosse" is my own user for emails. Is the database in /var/spool/exim4 the one used for auto-learning?

Questions:

1. How can I know which of these two is actually used for filtering the emails for "bosse"?

2. Can I "move" the db in use so that it serves site-wide filtering and auto-learning?
