Abigail, thank you for your suggestions.

I upgraded to version 2.60, thinking that would help, but I still do not see
BAYES_* scores in the X-Spam tag or the logs. Prior to October 9th the logs
showed lots of BAYES_60-99 hits, and I don't know why they stopped. I am stuck:
I have read the man pages and googled. If you can help, thank you in advance.

I run SpamAssassin through amavisd-new v. 20030616.

A grep of my local.cf for bayes shows 
        use_bayes 1
        #use_bayes 0
        bayes_path /var/amavis/.spamassassin/bayes
        bayes_auto_learn 1
        bayes_auto_learn_threshold_nonspam 0.3

The only files I have in the .spamassassin directory are 
        auto-whitelist
        auto-whitelist.db
        bayes_seen
        bayes_toks
        user_prefs

$ sa-learn --dump magic # reports the following. 
        0.000          0          2          0  non-token data: bayes db version
        0.000          0       3583          0  non-token data: nspam
        0.000          0        368          0  non-token data: nham
        0.000          0     139100          0  non-token data: ntokens
        0.000          0 1066246629          0  non-token data: oldest atime
        0.000          0 1066339439          0  non-token data: newest atime
        0.000          0 1066339712          0  non-token data: last journal sync atime
        0.000          0 1066287910          0  non-token data: last expiry atime
        0.000          0          0          0  non-token data: last expire atime delta
        0.000          0          0          0  non-token data: last expire reduction count
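
To make those counters easier to read, here is a minimal Python sketch that
parses `sa-learn --dump magic` output and converts the atime values to dates.
The function names (parse_dump_magic, to_utc) are hypothetical and not part of
SpamAssassin; the sketch assumes the four-column layout shown above and also
tolerates labels that a mail client has wrapped onto the next line.

```python
from datetime import datetime, timezone

MARKER = "non-token data:"

def parse_dump_magic(text):
    """Parse `sa-learn --dump magic` output into a {label: value} dict."""
    magic = {}
    last = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if MARKER in line:
            fields, label = line.split(MARKER, 1)
            # Columns are: probability, spam count, value, atime;
            # the magic value sits in the third column.
            value = int(fields.split()[2])
            last = label.strip()
            magic[last] = value
        elif last is not None:
            # A line without the marker is the wrapped tail of the
            # previous label, so rejoin it.
            value = magic.pop(last)
            last = f"{last} {line}"
            magic[last] = value
    return magic

def to_utc(timestamp):
    """Format a Unix timestamp as a UTC date string."""
    return datetime.fromtimestamp(timestamp, tz=timezone.utc).strftime("%Y-%m-%d %H:%M")
```

Fed the output above, this gives nspam = 3583 and nham = 368 (roughly ten
spam per ham), and to_utc(1066246629) shows the oldest atime falling on
2003-10-15 UTC.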

spamassassin -D --lint prints the following debug lines, plus some other
output:
        debug: Score set 0 chosen.
        debug: running in taint mode? yes
        debug: Running in taint mode, removing unsafe env vars, and resetting PATH
        debug: PATH included '/usr/local/sbin', keeping.
        debug: PATH included '/usr/local/bin', keeping.
        debug: PATH included '/sbin', keeping.
        debug: PATH included '/bin', keeping.
        debug: PATH included '/usr/sbin', keeping.
        debug: PATH included '/usr/bin', keeping.
        debug: PATH included '/usr/X11R6/bin', keeping.
        debug: PATH included '/root/bin', which doesn't exist, dropping.
        debug: Final PATH set to: /usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin
        debug: ignore: using a test message to lint rules
        debug: using "/usr/share/spamassassin" for default rules dir
        debug: using "/etc/mail/spamassassin" for site rules dir
        debug: using "/root/.spamassassin" for user state dir
        debug: using "/root/.spamassassin/user_prefs" for user prefs file
        debug: bayes: 13398 tie-ing to DB file R/O /var/amavis/.spamassassin/bayes_toks
        debug: bayes: 13398 tie-ing to DB file R/O /var/amavis/.spamassassin/bayes_seen
        debug: bayes: found bayes db version 2
        debug: Score set 3 chosen.
        debug: Initialising learner
        debug: dns_available set to yes in config file, skipping test
        debug: is Net::DNS::Resolver available? yes
        debug: all '*From' addrs: [EMAIL PROTECTED]
        debug: running header regexp tests; score so far=0
        debug: running body-text per-line regexp tests; score so far=2.077
        debug: bayes corpus size: nspam = 3583, nham = 368
        debug: uri tests: Done uriRE
        debug: tokenize: header tokens for *F = "U*ignore D*compiling.spamassassin.taint.org D*spamassassin.taint.org D*taint.org D*org"
        debug: tokenize: header tokens for *m = " 1066342148 lint_rules "
        debug: bayes token 'N:H*m:NNNNNNNNNN' => 0.00484684684684685
        debug: bayes token 'somewhat' => 0.0256190476190476
        debug: bayes: score = 0.000852994703415655
        debug: bayes: 13398 untie-ing
        debug: bayes: 13398 untie-ing db_toks
        debug: bayes: 13398 untie-ing db_seen
        debug: Using results from Razor v2.36
        debug: Found Razor2 part: part=0 engine=4 ct=0 cf=0
        debug: leaving helper-app run mode
        debug: Razor2 results: spam? 0  highest cf score: 0
        debug: running raw-body-text per-line regexp tests; score so far=2.077
        debug: running uri tests; score so far=2.077
        debug: uri tests: Done uriRE
        debug: running full-text regexp tests; score so far=2.077
        debug: Razor2 is available
        debug: DCCifd is not available: no r/w dccifd socket found.
        debug: Current PATH is: /usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin
        debug: executable for dccproc was found at /usr/local/bin/dccproc
        debug: DCC is available: /usr/local/bin/dccproc
        debug: entering helper-app run mode
        debug: DCC: got response: X-DCC-Servercave-Metrics: INI-FILTER.interactivenw.com 1183; Body=12772 Fuz1=590933 Fuz2=552319
        debug: leaving helper-app run mode
        debug: all '*To' addrs:
        debug: RBL: success for 1 of 1 queries
        debug: running meta tests; score so far=2.077
        debug: is spam? score=-2.823 required=5 tests=BAYES_00,DATE_MISSING,NO_REAL_NAME
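
To check whether a given run recorded any Bayes rule at all, the final
"is spam?" line can be picked apart mechanically. The following is a rough
Python sketch, not anything shipped with SpamAssassin; bayes_hits is a
hypothetical helper, and it assumes the debug line has the shape shown above.

```python
import re

def bayes_hits(debug_output):
    """Return the BAYES_* rule names listed on the 'is spam?' debug line."""
    # Collapse all whitespace so a 'tests=' list that was wrapped onto
    # the next line is rejoined with the rest of the line.
    flat = " ".join(debug_output.split())
    match = re.search(
        r"is spam\? score=(-?[\d.]+) required=(-?[\d.]+) tests=(\S*)", flat
    )
    if match is None:
        return []
    return [name for name in match.group(3).split(",") if name.startswith("BAYES_")]
```

For the lint run above this returns ['BAYES_00'], so Bayes does fire when
spamassassin is run by hand as root against /var/amavis/.spamassassin.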
        
-----Original Message-----
From: Abigail Marshall [mailto:[EMAIL PROTECTED]
Sent: Thursday, October 16, 2003 2:49 PM
To: Matt Van Gordon; [EMAIL PROTECTED]
Subject: Re: [SAtalk] BAYES_* no longer appear in logs


Hello Matt,

Wednesday, October 15, 2003, 12:23:21 PM, you wrote:

MVG> Does the Bayesian filtering stop working if the
MVG> database becomes too lopsided?

It stops working if the database becomes corrupted, which
can happen in various ways. The first thing you should do is
look in your .spamassassin directory. You should have the
following files related to Bayes:

bayes_journal
bayes_msgcount
bayes_seen
bayes_toks


If you have any other files beginning with bayes_ but with
another extension (such as a lock file), that is a sign of
trouble.
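
That directory check can be scripted. Below is a minimal Python sketch under a
few assumptions: the expected set is just the four names listed above, the
function name unexpected_bayes_files is hypothetical, and the prefix is
loosened from "bayes_" to "bayes" so that lock files named without the
underscore would also be caught.

```python
from pathlib import Path

# The four Bayes files named above; anything else with the prefix is flagged.
EXPECTED = {"bayes_journal", "bayes_msgcount", "bayes_seen", "bayes_toks"}

def unexpected_bayes_files(directory):
    """Return sorted names that start with 'bayes' but are not expected."""
    return sorted(
        entry.name
        for entry in Path(directory).iterdir()
        # Assumption: match on 'bayes' rather than 'bayes_' to catch
        # lock-style files whatever separator they use.
        if entry.name.startswith("bayes") and entry.name not in EXPECTED
    )
```

An empty list means nothing suspicious was found; the check says nothing about
expected files that are missing.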

I have tried various approaches to repairing that sort of
problem, but the only thing that really worked for me was to
delete all the bayes files and start over from scratch.

MVG> Is there a way to check how much ham and spam the
MVG> database has

Use this shell command:

spamassassin -D --lint

It will run a number of tests for Bayes, including a line
that tells you the ham/spam info. On my server, this
currently reads:

debug: bayes corpus size: nspam = 7333, nham = 1281


If there is either insufficient spam or ham you will get a
specific error message telling you that.
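
A quick way to sanity-check that line yourself is sketched below in Python.
The threshold of 200 messages of each kind is an assumption on my part (check
the documentation for your version), and corpus_status is a hypothetical
helper, not a SpamAssassin function.

```python
import re

# Assumption: minimum number of learned messages of each kind before
# Bayes is used; adjust to whatever your version documents.
MIN_MESSAGES = 200

def corpus_status(debug_line, minimum=MIN_MESSAGES):
    """Extract nspam/nham from a 'bayes corpus size' debug line."""
    match = re.search(r"bayes corpus size: nspam = (\d+), nham = (\d+)", debug_line)
    if match is None:
        raise ValueError("no 'bayes corpus size' line found")
    nspam, nham = int(match.group(1)), int(match.group(2))
    return {
        "nspam": nspam,
        "nham": nham,
        "enough_spam": nspam >= minimum,
        "enough_ham": nham >= minimum,
    }
```

With the line quoted above (nspam = 7333, nham = 1281) both flags come out
true under that assumed threshold.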

Bayes periodically expires old tokens on its own; I am not
sure, but believe that it will NOT expire either ham or spam
if that leaves an insufficient corpus.  While the accuracy
of the database might suffer if there is an imbalance, Bayes
should continue to run as long as it has the minimum
requisite of each.

MVG> and what can I do to ensure that the
MVG> Bayesian filtering continues to function.

Just monitor it. I've had problems in the past myself, and
basically nothing seems to have really explained either the
source of my problem or how to fix it.  I think the problem
is that it's possible (at least in versions 2.54/2.55) for
the sa-learn program to continue to run even after
encountering an error (such as a lock file problem) and good
data can get overwritten with bad or incomplete data. I
don't think this is a frequent occurrence, but maybe it
happens from time to time with a heavy load of incoming mail
and more than one process trying to write to the database at
the same time.  That's just a guess in any case, which comes
mostly from the fact that on my system I also seem to see
lock file issues at the same time I have encountered
problems or issues with the Bayes database. This in turn
could result from particular memory limitation issues on the
system -- I don't seem to have the problem any more since I
did a server upgrade that afforded more memory.

-Abigail


