On 06/29/2011 04:59 PM, John Hardin wrote: > On Wed, 29 Jun 2011, J4K wrote: > >> Over the past few months I noticed an increase in 'Start New Employment >> Today | Career Opportunity' style email. The rules I use, that are >> pretty much stock rules, correctly tag the email as spam. Usually the >> Spam score hovers between 5.5 and 6.9. > > Is there some reason you're unwilling or unable to use Bayes? If you > are getting these regularly, then training a few as spam would likely > catch most of the rest. > Hi,
I thought that Baynes was enabled. I have fed spam and ham into sa-learn daily since February 2011. Of course, I might well have been feeding data into a black hole if it is not working. I enabled (I Thought) Baynes as per the local.cf below:- use_bayes 1 bayes_auto_learn 1 bayes_expiry_max_db_size 300000 bayes_auto_expire 1 I read somewhere that this might explain what is into the dB. Not a lot, really. # sa-learn --dump magic 0.000 0 3 0 non-token data: bayes db version 0.000 0 0 0 non-token data: nspam 0.000 0 0 0 non-token data: nham 0.000 0 0 0 non-token data: ntokens 0.000 0 2147483647 0 non-token data: oldest atime 0.000 0 0 0 non-token data: newest atime 0.000 0 0 0 non-token data: last journal sync atime 0.000 0 0 0 non-token data: last expiry atime 0.000 0 0 0 non-token data: last expire atime delta 0.000 0 0 0 non-token data: last expire reduction count nham and nspam = 0 Says it all :( spamassassin -D -lint confirms: Jun 29 20:25:17.682 [26298] dbg: plugin: loading Mail::SpamAssassin::Plugin::Bayes from @INC Jun 29 20:25:17.847 [26298] dbg: config: fixed relative path: /var/lib/spamassassin/3.003001/updates_spamassassin_org/23_bayes.cf Jun 29 20:25:17.847 [26298] dbg: config: using "/var/lib/spamassassin/3.003001/updates_spamassassin_org/23_bayes.cf" for included file Jun 29 20:25:17.848 [26298] dbg: config: read file /var/lib/spamassassin/3.003001/updates_spamassassin_org/23_bayes.cf Jun 29 20:25:19.998 [26298] dbg: plugin: Mail::SpamAssassin::Plugin::Bayes=HASH(0x3e42670) implements 'learner_new', priority 0 Jun 29 20:25:19.998 [26298] dbg: bayes: learner_new self=Mail::SpamAssassin::Plugin::Bayes=HASH(0x3e42670), bayes_store_module=Mail::SpamAssassin::BayesStore::MySQL Jun 29 20:25:20.010 [26298] dbg: bayes: using username: xxxx Jun 29 20:25:20.010 [26298] dbg: bayes: learner_new: got store=Mail::SpamAssassin::BayesStore::MySQL=HASH(0x40bfe48) Jun 29 20:25:20.010 [26298] dbg: plugin: Mail::SpamAssassin::Plugin::Bayes=HASH(0x3e42670) implements 'learner_is_scan_available', priority 0 Jun 29 20:25:20.012 [26298] dbg: bayes: database connection established Jun 29 20:25:20.013 [26298] dbg: bayes: found bayes db version 3 Jun 29 20:25:20.013 [26298] dbg: bayes: Using userid: 77 Jun 29 20:25:20.013 [26298] dbg: bayes: not available for scanning, only 0 spam(s) in bayes DB < 200 Jun 29 20:25:20.027 [26298] dbg: bayes: database connection established Jun 29 20:25:20.027 [26298] dbg: bayes: found bayes db version 3 Jun 29 20:25:20.028 [26298] dbg: bayes: Using userid: 77 Jun 29 20:25:20.028 [26298] dbg: bayes: not available for scanning, only 0 spam(s) in bayes DB < 200 I read the entry on http://wiki.apache.org/spamassassin/SiteWideBayesSetup, and it looks like these are missing in my local.cf: bayes_path /var/spamassassin/bayes/bayes bayes_file_mode 0777 * QUESTION Other than defining these entries (baynes_path baynes_file) into the local.cf, and rerunning sa-learn, is there anything else I should do to get this to work?