Re: re-learning ? was - bayes - large message

2013-04-20 Thread John Hardin
On Sat, 20 Apr 2013, Joe Acquisto-j4 wrote: On 4/20/2013 at 2:00 PM, John Hardin wrote: On Sat, 20 Apr 2013, Joe Acquisto-j4 wrote: In order to send the samples, the user will forward the messages, as an attachment. Each is an individual message to either ham or spam, with the (hopefully) c

Re: Calling spamassassin directly yields very different results than calling spamassassin via amavis-new

2013-04-20 Thread Benny Pedersen
Ben Johnson skrev den 2013-04-20 19:01: Welp, that'll do it! How those four files were set to root:root ownership is beyond me, that means that root have doing some testing :) later amavisd cant write, you should change to amavis user before testing su amavis -c cmd foo but that was cert

Re: Calling spamassassin directly yields very different results than calling spamassassin via amavis-new

2013-04-20 Thread Benny Pedersen
Ben Johnson skrev den 2013-04-20 05:02: Yes, I believe that me and the system always execute SA commands as the "amavis" user. When I was using the SQL setup, I had the following in local.cf: bayes_path /var/lib/amavis/.spamassassin/bayes is amavis have homedir in /var/lib/ ? in gentoo its

Re: re-learning ? was - bayes - large message

2013-04-20 Thread Joe Acquisto-j4
>>> On 4/20/2013 at 2:00 PM, John Hardin wrote: > On Sat, 20 Apr 2013, Joe Acquisto-j4 wrote: > >> In order to send the samples, the user will forward the messages, as an >> attachment. Each is an individual message to either ham or spam, with >> the (hopefully) correct attachment. > > Are yo

Re: Calling spamassassin directly yields very different results than calling spamassassin via amavis-new

2013-04-20 Thread Benny Pedersen
Ben Johnson skrev den 2013-04-20 04:40: By "feed it a few thousand NEW spams", do you mean to scrap the training corpora that I've hand-sorted in favor of starting over? Or do you mean to clear the database and re-run the training script against the corpora? ls /path/to/maildir/spam >/tmp/sp

Re: re-learning ? was - bayes - large message

2013-04-20 Thread John Hardin
On Sat, 20 Apr 2013, Joe Acquisto-j4 wrote: In order to send the samples, the user will forward the messages, as an attachment. Each is an individual message to either ham or spam, with the (hopefully) correct attachment. Are you extracting the attachments off those messages to feed to sa-le

Re: Calling spamassassin directly yields very different results than calling spamassassin via amavis-new

2013-04-20 Thread Ben Johnson
So, the problem seems not to be SQL-specific, as it occurs with SQL or flat-file DB. Upon following Benny Pedersen's advice (to move SA configuration directives from /etc/spamassassin/local.cf to /var/lib/amavis/.spamassassin/user_prefs), I noticed something unusual: $ ls -lah /var/lib/amavis/.sp

Re: re-learning ? was - bayes - large message

2013-04-20 Thread Joe Acquisto-j4
. . . >Ok, I am officially puzzled. > >I setup email addresses on my SA box, to which I and others (they say) > send ham/spam. Then I have cron tasks that feed those emails twice daily to > bayes. And emails the output to my admin mailbox. > >I can review those admin mes

Re: re-learning ? was - bayes - large message

2013-04-20 Thread Jeff Mincy
From: "Joe Acquisto-j4" Date: Sat, 20 Apr 2013 09:10:26 -0400 >>> On 4/19/2013 at 8:33 PM, "Joe Acquisto-j4" wrote: On 4/19/2013 at 8:26 PM, "Joe Acquisto-j4" wrote: >> I thought I had corrected this issue, with someone's assistance, a while > ago: >> >> Apr 19

Re: re-learning ? was - bayes - large message

2013-04-20 Thread Matus UHLAR - fantomas
On 4/19/2013 at 8:33 PM, "Joe Acquisto-j4" wrote: On 4/19/2013 at 8:26 PM, "Joe Acquisto-j4" wrote: I thought I had corrected this issue, with someone's assistance, a while ago: Apr 19 20:21:02.477 [23670] dbg: bayes: expiry completed Apr 19 20:21:02.477 [23670] info: archive-iterator: skip

Re: re-learning ? was - bayes - large message

2013-04-20 Thread RW
On Sat, 20 Apr 2013 09:10:26 -0400 Joe Acquisto-j4 wrote: > > Ok, I am officially puzzled. > > I setup email addresses on my SA box, to which I and others (they > say) send ham/spam. Then I have cron tasks that feed those emails > twice daily to bayes. And emails the output to my admin mai

re-learning ? was - bayes - large message

2013-04-20 Thread Joe Acquisto-j4
>>> On 4/19/2013 at 8:33 PM, "Joe Acquisto-j4" wrote: On 4/19/2013 at 8:26 PM, "Joe Acquisto-j4" wrote: >> I thought I had corrected this issue, with someone's assistance, a while > ago: >> >> Apr 19 20:21:02.477 [23670] dbg: bayes: expiry completed >> Apr 19 20:21:02.477 [23670] info: arc

Re: local score ignored

2013-04-20 Thread Matus UHLAR - fantomas
What output does the command "sa-learn --dump magic" produce? On Fri, 19 Apr 2013, Joe Acquisto-j4 wrote: 0.000 0 1872 0 non-token data: nspam 0.000 0 9184 0 non-token data: nham On 19.04.13 07:41, John Hardin wrote: Generally you want the ra