Re: Re-running SA on an mbox

2009-09-22 Thread John Hardin
On Tue, 22 Sep 2009, Jeff Mincy wrote: From: MySQL Student Date: Tue, 22 Sep 2009 15:38:47 -0400 > Try using a local SA setup for stripping the headers. By local, I mean > don't use your main production SA - run a separate copy with its own > (cut down) configuration and all data bas

Re: Re-running SA on an mbox

2009-09-22 Thread Jeff Mincy
From: MySQL Student Date: Tue, 22 Sep 2009 15:38:47 -0400 > Try using a local SA setup for stripping the headers. By local, I mean > don't use your main production SA - run a separate copy with its own > (cut down) configuration and all data base accesses and UBL calls etc >

Re: Re-running SA on an mbox

2009-09-22 Thread RW
On Tue, 22 Sep 2009 13:03:16 +0100 Martin Gregorie wrote:
> gawk '
> BEGIN { act = "copy" }
> /^X-Spam/ { act = "skip" }
> /^[A-WYZ]/ { act = "copy" }
> {
>

Re: Re-running SA on an mbox

2009-09-22 Thread MySQL Student
Hi, > Try using a local SA setup for stripping the headers. By local, I mean > don't use your main production SA - run a separate copy with its own > (cut down) configuration and all data base accesses and UBL calls etc > turned off. Much better idea, thanks. Thanks for the script, too. Best, Al

Re: Re-running SA on an mbox

2009-09-22 Thread Martin Gregorie
On Mon, 2009-09-21 at 23:18 -0400, MySQL Student wrote: > How can I tell when another process is using the database and when it > is free for my script to use? > > Is there a faster way to run spamassassin just to strip the SA headers? > Try using a local SA setup for stripping the headers. By lo

Re: Re-running SA on an mbox

2009-09-22 Thread Mark Martinec
On Tuesday September 22 2009 06:32:12 Benny Pedersen wrote:
> On Mon 21 Sep 2009 20:33:57 CEST, MySQL Student wrote
> >> but this will invalidate DKIM headers if these headers
> >> are signed; is spamassassin aware of this problem? (in general)
> >
> > Are you saying there is a bug?
>
> partly ye

Re: Re-running SA on an mbox

2009-09-21 Thread Benny Pedersen
On Mon 21 Sep 2009 20:33:57 CEST, MySQL Student wrote
>> but this will invalidate DKIM headers if these headers are signed; is spamassassin aware of this problem? (in general)
> Are you saying there is a bug?
partly yes, it's not a bug as long as you keep the original email but spamassassin --mbox < i

Re: Re-running SA on an mbox

2009-09-21 Thread MySQL Student
Hi,

It's certainly not a fast operation, but using the following will split an mbox into individual messages:

    export FILENO=0
    mkdir msgs
    formail -s sh -c 'cat - >msgs/$FILENO' < mbox-name.mbox

I also created a loop that would strip all the SA headers from the messages:

    for file in *; do ech
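For comparison, the split step can also be sketched with plain awk when formail is not at hand. This is an illustration only, not what the poster ran: the function name split_mbox and the six-digit file naming are assumptions, and it relies on the mbox having its body "From " lines escaped (as ">From "), since any "From " line that opens the file or follows an empty line is treated as the start of a new message.

```shell
# split_mbox MBOX OUTDIR: write each message of MBOX to its own file
# OUTDIR/000001, OUTDIR/000002, ...
# Hypothetical helper; formail -s is what the thread actually recommends.
split_mbox() {
    mbox=$1
    outdir=$2
    mkdir -p "$outdir" || return 1
    awk -v dir="$outdir" '
        # A "From " line at the start of the file or after an empty line
        # begins a new message: close the old file and pick the next name.
        /^From / && (NR == 1 || prev == "") {
            if (out != "") close(out)
            out = sprintf("%s/%06d", dir, ++n)
        }
        # Copy the current line into the current message file, if any.
        out != "" { print > out }
        # Remember this line so the next one can check for a preceding blank.
        { prev = $0 }
    ' "$mbox"
}
```

Something like "split_mbox mbox-name.mbox msgs" would then leave one file per message in msgs/, each still beginning with its "From " separator line.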

Re: Re-running SA on an mbox

2009-09-21 Thread MySQL Student
Hi, > IIRC you previously mentioned using Pine. Just in case you're not aware > the default format for Pine/Alpine is MBX, an extended version of > MBOX. You can tell the difference because MBX mailboxes start with a > dummy email that's hidden by the software. It seems that if you save messages

Re: Re-running SA on an mbox

2009-09-21 Thread MySQL Student
> but this will invalidate DKIM headers if these headers are signed; is
> spamassassin aware of this problem? (in general)

Are you saying there is a bug?

> mutt -f mbox
>
> in mutt save to another folder if misclassified

Yes, I use pine for that, but would like to eliminate as many of the FNs

Re: Re-running SA on an mbox

2009-09-21 Thread MySQL Student
Hi, >> Thank you all for your help. The "mbox split" suggestion is a good >> one. I'll follow that route and post my experience later. > > formail -s is the way to go. I thought about that as a component of procmail. Sounds great. Thanks, Alex

Re: Re-running SA on an mbox

2009-09-21 Thread RW
On Sun, 20 Sep 2009 21:15:14 -0400 MySQL Student wrote: > Hi, > > I have an mbox with about 100 messages in it from a few days ago. > The mbox is a combination of spam and ham. What is the best way to run > SA through these messages again, so I can catch the ones that have > URLs in them that

Re: Re-running SA on an mbox

2009-09-20 Thread Benny Pedersen
On Mon 21 Sep 2009 04:47:23 CEST, MySQL Student wrote
> Wait, my mistake. I read that too fast. Does that work, and rewrite the X-Spam-Status header?
imho spamassassin always removes its own known headers, but only once it can add its own, so yes, the trick is to retest, where you will see if its

Re: Re-running SA on an mbox

2009-09-20 Thread hamann . w
>> >> Hi, >> >> > Do you just want to re-scan the whole mbox and see what rules hit now >> > for research reasons? >> >> That's a good start, but I'd like to see if I can break out the ham to >> train bayes. >> > >> Yeah, that's kind of what I thought. Maybe a program that can split >> each me

Re: Re-running SA on an mbox

2009-09-20 Thread Matt Kettler
Theo Van Dinter wrote: > You probably want "spamassassin --mbox". :) > It won't modify the messages in-place, but you can do something like > "spamassassin --mbox infile > outfile". > > If you're talking about sa-learn, though, it also knows --mbox. > Yes, but he's got mixed spam and nonspam in

Re: Re-running SA on an mbox

2009-09-20 Thread LuKreme
On Sep 20, 2009, at 20:45, MySQL Student wrote:
> Thank you all for your help. The "mbox split" suggestion is a good one. I'll follow that route and post my experience later.

formail -s is the way to go.

Re: Re-running SA on an mbox

2009-09-20 Thread MySQL Student
Hi, >> You probably want "spamassassin --mbox". :) >> It won't modify the messages in-place, but you can do something like >> "spamassassin --mbox infile > outfile". > > My apologies if it wasn't clear, but these messages have already been Wait, my mistake. I read that too fast. Does that work, a

Re: Re-running SA on an mbox

2009-09-20 Thread MySQL Student
Hi, > You probably want "spamassassin --mbox". :) > It won't modify the messages in-place, but you can do something like > "spamassassin --mbox infile > outfile". My apologies if it wasn't clear, but these messages have already been marked by SA. Some are ham, and the rest are FPs that I'd like t

Re: Re-running SA on an mbox

2009-09-20 Thread Matt Kettler
MySQL Student wrote: > Hi, > > >> Do you just want to re-scan the whole mbox and see what rules hit now >> for research reasons? >> > > That's a good start, but I'd like to see if I can break out the ham to > train bayes. > > >> There's no way to (directly) get SA to modify email that's

Re: Re-running SA on an mbox

2009-09-20 Thread Theo Van Dinter
You probably want "spamassassin --mbox". :) It won't modify the messages in-place, but you can do something like "spamassassin --mbox infile > outfile". If you're talking about sa-learn, though, it also knows --mbox. On Sun, Sep 20, 2009 at 9:46 PM, MySQL Student wrote: > Yeah, that's kind of w

Re: Re-running SA on an mbox

2009-09-20 Thread MySQL Student
Hi, > Do you just want to re-scan the whole mbox and see what rules hit now > for research reasons? That's a good start, but I'd like to see if I can break out the ham to train bayes. > There's no way to (directly) get SA to modify email that's already in an > mbox file. The mass-check and sa-le

Re: Re-running SA on an mbox

2009-09-20 Thread Matt Kettler
MySQL Student wrote: > Hi, > > I have an mbox with about 100 messages in it from a few days ago. > The mbox is a combination of spam and ham. What is the best way to run > SA through these messages again, so I can catch the ones that have > URLs in them that weren't on the blacklist at the time t

Re-running SA on an mbox

2009-09-20 Thread MySQL Student
Hi, I have an mbox with about 100 messages in it from a few days ago. The mbox is a combination of spam and ham. What is the best way to run SA through these messages again, so I can catch the ones that have URLs in them that weren't on the blacklist at the time they were received? Must I break