Any tools to gauge bayes accuracy?

2005-02-24 Thread Joe Emenaker
Before I actually write this, I'll aks to see if someone already has done it. On my imap server, I've got two different trash folders, one for ham, one for spam. Nothing new there. However, on the hour, I've got a script that runs sa-learn on them and records three things for each message: -

Re: Time for my monthly beating again...

2005-02-18 Thread Joe Emenaker
Joe Flowers wrote: Very preliminary results are no less than AWESOME. So... how are you implementing the "drifting" spam threshold? - Joe smime.p7s Description: S/MIME Cryptographic Signature

Re: [Dshield] fingerprinting servers before accepting

2004-12-02 Thread Joe Emenaker
Christopher X. Candreva wrote: On Wed, 1 Dec 2004, Robert LeBlanc wrote: This actually sounds like it would be a good public DNSBL. Rather than have everyone fingerprint, the central DNSBL would perform fingerprinting of IPs that are requested and not in the cache, then cache the results. Otherw

Re: [Dshield] fingerprinting servers before accepting

2004-12-01 Thread Joe Emenaker
SM wrote: At 08:54 01-12-2004, John Hardin wrote: However, this sounds like it might be useful in Spamassassin: attempt to contact the sender on port 25, and add a little to the spamminess score if the connection is refused or times out. That was the first thought through my mind when I read the o

A script to auto-adjust spam-threshold. Suggestions?

2004-11-09 Thread Joe Emenaker
For some time now, I've had my email system set up so that I have two "trash" folders. One for ham-trash, and one for spam-trash. Hourly, my system goes through them and uses them to update the Bayes database. However, the script that does this *also* records the overall spam score it received

Re: statistics help needed

2004-10-09 Thread Joe Emenaker
Justin Mason wrote: S1 S2 S3 S4 S5 H1 H2 H3 H4 H5 RULE1: x x x x RULE2: x x x x RULE3: x x x RULE4: x (S1-S5 = 5 spam mails; H1-H5

Re: Rs: SpamAssassin 3.0.0-rc3 RELEASE CANDIDATE available!

2004-09-08 Thread Joe Emenaker
Daniel Quinlan wrote: Joe Emenaker <[EMAIL PROTECTED]> writes: Although others have already given reasons why, I figured I'd toss in the analogy to explain why the dude from CypherTrust in the article is lacking in clue: The SpamAssassin development team has been aware

Rs: SpamAssassin 3.0.0-rc3 RELEASE CANDIDATE available!

2004-09-07 Thread Joe Emenaker
On Friday 03 September 2004 09:17 pm, Theo Van Dinter wrote: - SpamAssassin now includes support for SPF (the Sender Policy Framework, http://spf.pobox.com/). Why bother with this? http://www.infoworld.com/article/04/08/31/HNspammerstudy_1.html Although others have already given reasons why, I

Re: shifting the midpoint between the average spam and average ham

2004-09-04 Thread Joe Emenaker
Justin Mason wrote: that sounds pretty cool. suggestion: get it to record what rules hit and what those rules' scores were. Actually, I'm already doing it for Bayes. When I turned off autolearning and went solely with manual-training of the Bayes db, I was interested to see if Bayes, alone, w

Re: shifting the midpoint between the average spam and average ham

2004-09-03 Thread Joe Emenaker
Joe Flowers wrote: If your "spread" is good and it's just the threshold that needs adjusting, it would be trivial to make a rule that fires on every message and give > it a score equal to the desired difference... Thanks Pierre. That may be what I have to do, if noone has a better idea. Actually,

Re: shifting the midpoint between the average spam and average

2004-09-03 Thread Joe Emenaker
Steve Bertrand wrote: > SA isn't about the "average" it's about the accuracy. If this were the case, then why aren't the spam scores ("*required_hits*") for each message either 1 or 0 and nothing else? Oh, come on now. This is just a troll on a very legitimate and informative statement. No.