Re: mass-check submissions Re: My attempt at re-calculating test scores

2010-12-24 Thread Warren Togami Jr.
I think what he is failing to understand is the scores are irrelevant, as the masscheck is only determining yes or no for each rule across a corpus. Also "current" is referring to the nightly masscheck snapshot of svn trunk including the latest rules. This does remind me however that there is a se

Re: mass-check submissions Re: My attempt at re-calculating test scores

2010-12-24 Thread John Hardin
On Fri, 24 Dec 2010, dar...@chaosreigns.com wrote: On 12/24, John Hardin wrote: If there was some way to capture the score of RBL tests separately from non-RBL tests and use them in place of the current RBL results I might agree you have a point; but if the mass checks ignore the scores that th

Re: mass-check submissions Re: My attempt at re-calculating test scores

2010-12-24 Thread Warren Togami Jr.
http://www.mail-archive.com/users@spamassassin.apache.org/msg69546.html Whitelists have almost zero impact on spamassassin's determination of ham vs spam. Believe me. This is not harmful. If you have any ham corpus it would be extremely useful to spamassassin. We have a severe lack of variety o

Re: mass-check submissions Re: My attempt at re-calculating test scores

2010-12-24 Thread Darxus
On 12/24, John Hardin wrote: > If there was some way to capture the score of RBL tests separately > from non-RBL tests and use them in place of the current RBL results > I might agree you have a point; but if the mass checks ignore the > scores that the current ruleset generates against historical

SA incorrectly tries ipv6 lookups with perl 5.10.1 and force_ipv4 can' t be set in ../local.cf possible fixes?

2010-12-24 Thread Michael Scheidell
For SA, IO-SOCKET-INET6 per module is optional unless you want to parse ipv6 addresses. For amavisd-new, its suggested that you use it. HOWEVER, this can and WILL cause problems with SA during lookups, as SA seems to try to do ipv6 lookups and fails, delaying each lookup by 28 seconds. (even

Re: mass-check submissions Re: My attempt at re-calculating test scores

2010-12-24 Thread John Hardin
On Fri, 24 Dec 2010, dar...@chaosreigns.com wrote: And it still disturbs me that mass checks use anything but the test results at the time the email is originally scored (like from the "tests" value of the X-Spam-Status header). Since I'm sure the time variance improves the accuracy of things

mass-check submissions Re: My attempt at re-calculating test scores

2010-12-24 Thread Darxus
I am one of the editors of the dnswl.org database, and while it is tempting to participate in the mass-checks, considering the effects that would have on the dnswl tests or not, I think it's better to not have that skew. I like having the QA test results to independently evaluate dnswl. I wonder

Issuing rollback DBI Mysql

2010-12-24 Thread Jack L. Stone
Using: FBSD-7.x p5-Mail-SpamAssassin-3.3.1_3 perl-5.8.9_3 mysql-server-5.0.90 I'm getting a lot of these error messages from the perl module Bayes.pm. The SA archives or google shows very little useful about it. Can anyone help? AFAIK, only started with upgrade to SA-3.3. Dec 24 08:54:05 mail sp

Re: DNSBL for email addresses?

2010-12-24 Thread David F. Skoll
On Thu, 23 Dec 2010 18:16:31 -0800 (PST) John Hardin wrote: > The response time for listing an email address in a phishing emailRBL > may be too great to see much benefit. We see a pretty good benefit from the anti-phishing email reply list. It's not so much a good tool to catch phishers as it i

Re: My attempt at re-calculating test scores

2010-12-24 Thread Yet Another Ninja
On 2010-12-24 12:37, Warren Togami Jr. wrote: You have the option of uploading your corpus to the central server to process every night. But most people have privacy concerns about that if it is their own personal ham. For this reason you have the option of running the masscheck script yourself

Re: My attempt at re-calculating test scores

2010-12-24 Thread Warren Togami Jr.
You have the option of uploading your corpus to the central server to process every night. But most people have privacy concerns about that if it is their own personal ham. For this reason you have the option of running the masscheck script yourself every night on your own server and to rsync upl

Re: Whitelist Regex Rules for range of IP's

2010-12-24 Thread Keith De Souza
> > And if you don't want it to be open to forgery > > X-Spam-Relays-Untrusted =~ /^[^\]]+ ip=212\.74\.114\./ > Many thanks guys

Re: DNSBL for email addresses?

2010-12-24 Thread mouss
Le 23/12/2010 22:56, Bob Proulx a écrit : > mouss wrote: >> John Hardin a écrit : >>> Just out of curiosity, why? An MD5 hash is shorter than an SHA hash (an >>> important consideration when you're making lots of DNS queries of the >>> hash), MD5 is computationally lighter than SHA, and MD5 is robu

Re: DNSBL for email addresses?

2010-12-24 Thread Henrik K
On Thu, Dec 23, 2010 at 09:02:50PM -0500, David F. Skoll wrote: > On Thu, 23 Dec 2010 17:08:11 -0800 (PST) > John Hardin wrote: > > > But the known-evil addresses aren't the data being protected (however > > poorly) - the email addresses from your inbound mail that you're > > checking against th

Re: My attempt at re-calculating test scores

2010-12-24 Thread m
Hi, Is this corpora available for public use (e.g using the corpora for their testings)? All I know is that SA has an old public corpora that dates back in 2005. (Sending from BB) --- Mahmoud Khonji -Original Message- From: "Warren Togami Jr." Date: Thu, 23 Dec 2010 12:45:14 To: C