> -----Original Message-----
> From: Scott A Crosby [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, January 06, 2004 7:01 PM
> To: Chris Santerre
> Cc: Spamassassin-Talk (E-mail)
> Subject: Making bigevil faster by finding common prefixes
> 
> 
> On Wed, 24 Dec 2003 10:59:50 -0500, Chris Santerre 
> <[EMAIL PROTECTED]> writes:
> 
> > Updated from last few days. Rules 20-23 have been played 
> with a little.
> > Attempting to make the ruleset faster. I have some issues 
> with doing the
> > rules this way, so I'm testing them out. 
> 
> Are you having trouble doing the conversion automatically? 

Yup ;)

> I can
> describe the algorithm to transform the regexps and to find
> maximum-size prefixes if you (or someone else) wants to
> implement. I've tried, but my perl knowledge for the datastructure
> voodoo is a bit lacking, but the correct algorithm will give a new
> ruleset that will have *identical* results to doing the matches
> sequentially. The program for the conversion should be about 
> 30-50 lines.
> 
*snip*

Basically bigevil has gone completly manual now. Scripts automating it were
essential to the project. Now they become more of a hinderence. I have some
plans for some new scipts to get domain names, but adding anything to the
actual cf file has to be done by hand. 

Same example: domain.net and domain.com is a spammer. But domain.org is not.
I can't just say /domain\.(?:com|net|org)/ because of the FP. Also scripts
don't see things like:

spam2003.com, spam2004.com,..... could be rewritten as /spam200\d\.com/

Or that some of the IP addresses can be broken down to subnets. 

I see what you mean by the tree structure of the rules. Eventually I hope to
get there. I plan to pull out .org,us,info tld's into their own rules. So
I'm changing a few at a time. But at this point, automating any changes
isn't going to work :(

--Chris 



-------------------------------------------------------
This SF.net email is sponsored by: Perforce Software.
Perforce is the Fast Software Configuration Management System offering
advanced branching capabilities and atomic changes on 50+ platforms.
Free Eval! http://www.perforce.com/perforce/loadprog.html
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to