OK: Here's v1.1

## Chris Petersen Rules ## 01-09-04 ## v1.1
## I've noticed that a lot of spams recently have been following the random-words technique,
## with very little "spam" content - often just an image or some obfuscated text. Has anyone
## given any thought to writing up a rule that detects a LACK of punctuation, or a lack of
## short words like a/and/the? It'd be easy for spammers to get around, but at least it would
## keep them out of inboxes for a while.

rawbody   CP_RANDOMWORD_10  /(?:\b(?!(?:from|even|more|were|with)\b)[a-z]{4,12}\s+){10}/
describe  CP_RANDOMWORD_10  string of 10+ random words
score     CP_RANDOMWORD_10  0.5

rawbody   CP_RANDOMWORD_15  /(?:\b(?!(?:from|even|more|were|with)\b)[a-z]{4,12}\s+){15}/
describe  CP_RANDOMWORD_15  string of 15+ random words
score     CP_RANDOMWORD_15  2.5

## EOF

<<Dan>>

| -----Original Message-----
| From: Carl Chipman [mailto:[EMAIL PROTECTED]
| Sent: Friday, January 09, 2004 10:32 AM
| To: Smart,Dan
| Subject: RE: [SAtalk] detecting large collections of random words
|
| Btw, why not rename your rules CP_RANDOMWORDS_10 and _15 so
| that the name is more accurate description? WORDWORD doesn't
| tell much..
|
| Carl Chipman

_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
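
For anyone who wants to poke at the CP_RANDOMWORD patterns before loading them, here is a rough sketch that runs the same regex in Python. SpamAssassin evaluates rules with Perl regexes, and as far as I can tell the constructs used here (\b, the negative lookahead, [a-z]{4,12}, \s+) behave the same way in Python's re module for plain ASCII text. The helper name randomword_re and the sample strings are made up for illustration, and the sketch does not model how rawbody splits a message into chunks.

```python
import re

# Stop words excluded by the negative lookahead in the CP_RANDOMWORD rules.
STOP = "from|even|more|were|with"

def randomword_re(n):
    """Build the CP_RANDOMWORD pattern for a run of n words (hypothetical helper)."""
    return re.compile(r"(?:\b(?!(?:%s)\b)[a-z]{4,12}\s+){%d}" % (STOP, n))

RW10 = randomword_re(10)   # same pattern as CP_RANDOMWORD_10
RW15 = randomword_re(15)   # same pattern as CP_RANDOMWORD_15

# Invented samples: eleven lowercase "random" words, and an ordinary sentence
# full of short words and punctuation.
spammy = ("zebra quartz lumber violin cactus harbor pencil "
          "meadow anchor velvet bottle ")
normal = "I sent the file to you on the day it was due, and we are all set for now. "

for name, text in (("spammy", spammy), ("normal", normal)):
    print(name, bool(RW10.search(text)), bool(RW15.search(text)))
```

Short words, capitals and punctuation all break a run, which is why ordinary prose rarely reaches 10 in a row. Note that the pattern only counts lowercase words, since the rules carry no /i modifier.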