OK:
Here's v1.1

## Chris Petersen Rules
## 01-09-04
## v1.1

## I've noticed that a lot of spams recently have been following the
random-words technique,
## with very little "spam" content - often just an image or some obfuscated
text.  Has anyone
## given any thought to writing up a rule that detects a LACK of
punctuation, or a lack of
## short words like a/and/the?  It'd be easy for spammers to get around, but
at least it would
## keep them out of inboxes for awhile.

rawbody  CP_RANDOMWORD_10
/(?:\b(?!(?:from|even|more|were|with)\b)[a-z]{4,12}\s+){10}/
describe CP_RANDOMWORD_10       string of 10+ random words
score    CP_RANDOMWORD_10       0.5

rawbody  CP_RANDOMWORD_15
/(?:\b(?!(?:from|even|more|were|with)\b)[a-z]{4,12}\s+){15}/
describe CP_RANDOMWORD_15       string of 15+ random words
score    CP_RANDOMWORD_15       2.5

## EOF

<<Dan>>


 

| -----Original Message-----
| From: Carl Chipman [mailto:[EMAIL PROTECTED] 
| Sent: Friday, January 09, 2004 10:32 AM
| To: Smart,Dan
| Subject: RE: [SAtalk] detecting large collections of random words
| 
| Btw, why not rename your rules CP_RANDOMWORDS_10 and _15 so 
| that the name is more accurate description?  WORDWORD doesn't 
| tell much..
| 
| 
| 
| Carl Chipman


-------------------------------------------------------
This SF.net email is sponsored by: Perforce Software.
Perforce is the Fast Software Configuration Management System offering
advanced branching capabilities and atomic changes on 50+ platforms.
Free Eval! http://www.perforce.com/perforce/loadprog.html
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to