On Tue, 2010-11-16 at 16:02 +0000, Mike Bro wrote:
> Hello,
> 
I've seen all but (2) in ham, so be careful with these rules - you
probably want to score them at 0.01 and use them in meta-rules.

> Just wanted to create (or use if they already exist) some rules:
> 1. Email body contains more than 10 newliners without any text between them
>
I get quite a lot of bodies like this - they almost always have a *lot*
of attached photos plus an HTML text body. The multiple spaces are
always in the plaintext body part and seem to correspond to the image
positions in the HTML version - IOW this is something that an M$ mail
reader such as LookOut does when it generates the plaintext version of
the mail.

> 2. Email body contains less than 4 characters
>
I've never seen mail with so short a body - where's the spam payload?

> 3. Email body contains only a line of text and a line with some URL
> 
I've seen bodies, both as spam and ham, that only contain a URL and
whitespace on either side. Something like 

rawbody  BARE_URL /^\s{0,10}(\S{1,80}|http:\S{1,70}\.html)\s{0,10}$/

should catch it, but you'll need variants with the text line before and
after the URL.
 

Martin


Reply via email to