Do you really think it would be a problem if we found more than 3 instances of <i></i> 
in each email to mark it as spam?  Maybe I could just score it lower per instance, say 
.2

There were 58 instances of <i></i> in this email and 63 instances of <b></b>.

-----Original Message-----
From: Larry Gilson [mailto:[EMAIL PROTECTED]
Sent: Wednesday, October 29, 2003 9:35 AM
To: Mark Ritchie; [EMAIL PROTECTED]
Subject: RE: [SAtalk] Exessive HTML Code


Yes, this would be possible.

describe MY_RBDY_EXSV_TAG    MY: Excessive HTML Tags
rawbody  MY_RBDY_EXSV_TAG    /<[bi]><\/[bi]>/i
score    MY_RBDY_EXSV_TAG    4.0

Backhair did not hit because the number of characters within the tag is
fewer than 6.  Creating rules to match fewer than 6 characters within the
tag delimiters creates false positives.  You will most certainly need to
score it how you want rather than the arbitrary number I supplied.

--Larry



-----Original Message-----
From: Mark Ritchie [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, October 29, 2003 8:14 AM
To: [EMAIL PROTECTED]
Subject: [SAtalk] Exessive HTML Code


I've added the popcorn, blackhair, and weeds rules a while back, but I've
noticed that I'm still getting quite a few spams messages per day.  It
always seems to be the most offensive porn and such that makes it through.

Here is an example of the source that get's through

<HTML><html>
<body bgcolor="#FFFFFF">
<p> NOT m<i></i>atu<b></b>re<i></i>,
e<i></i>xpe<i></i>ri<i></i>enc<i></i>ed. NOT cheat<i></i>ing, on t<b></b>he
s<i></i>i<i></i>de. <br>
  <b></b>NOT fli<i></i>rtin<i></i>g <b></b>- t<b></b>h<i></i>i<b></b>s is
2003's fine<i></i>st a<i></i>l<b></b>t<b></b>er<b></b>na<b></b>tive dating
lifes<b></b>tyl<i></i>e <b></b>sol<i></i>ut<i></i>io<i></i>n
w<i></i>it<i></i>h 
  tho<i></i>u<i></i>sands o<i></i>f h<b></b>or<b></b>ny
housewive<b></b>s<i></i>.<br>
  An<i></i>d <i></i>yo<b></b>u, Y<i></i>ES, Y<b></b>O<i></i>U, <i></i>can
g<b></b>e<b></b>t a<b></b>ccess to t<b></b>h<i></i>e
<b></b>wh<b></b>o<i></i>le d<i></i>a<b></b>ta<b></b>ba<i></i>se of
USA-<b></b>loc<b></b>a<i></i>te<i></i>d hou<i></i>sewi<b></b>ves 
  wh<i></i>o'r<i></i>e in <i></i>fo<b></b>r a<b></b>n<i></i>yt<b></b>hing -
f<b></b>or on<b></b>e b<b></b>uck<b></b>!<br>
  HYLF<b></b>! H<b></b>ousew<b></b>iv<i></i>es You<i></i>'d Like <b></b>to
<b></b>Fl<b></b>ir<b></b>t and F<i></i>u<i></i>ck - <b></b>yea<i></i>h,
<b></b>y<i></i>ou'd de<b></b>fin<b></b>i<i></i>tely w<b></b>ant 
  <i></i>to <b></b>do th<i></i>at, <i></i>wh<i></i>y on Ear<b></b>th
<i></i>woul<b></b>d you da<b></b>te, <b></b>anyw<i></i>ays?</p>
<p> <a href="http://www.find-chat.com/cheating/wives.html";>Clic<b></b>k here
<b></b>and p<b></b>a<b></b>y 
  1$ t<b></b>o <b></b>y<b></b>our r<i></i>ow of g<i></i>lor<i></i>ious
ho<b></b>us<b></b>e<i></i>wife affairs!</a> </p>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<p><a href="http://www.a1hostingdirect.com/gone.html";><b></b>No Mor<b></b>e
Thanks</a></p>
</body>
</html></HTML>

Now, as you can see the trick here to fool spamassassin is the <i> and <b>
tags.  Would it be possible to make a rule or adjust the rules so the
<i></i> scores high?  There is nothing inbetween and I'd have to say anyone
sending messages like this is obviously a spammer.

Mark



-------------------------------------------------------
This SF.net email is sponsored by: SF.net Giveback Program.
Does SourceForge.net help you be more productive?  Does it
help you create better code?   SHARE THE LOVE, and help us help
YOU!  Click Here: http://sourceforge.net/donate/
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to