Probably haven't look hard enough, but has anyone 
used a rule to detect (real or pseudo) HTML tags
embedded in text.  Ostensibly they're there to 
throw off bayes and other pattern matchers.

I just put up:

rawbody   TAG_IN_TEXT   /[a-zA-Z0-9]+\<\/*[a-zA-Z0-9]*\>[a-zA-Z0-9]+/
describe  TAG_IN_TEXT
score     TAG_IN_TEXT   1.0

on my test mailer, and it is hitting OK on what I *think* I'm
looking for.

Are there any legitimate uses for te<tags>xt?  If so, I'd like
to score each one individually.  Is there a method for incrementing
the score for each match within a message?

Cheers!
-sam


-------------------------------------------------------
This SF.net email is sponsored by: IBM Linux Tutorials.
Become an expert in LINUX or just sharpen your skills.  Sign up for IBM's
Free Linux Tutorials.  Learn everything from the bash shell to sys admin.
Click now! http://ads.osdn.com/?ad_id=1278&alloc_id=3371&op=click
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to