I have been writing these rules to combine various version of words into one
rule.  For example:

I start with

/prescription/

and do some minor subs to make

/pr[e3][s5]cr[i1]pt[i1][o0]n/i

Then, add gappies

/p.?r.?[e3].?[s5].?c.?r.?[i1].?p.?t.?[i1].?[o0].?n/i

(Side question, what's a good gap class, \A or [-_*/\. ], or... ?)

Is this a good rule?  A better generic rule might be

obfuscatedmatch("prescription")

which processes the string with all the infamous tricks that spammers use
(i and 1, o and 0, a and @, gaps, etc.)  Ideally, obfuscatedmatch would add
more weight than a regular match.  Someone talking innocently about their home
mortgage won't spell it [EMAIL PROTECTED]  :)



-------------------------------------------------------
This SF.Net email sponsored by: Free pre-built ASP.NET sites including
Data Reports, E-commerce, Portals, and Forums are available now.
Download today and enter to win an XBOX or Visual Studio .NET.
http://aspnet.click-url.com/go/psa00100003ave/direct;at.aspnet_072303_01/01
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to