On Sun, Jun 22, 2003 at 11:18:05PM +0900, Tomoyuki Sakurai wrote:

> Hopefully SA2.60 would solve it.

The 2.60 build that I have only partly succeeds at detecting obfuscated HTML.

It *does* detect words split apart by HTML comments.

It *does not* detect words split apart by bogus tags.

It *does not* reconstruct obfuscated HTML for the benefit of the
feature rules or the Bayesian classifier.  (I've been tempted to
pipe the HTML through lynx ...)
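
To illustrate the two splitting tricks above, here is a rough sketch in
Python. It is not how SpamAssassin does it; the names KNOWN_TAGS,
find_word_splits and rejoin are made up for this example, and the tag
whitelist is a deliberately tiny assumption. It counts comments and
unknown tags that sit directly between two letters, then removes them
so the word is whole again for whatever runs next.

```python
import re

# Assumption: a tiny illustrative whitelist; real HTML defines many more tags.
KNOWN_TAGS = {"a", "b", "i", "u", "p", "br", "font", "html", "body",
              "div", "span", "table", "tr", "td"}

# An HTML comment with a letter directly on each side, i.e. inside a word.
COMMENT_IN_WORD = re.compile(
    r"(?<=[A-Za-z])<!--(?:(?!-->).)*-->(?=[A-Za-z])", re.DOTALL)

# Any tag with a letter directly on each side; group 1 is the tag name.
TAG_IN_WORD = re.compile(
    r"(?<=[A-Za-z])</?([A-Za-z][A-Za-z0-9]*)[^<>]*>(?=[A-Za-z])")


def find_word_splits(html):
    """Return (comment_splits, bogus_tag_splits) found inside words."""
    comment_splits = len(COMMENT_IN_WORD.findall(html))
    bogus_splits = sum(1 for m in TAG_IN_WORD.finditer(html)
                       if m.group(1).lower() not in KNOWN_TAGS)
    return comment_splits, bogus_splits


def rejoin(html):
    """Remove in-word comments and unknown in-word tags; keep known tags."""
    html = COMMENT_IN_WORD.sub("", html)
    return TAG_IN_WORD.sub(
        lambda m: m.group(0) if m.group(1).lower() in KNOWN_TAGS else "",
        html)
```

Known tags are left alone on purpose, since something like <b> in the
middle of a word may be legitimate emphasis rather than obfuscation.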

It *does not* remove text with fontcolor == backgroundcolor for
the benefit of the Bayesian classifier.
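
A minimal sketch of the invisible-text filter I have in mind, again in
Python and again only an illustration. The class VisibleText and the
function visible_text are hypothetical names. It assumes the background
is set with <body bgcolor=...>, that text color is set with
<font color=...>, and that the default background is #ffffff. Colors are
compared as lowercased strings, so "white" versus "#ffffff" would slip
through.

```python
from html.parser import HTMLParser


class VisibleText(HTMLParser):
    """Collect text, skipping runs whose font color equals the background."""

    def __init__(self):
        super().__init__()
        self.bgcolor = "#ffffff"  # assumed default background
        self.colors = []          # stack of effective <font> colors
        self.parts = []           # visible text chunks

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "body" and attrs.get("bgcolor"):
            self.bgcolor = attrs["bgcolor"].strip().lower()
        elif tag == "font":
            # A <font> without a color inherits the enclosing one.
            inherited = self.colors[-1] if self.colors else None
            color = attrs.get("color")
            self.colors.append(color.strip().lower() if color else inherited)

    def handle_endtag(self, tag):
        if tag == "font" and self.colors:
            self.colors.pop()

    def handle_data(self, data):
        hidden = bool(self.colors) and self.colors[-1] == self.bgcolor
        if not hidden:
            self.parts.append(data)


def visible_text(html):
    """Return the text of html with same-color-as-background runs dropped."""
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)
```

The output of visible_text would be what gets handed to the classifier,
so the random filler words never become tokens.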

-- 
Gordon V. Cormack     CS Dept, University of Waterloo, Canada N2L 3G1
[EMAIL PROTECTED]            http://cormack.uwaterloo.ca/cormack


_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
