Sometime in the not-too-distant future, SA is going to have to:

A) render HTML to text à la Lynx

B) run the rendered text through a grammar check. I assume that there is an
open source analyzer available.

C) have the GA establish a Bayesian baseline of grammar scores indicative of
SPAM/HAM.
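
To make steps A and B concrete, here is a rough sketch in Python. It is not
SA code and not a real grammar checker. html_to_text() uses only the standard
library's html.parser, and grammar_score() is a hypothetical stand-in that
just measures the fraction of common English function words, on the
assumption that random word strings contain few of them. The names
TextExtractor, html_to_text, grammar_score and FUNCTION_WORDS are made up
for illustration.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping the bodies of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


def html_to_text(html):
    """Step A: reduce HTML to plain text with whitespace collapsed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


# Assumption: a tiny, hand-picked word list standing in for a real analyzer.
FUNCTION_WORDS = {
    "the", "a", "an", "of", "to", "in", "is", "and", "that", "it",
    "for", "on", "with", "as", "this", "be", "are", "was", "you", "your",
}


def grammar_score(text):
    """Step B (crude proxy): share of words that are function words."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    return sum(w in FUNCTION_WORDS for w in words) / len(words)
```

A real analyzer would replace grammar_score(); the only thing the rest of the
idea needs from it is one number per message.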

By tracking the overall grammar score in addition to the actual content, SA
should be able to recognize random word strings as indicative of spam and
apply additional penalty points.
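
One way the baseline and the penalty could fit together, again as a sketch
under stated assumptions rather than anything SA does today: fit a Gaussian
to the grammar scores of known ham and known spam, take the log-odds that a
new score came from the spam distribution, and turn positive log-odds into
capped penalty points. This swaps in a simple Gaussian naive Bayes model in
place of the GA. The function names, the scale and cap parameters, and the
training scores below are all invented for illustration.

```python
import math
from statistics import mean, pvariance


def fit_gaussian(scores):
    """Return (mean, variance), with a small floor on the variance."""
    return mean(scores), max(pvariance(scores), 1e-6)


def log_pdf(x, mu, var):
    """Log density of a normal distribution with mean mu and variance var."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mu) ** 2 / var)


def spam_log_odds(score, spam_model, ham_model, prior_spam=0.5):
    """Log posterior odds of spam versus ham for one grammar score."""
    prior = math.log(prior_spam / (1 - prior_spam))
    return prior + log_pdf(score, *spam_model) - log_pdf(score, *ham_model)


def penalty_points(log_odds, scale=1.0, cap=3.0):
    """Map log-odds to extra points: never negative, never above cap."""
    return max(0.0, min(cap, scale * log_odds))


# Made-up training scores, purely illustrative (higher = more grammatical).
ham_scores = [0.42, 0.38, 0.45, 0.40, 0.36]
spam_scores = [0.10, 0.05, 0.18, 0.12, 0.08]

ham_model = fit_gaussian(ham_scores)
spam_model = fit_gaussian(spam_scores)

# Score a new message whose grammar score is low.
extra = penalty_points(spam_log_odds(0.07, spam_model, ham_model))
```

The cap keeps a single grammar test from deciding the outcome on its own,
which matches the idea of "additional" penalty points on top of the content
tests.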


_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
