Bart Schaefer wrote on Wed, 2 Jul 2003 08:55:34 -0700 (PDT): > Possibly because Mozilla isn't written in Perl? > > Possibly because SA already has its own HTML renderer through which the > messages are passed?
And possibly because Mozilla libaries are not necessarily installed on mail machines? > > > and extract the text as the viewer would see. > > The only way to extract the text as the viewer would see it is to use the > renderer of the viewer's mail client [impossible, given that SA generally > runs before the message is even delivered], because no two renderers > produce the same result in every possible case. > Well, I think one can do three things (all of them): 1. just ignore all extra markup or seemingly markup, so that you just get the text 2. specifically check for those nasty "workarounds" since they are a spam indicator per se. 3. render the message or use some HTML-aware scanner to be able to mark those spam typical things like body="#000000", big fonts etc. Don't know what sa-2.60 does of these. Kai -- Kai Schätzl, Berlin, Germany Get your web at Conactive Internet Services: http://www.conactive.com IE-Center: http://ie5.de & http://msie.winware.org ------------------------------------------------------- This SF.Net email sponsored by: Free pre-built ASP.NET sites including Data Reports, E-commerce, Portals, and Forums are available now. Download today and enter to win an XBOX or Visual Studio .NET. http://aspnet.click-url.com/go/psa00100006ave/direct;at.asp_061203_01/01 _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk