Would it make sense to run the Bayesian analysis not only on individual words, but also on the results of the SpamAssassin regex tests, in order to detect phrases and patterns that a naive word-by-word analysis would miss? And if that worked, would it not then make sense to discard the standard SA scoring system altogether?

My understanding is that SA scores are currently assigned to each test by some statistical analysis of a corpus of spam and non-spam. Feeding the test results to the Bayes engine might allow those scores to be recalculated dynamically, and thus be better tailored to an individual's mail patterns. It seems that this would be relatively easy to implement, and I was wondering whether anyone else thinks it would be worth the trouble.
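
To make the idea concrete, here is a rough sketch in Python of what I have in mind. It is not SpamAssassin code and does not use its API: the rule names and regexes are made-up stand-ins, the class and function names are my own, and the classifier is a simplified naive-Bayes-style scorer that only looks at tokens present in the message. Each regex test that fires simply becomes one more token alongside the words, so its weight is learned from the user's own mail.

```python
import math
import re
from collections import Counter

# Hypothetical stand-ins for regex tests; these are NOT real SpamAssassin rules.
RULES = {
    "RULE:FREE_MONEY": re.compile(r"free\s+money", re.I),
    "RULE:CLICK_HERE": re.compile(r"click\s+here", re.I),
}


def tokenize(text):
    """Return the word tokens plus one pseudo-token per regex rule that fires."""
    tokens = set(re.findall(r"[a-z']+", text.lower()))
    tokens.update(name for name, rx in RULES.items() if rx.search(text))
    return tokens


class TokenBayes:
    """Simplified naive-Bayes-style scorer over words and rule hits (assumed design)."""

    def __init__(self):
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.totals = {"spam": 0, "ham": 0}

    def train(self, text, label):
        """Record one message; label must be 'spam' or 'ham'."""
        self.totals[label] += 1
        self.counts[label].update(tokenize(text))

    def token_log_odds(self, token):
        """Learned weight of a token; this plays the role of a per-user score."""
        # Add-one smoothing so unseen tokens do not produce log(0).
        p_spam = (self.counts["spam"][token] + 1) / (self.totals["spam"] + 2)
        p_ham = (self.counts["ham"][token] + 1) / (self.totals["ham"] + 2)
        return math.log(p_spam / p_ham)

    def spam_probability(self, text):
        """Combine the prior with the weights of all tokens present in the message."""
        log_odds = math.log((self.totals["spam"] + 1) / (self.totals["ham"] + 1))
        log_odds += sum(self.token_log_odds(t) for t in tokenize(text))
        # Clamp to keep math.exp within floating-point range.
        log_odds = max(-700.0, min(700.0, log_odds))
        return 1.0 / (1.0 + math.exp(-log_odds))
```

In this sketch the value returned by token_log_odds("RULE:FREE_MONEY") changes every time the user trains on another message, which is the dynamic recalculation of scores I am describing. A phrase split across whitespace or a line break still triggers the rule token, even though the individual words on their own might look harmless.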

Chris Eykamp



_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
