Would it make sense to do a Bayesian analysis using not only on individual
words, but also the SpamAssassin regex tests in order to detect phrases and
patterns that would be missed using a naive word-by-word analysis? And if
that worked, would it then not make sense to discard the standard SA
scoring system altogether?
My understanding is that SA scores are currently assigned to each test
using some statistical analysis of a corpus of spam and non-spam. Using
the tests with the Bayes engine might allow the assigned scores to be
recalculated dynamically, and thus be better tailored to an individual's
mail patterns. It seems that this would be relatively easy to implement,
and was wondering if anyone else thought it would be worth the trouble.
Chris Eykamp
-------------------------------------------------------
This SF.net email is sponsored by: Microsoft Visual Studio.NET
comprehensive development tool, built to increase your
productivity. Try a free online hosted session at:
http://ads.sourceforge.net/cgi-bin/redirect.pl?micr0003en
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
- [SAtalk] spamd/c configuration Tom Allison
- Re: [SAtalk] Bayesian hybrid Christopher Eykamp
- Re: [SAtalk] Bayesian hybrid Matt Sergeant
- Re: [SAtalk] Bayesian hybrid Vivek Khera