On Fri, 16 Feb 2007, Mark Martinec wrote:
I believe this was once mentioned on a Justin's blog (but can't find
a ref now), the following sounds promising as an additional classifier
to existing bayes (especially since the author comes from the same
organization as myself :)
http://www.virusbtn.com/spambulletin/archive/2006/01/sb200601-trec
ijsSPAM2 PPM-D compression model
Andrej Bratko (Josef Stefan Institute)
Observations:
The most startling observation is that character-based compression models
perform outstandingly well for spam filtering. Commonly used open-source
filters perform well, but not nearly so well or nearly so poorly as
reported elsewhere.
This looks very promising. I found a description of the ijsSPAM2 tool
on the site:
http://www.virusbtn.com/spambulletin/archive/2006/03/sb200603-compression
Remarkable stuff. That would be a helluva nice plugin to have.
Chris St. Pierre
Unix Systems Administrator
Nebraska Wesleyan University
----------------------------
Never send mail to [EMAIL PROTECTED]