-----BEGIN PGP SIGNED MESSAGE----- On Thu, 28 Feb 2002, Michael Moncur wrote:
> While some of the negative scores (like DEAR_SOMEBODY) might have > really turned into legitimate indicators of non-spam, I don't think > any message deserves having its spam score reduced by 8 points by > virtue of its mentioning "www.monsterhut.com", a well-known spam > source. This got me thinking. Does the corpus contain emails discussing spam? If so, that would clearly throw off the evolution of scores. Similarly, I think part of the problem is that everyobody's spam and non-spam may be vastly different. Obviously, the more sources the corpus is drawn from the less this will be an issue, but until then the GA will be craeting scores tuned more accurately for the types of users who submit to the corpus. - -- Public key #7BBC68D9 at | Shane Williams http://pgp.mit.edu/ | =----------------------------------+------------------------------- All syllogisms contain three lines | [EMAIL PROTECTED] Therefore this is not a syllogism | www.gslis.utexas.edu/~shanew -----BEGIN PGP SIGNATURE----- Version: 2.6.2 iQCVAwUBPH5Hj2a83yV7vGjZAQGeyAQAj9xy28o0lNjjkwM4IB6J6IWiIzX+vKeO R4AYcL4ZZt87BohRE5idVFfyTcO8Nw5UoxsWiFNpcWjquXaJ+adLVoaPR/28vHN+ 4w+paMNArYuMybL1Pw6SzPsa8Hd4+UVuv1sb+CgEHC0WFD4IhDRZif/rsjfxk7Q3 tXUST1fvNmw= =fJph -----END PGP SIGNATURE----- _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk