-----BEGIN PGP SIGNED MESSAGE-----

On Thu, 28 Feb 2002, Michael Moncur wrote:

> While some of the negative scores (like DEAR_SOMEBODY) might have
> really turned into legitimate indicators of non-spam, I don't think
> any message deserves having its spam score reduced by 8 points by
> virtue of its mentioning "www.monsterhut.com", a well-known spam
> source.

This got me thinking.  Does the corpus contain emails discussing spam?
If so, that would clearly throw off the evolution of scores.

Similarly, I think part of the problem is that everyobody's spam and
non-spam may be vastly different.  Obviously, the more sources the
corpus is drawn from the less this will be an issue, but until then
the GA will be craeting scores tuned more accurately for the types of
users who submit to the corpus.

- -- 
Public key #7BBC68D9 at            |                 Shane Williams
http://pgp.mit.edu/                |
=----------------------------------+-------------------------------
All syllogisms contain three lines |              [EMAIL PROTECTED]
Therefore this is not a syllogism  |   www.gslis.utexas.edu/~shanew

-----BEGIN PGP SIGNATURE-----
Version: 2.6.2

iQCVAwUBPH5Hj2a83yV7vGjZAQGeyAQAj9xy28o0lNjjkwM4IB6J6IWiIzX+vKeO
R4AYcL4ZZt87BohRE5idVFfyTcO8Nw5UoxsWiFNpcWjquXaJ+adLVoaPR/28vHN+
4w+paMNArYuMybL1Pw6SzPsa8Hd4+UVuv1sb+CgEHC0WFD4IhDRZif/rsjfxk7Q3
tXUST1fvNmw=
=fJph
-----END PGP SIGNATURE-----


_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to