"Michael 'Moose' Dinn" said:

> Has anyone taken a huge spam database and sent it through some sort of
> genetic learning program to see if spam can be identified that way?
> More of a curiosity thing than anything else.

Yes, there's a group in Greece who are doing this (slowly -- they don't
have many people working on it ;).

It works, but when I contacted them it seemed that SpamAssassin had a
better hit rate (probably since we have a lot of human ingenuity on the
case too, instead of just relying on machine learning alone).

There's also "ifile" for MH users, which (iirc) uses Naive Bayesian
classification to classify incoming mail to folders automatically.

One issue those have, is that they need a lot of pre-processing code to
strip off common formatting, headers, footers etc. so your AI code doesn't
just start thinking "it came via sf.net lists, therefore it's spam" when
you're subscribed to spamassassin-sightings, for example. ;)

--j.


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Oh, it's good to be a geek.
http://thinkgeek.com/sf
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to