"Michael 'Moose' Dinn" said:
> Has anyone taken a huge spam database and sent it through some sort of > genetic learning program to see if spam can be identified that way? > More of a curiosity thing than anything else. Yes, there's a group in Greece who are doing this (slowly -- they don't have many people working on it ;). It works, but when I contacted them it seemed that SpamAssassin had a better hit rate (probably since we have a lot of human ingenuity on the case too, instead of just relying on machine learning alone). There's also "ifile" for MH users, which (iirc) uses Naive Bayesian classification to classify incoming mail to folders automatically. One issue those have, is that they need a lot of pre-processing code to strip off common formatting, headers, footers etc. so your AI code doesn't just start thinking "it came via sf.net lists, therefore it's spam" when you're subscribed to spamassassin-sightings, for example. ;) --j. ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Oh, it's good to be a geek. http://thinkgeek.com/sf _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk