> -----Original Message----- > From: Manuel Schmitt [mailto:[EMAIL PROTECTED] > Sent: Thursday, January 15, 2004 8:09 AM > To: [EMAIL PROTECTED] > Subject: [SAtalk] Improvement: Image Recognition as spam criteria > > Dear readers, > > while using Spamassassin for about one month and having a > very good recognition rate I am discovering that spam that > has almost or no text within does not get detected by > SpamAssassin, neither by normal criteria nor by the Bayes > filter. I think because there is not enough information for > the Bayes filter to be able on a reliable decision. > > Now I had the following thought. What about a special image > database which is maintained in a Bayes-like-style. I could > imagine the following. While learning spam with sa-learn, the > learning process could also filter out each image in an > email. Then we have to build a more unique representation of > this image, comparable with a hash value of a string. Then we > store this "hash value" as entry a Bayes-like image database. > When new email comes in we do a comparison of all images in > this email with them in our database. > > I don't have any idea of what resouces this addition check(s) > would consume, but perhaps it would be a nice addition > feature. I was concerned about some ways how to get a hash > value out of an image, if someone is interested, feel free to > contact me. > > Best regards, > Manuel Schmitt >
Drawbacks: 1) bandwidth: you have to pull the image down, on a large spam with a dictionary attack run this could be a significant drag on resources. 2) These images can often be doing double duty as "web bugs" identifying a particular address as valid and setting the address up redistribution to other spammers and more spam. If implemented, it something I would definitely want the option to disable it. Best Regards, Bob ------------------------------------------------------- The SF.Net email is sponsored by EclipseCon 2004 Premiere Conference on Open Tools Development and Integration See the breadth of Eclipse activity. February 3-5 in Anaheim, CA. http://www.eclipsecon.org/osdn _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk