Yes, we could integrate this into all Bayes-like systems. I see no problem, and disk space is no longer a concern.
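To make the idea a bit more concrete, here is a rough sketch of what such a Bayes-like image database might look like. Everything in it is an assumption for illustration, not existing SpamAssassin code: the "hash value" is simply an MD5 of the decoded image bytes, the counts live in an SQLite table that I have called `image_tokens`, and the function names (`image_digests`, `learn`, `spam_probability`) are made up. The score is a plain Laplace-smoothed ratio, not the formula the real Bayes filter uses.

```python
import hashlib
import sqlite3
from email import message_from_bytes
from email.message import EmailMessage


def image_digests(raw_message: bytes) -> list[str]:
    """Return an MD5 hex digest for every image/* part of a raw email."""
    msg = message_from_bytes(raw_message)
    digests = []
    for part in msg.walk():
        if part.get_content_maintype() == "image":
            payload = part.get_payload(decode=True)  # undo base64 etc.
            if payload:
                digests.append(hashlib.md5(payload).hexdigest())
    return digests


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the hypothetical image-token table."""
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS image_tokens ("
        " digest TEXT PRIMARY KEY,"
        " spam_count INTEGER NOT NULL DEFAULT 0,"
        " ham_count INTEGER NOT NULL DEFAULT 0)"
    )
    return db


def learn(db: sqlite3.Connection, raw_message: bytes, is_spam: bool) -> None:
    """Count each image of a message as seen in spam or in ham."""
    # The column name comes from this fixed pair, never from user input.
    column = "spam_count" if is_spam else "ham_count"
    for digest in image_digests(raw_message):
        db.execute(
            "INSERT OR IGNORE INTO image_tokens (digest) VALUES (?)", (digest,)
        )
        db.execute(
            f"UPDATE image_tokens SET {column} = {column} + 1 WHERE digest = ?",
            (digest,),
        )
    db.commit()


def spam_probability(db: sqlite3.Connection, digest: str):
    """Laplace-smoothed spam ratio for a digest, or None if never seen."""
    row = db.execute(
        "SELECT spam_count, ham_count FROM image_tokens WHERE digest = ?",
        (digest,),
    ).fetchone()
    if row is None:
        return None
    spam, ham = row
    return (spam + 1) / (spam + ham + 2)


def demo_message(image_bytes: bytes) -> bytes:
    """Build a small test email carrying one image attachment."""
    msg = EmailMessage()
    msg["Subject"] = "test"
    msg.set_content("see attachment")
    msg.add_attachment(
        image_bytes, maintype="image", subtype="gif", filename="x.gif"
    )
    return msg.as_bytes()
```

Usage would be along the lines of `db = open_db()`, then `learn(db, raw, is_spam=True)` for each training message, and `spam_probability(db, d)` for each digest `d` of an incoming mail. One caveat with this simplification: an MD5 of the raw bytes only matches byte-identical images, so a single changed pixel defeats it. A more robust image representation would have to replace the `hashlib.md5` call.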
Hint: I think we should store these things in an SQL database instead of in the file system, shouldn't we?

----- Original Message -----
From: "snowchyld" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>; "Manuel Schmitt" <[EMAIL PROTECTED]>
Sent: Thursday, January 15, 2004 3:11 PM
Subject: Re: [SAtalk] Improvement: Image Recognition as spam criteria

> I like this idea.
>
> Possibly even design a DCC / pyzor / razor type system which would
> implement a distributed checksum (MD5 etc.) on all images?
>
>
> ----- Original Message -----
> From: "Manuel Schmitt" <[EMAIL PROTECTED]>
> To: <[EMAIL PROTECTED]>
> Sent: Thursday, January 15, 2004 4:08 PM
> Subject: [SAtalk] Improvement: Image Recognition as spam criteria
>
>
> > Dear readers,
> >
> > After using SpamAssassin for about one month with a very good
> > recognition rate, I am discovering that spam with little or no text
> > does not get detected by SpamAssassin, neither by the normal criteria
> > nor by the Bayes filter. I think this is because there is not enough
> > information for the Bayes filter to reach a reliable decision.
> >
> > Now I had the following thought: what about a special image database
> > that is maintained in a Bayes-like style? I could imagine the
> > following. While learning spam with sa-learn, the learning process
> > could also extract each image in an email. Then we have to build a
> > more unique representation of this image, comparable to a hash value
> > of a string. Then we store this "hash value" as an entry in a
> > Bayes-like image database. When new email comes in, we compare all
> > images in this email with those in our database.
> >
> > I don't have any idea what resources these additional check(s) would
> > consume, but perhaps it would be a nice additional feature. I have
> > looked into some ways to get a hash value out of an image; if someone
> > is interested, feel free to contact me.
> >
> > Best regards,
> > Manuel Schmitt
> >
>
>
> -------------------------------------------------------
> This SF.net email is sponsored by: Perforce Software.
> Perforce is the Fast Software Configuration Management System offering
> advanced branching capabilities and atomic changes on 50+ platforms.
> Free Eval! http://www.perforce.com/perforce/loadprog.html
> _______________________________________________
> Spamassassin-talk mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/spamassassin-talk