Yes, we could integrate this into all bayes-like systems, I see no problem
and disk-space is not a problem any more.

Hint: I think we should store these things in a SQL database instead of in
the file system, shouldn't we?


----- Original Message -----
From: "snowchyld" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>; "Manuel Schmitt"
<[EMAIL PROTECTED]>
Sent: Thursday, January 15, 2004 3:11 PM
Subject: Re: [SAtalk] Improvement: Image Recognition as spam criteria


> i like this idea,
>
> possibley even design a DCC / pyzor / razor type system which would
> implement a distributed checksum / md5 etc
> on all image ?
>
>
> ----- Original Message -----
> From: "Manuel Schmitt" <[EMAIL PROTECTED]>
> To: <[EMAIL PROTECTED]>
> Sent: Thursday, January 15, 2004 4:08 PM
> Subject: [SAtalk] Improvement: Image Recognition as spam criteria
>
>
> > Dear readers,
> >
> > while using Spamassassin for about one month and having a very good
> > recognition rate I am discovering that spam that has almost or no text
> > within does not get detected by SpamAssassin, neither by normal criteria
> nor
> > by the Bayes filter. I think because there is not enough information for
> the
> > Bayes filter to be able on a reliable decision.
> >
> > Now I had the following thought. What about a special image database
which
> > is maintained in a Bayes-like-style. I could imagine the following.
While
> > learning spam with sa-learn, the learning process could also filter out
> each
> > image in an email. Then we have to build a more unique representation of
> > this image, comparable with a hash value of a string. Then we store this
> > "hash value" as entry a Bayes-like image database. When new email comes
in
> > we do a comparison of all images in this email with them in our
database.
> >
> > I don't have any idea of what resouces this addition check(s) would
> consume,
> > but perhaps it would be a nice addition feature. I was concerned about
> some
> > ways how to get a hash value out of an image, if someone is interested,
> feel
> > free to contact me.
> >
> > Best regards,
> > Manuel Schmitt
>
>
>
>
> -------------------------------------------------------
> This SF.net email is sponsored by: Perforce Software.
> Perforce is the Fast Software Configuration Management System offering
> advanced branching capabilities and atomic changes on 50+ platforms.
> Free Eval! http://www.perforce.com/perforce/loadprog.html
> _______________________________________________
> Spamassassin-talk mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
>



-------------------------------------------------------
This SF.net email is sponsored by: Perforce Software.
Perforce is the Fast Software Configuration Management System offering
advanced branching capabilities and atomic changes on 50+ platforms.
Free Eval! http://www.perforce.com/perforce/loadprog.html
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to