On Fri, Sep 05, 2003 at 08:55:26PM +0200, Soeren Gerlach wrote:
> Hi,
> 
> > Changes since 2.5x:
> > [...]
> > - new Bayes tweaks -- tokenization of partial address and URI elements
> 
> Are details available about this feature?

Basically URIs are broken into
        - hostname
        - "word" tokens inside hostname, e.g. "www.jmason.org" "jmason.org" "org"
        - elements of the path
        - names of CGI parameters
        - values of CGI parameters

IIRC.

>  I'm asking because I'm currently
> thinking about something new like DCC, but especially and only for URLs in
> spam. Eventually this would become obsolote with this feature.

BTW, I don't think so -- a URI blacklist has been a good idea for a while.

--j.


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to