Good evening, Justin,
        Thanks for taking the time to answer.

On Mon, 17 Nov 2003, Justin Mason wrote:

> -----(Modified PGP heading)-----
> Hash: SHA1
> 
> BTW, given that a URI DB cannot use regular expressions, or patterns,
> would this really be useful?
> 
> Basically with a DB you only gain efficiency when looking up exact
> strings.  So for this to be useful against URIs, you'd have to pick out
> *just* the domain part of the URI and look it up. e.g.:
> 
> http://www.stearns.org/sa-blacklist/sa-blacklist.2003111402.uri.cf
> 
> would be looked up as "www.stearns.org" or "stearns.org".)

        I would be very happy to supply a form of that file that has exact
string hostnames and domains.  As long as SA grabs the URL, drops it to
lowercase, and resolves "&#NNN;"-type obfuscation, I can provide the
host/domain list.  The only time I'd need a bit more help is if the uri is
http://host.domain.org:8888/; can SA strip the port number too?

> I suspect doing this with a DB lookup may not be such a win, compared
> to using a local eval test that parses a config file and creates an
> in-memory hash table.

        Cool.  As I said, I'm glad to provide the list in any format 
needed; how much work would the has table approach be?
        Could we even get smart and do a tree approach, starting at the 
right of the domain, where we check for the TLD first, then once we know 
it's "org", go back to the "org" list, reducing the number of domains we 
have to search through?
        Cheers,
        - Bill

---------------------------------------------------------------------------
        '"It's like watching a baby discover its toes," said New Jersey
coder Nick Nardine. "Not only does the baby think its toes are the
coolest thing in the world, it insists you must discover your toes too.
Watching these guys push Linux on us is endearing and annoying at the
same time."
        -- http://www.wired.com/news/infostructure/0,1377,57353,00.html
(Courtesy of Chris Brenton <[EMAIL PROTECTED]>)
--------------------------------------------------------------------------
William Stearns ([EMAIL PROTECTED]).  Mason, Buildkernel, freedups, p0f,
rsync-backup, ssh-keyinstall, dns-check, more at:   http://www.stearns.org
Linux articles at:                         http://www.opensourcedigest.com
--------------------------------------------------------------------------



-------------------------------------------------------
This SF. Net email is sponsored by: GoToMyPC
GoToMyPC is the fast, easy and secure way to access your computer from
any Web browser or wireless device. Click here to Try it Free!
https://www.gotomypc.com/tr/OSDN/AW/Q4_2003/t/g22lp?Target=mm/g22lp.tmpl
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to