Good evening, Justin, Thanks for taking the time to answer. On Mon, 17 Nov 2003, Justin Mason wrote:
> -----(Modified PGP heading)----- > Hash: SHA1 > > BTW, given that a URI DB cannot use regular expressions, or patterns, > would this really be useful? > > Basically with a DB you only gain efficiency when looking up exact > strings. So for this to be useful against URIs, you'd have to pick out > *just* the domain part of the URI and look it up. e.g.: > > http://www.stearns.org/sa-blacklist/sa-blacklist.2003111402.uri.cf > > would be looked up as "www.stearns.org" or "stearns.org".) I would be very happy to supply a form of that file that has exact string hostnames and domains. As long as SA grabs the URL, drops it to lowercase, and resolves "&#NNN;"-type obfuscation, I can provide the host/domain list. The only time I'd need a bit more help is if the uri is http://host.domain.org:8888/; can SA strip the port number too? > I suspect doing this with a DB lookup may not be such a win, compared > to using a local eval test that parses a config file and creates an > in-memory hash table. Cool. As I said, I'm glad to provide the list in any format needed; how much work would the has table approach be? Could we even get smart and do a tree approach, starting at the right of the domain, where we check for the TLD first, then once we know it's "org", go back to the "org" list, reducing the number of domains we have to search through? Cheers, - Bill --------------------------------------------------------------------------- '"It's like watching a baby discover its toes," said New Jersey coder Nick Nardine. "Not only does the baby think its toes are the coolest thing in the world, it insists you must discover your toes too. Watching these guys push Linux on us is endearing and annoying at the same time." -- http://www.wired.com/news/infostructure/0,1377,57353,00.html (Courtesy of Chris Brenton <[EMAIL PROTECTED]>) -------------------------------------------------------------------------- William Stearns ([EMAIL PROTECTED]). Mason, Buildkernel, freedups, p0f, rsync-backup, ssh-keyinstall, dns-check, more at: http://www.stearns.org Linux articles at: http://www.opensourcedigest.com -------------------------------------------------------------------------- ------------------------------------------------------- This SF. Net email is sponsored by: GoToMyPC GoToMyPC is the fast, easy and secure way to access your computer from any Web browser or wireless device. Click here to Try it Free! https://www.gotomypc.com/tr/OSDN/AW/Q4_2003/t/g22lp?Target=mm/g22lp.tmpl _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk