John D. Hardin wrote:
On Tue, 3 Jul 2007, Matt wrote:
Why can't Spamassassin do like a MD5 hash of any URL's in a
message and check them against a database? I just think it would
help catch things like: geocities.com/spamer123/ or
spamer123.tripod.com and etc.
Too easy to defeat using a URI with random parameters pointing to a
PHP et. al. page that ignores parameters (assuming you include
parameters in the hash) or via wildcard DNS using random third- or
fourth-level hostnames.
Even the path could be made random if they use mod_rewrite or
equivalent. If http://example.com/random/path/gets/ignored always
serves up the contents of salespitch.html, they can generate as many
URLs as they want.
The concept might still be useful for specific known "grey" hosts with a
mix of legit sites and spam sites -- geocities, tripod, blogspot, etc.
--where the URL patterns are known. If you know the pattern is
account.example.com, or example.com/account, then throw away the rest of
the URL and list/lookup the base pattern.
--
Kelson Vibber
SpeedGate Communications <www.speed.net>