On 08/15/2012 06:09 PM, John Hardin wrote:
On Wed, 15 Aug 2012, Kevin A. McGrail wrote:
On 8/15/2012 11:35 AM, John Hardin wrote:
On Wed, 15 Aug 2012, Jim Schueler wrote:
> Is there such a rule?
No, not at present.
> Can I write one (I consider myself a bit of a Perl wonk)?
Sure. Post it here and one of the rule committers can add it to their
sandbox for testing against the masscheck corpora.
The problem with what you suggest is that having a different
description
in the displayed text for a link is extremely common.
If you can manage to write a regex that detects a link tag where the
displayed text differs from the href _AND_ the displayed text is a URL,
then it might be useful. Just triggering on displayed text != href
is not
useful.
I am 99.9% sure I've personally done research on this and it was no
indication of SPAM or HAM. It is equally used in both and anecdotal
checks yesterday confirmed it.
IMO, this is a waste of time you can confirm simply by checking a
couple of legit email newsletters, for example.
Okay, let me modify my suggestion, then: if you can detect where the
displayed text for a link is a URL, and the domain name in that URL does
not match the domain name in the href, then it might be useful.
Does that seem more possible?
Wouldn't URIDetail do this?