Philip Prindeville wrote:
I'm wondering what would be involved in putting in an HTML parser
that could call various rules to check things, like the case of:
<a href="http://www.foo.com/xyzzy">http://www.bar.com/aardvark</a>
where the link disagrees with the text between the anchor tags (yeah, you
could limit it to partial matches on the host-portion)...
This is the functional equivalent of pissing in the wind. If you are
downwind, you are going to get wet.
Anchor text in too many/most cases will not match the HREF. grep is
good, but it isn't good enough to catch all cases without significant
overhead. Anchor text is a descriptor, nothing more than that. It is not
a regurgitation of the link HREF.