John Nagle <[EMAIL PROTECTED]> writes: > For our own purposes, I rewrote "htmldecode" to require a sequence > ending in ";", which means some bogus HTML escapes won't be > recognized, but correct HTML will be processed correctly. What's > general opinion of this behavior? Too strict, or OK?
I think it's fine. In the face of ambiguity (and deviation from the published standards), refuse the temptation to guess. More specifically, I don't see any reason to contort your code to understand some non-entity sequence that would be flagged as invalid by HTML validator tools. -- \ "Those who write software only for pay should go hurt some | `\ other field." -- Erik Naggum, in _gnu.misc.discuss_ | _o__) | Ben Finney -- http://mail.python.org/mailman/listinfo/python-list