Ben de Groot wrote:
The xml header in each metadata.xml states that the content is UTF-8 encoded, and any XML parser has to be able to handle this. Also, when used literally in xml, the 5 special characters & ' " < > cause a well-formedness error, as far as I know.
You're wrong, it's absolutely safe to use ' and " in element contents. You might have to escape them when they appear in an attribute value, though -- example:

<foo bar="baz " test"/>

That's because the XML parser would have no way of telling if the " is supposed to terminate the attribute value or not.
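A quick way to check this is to feed a few documents to a parser. The following is only a minimal sketch using Python's standard xml.etree.ElementTree; the helper name is_well_formed and the sample documents other than the <foo> example are made up for illustration.

```python
import xml.etree.ElementTree as ET


def is_well_formed(document: str) -> bool:
    """Return True if the parser accepts the document, False otherwise."""
    try:
        ET.fromstring(document)
    except ET.ParseError:
        return False
    return True


# Quotes and apostrophes used literally in element content.
in_content = "<foo>it's a \"test\"</foo>"

# The example from the mail: a bare " inside a "-delimited attribute value.
bad_attribute = '<foo bar="baz " test"/>'

# Two ways to fix it: escape the quote, or delimit the value with '.
escaped_attribute = '<foo bar="baz &quot; test"/>'
single_quoted = "<foo bar='baz \" test'/>"

for name, doc in [
    ("in_content", in_content),
    ("bad_attribute", bad_attribute),
    ("escaped_attribute", escaped_attribute),
    ("single_quoted", single_quoted),
]:
    print(name, is_well_formed(doc))

# Both fixed variants carry the same attribute value after parsing.
print(ET.fromstring(escaped_attribute).get("bar"))
print(ET.fromstring(single_quoted).get("bar"))
```

Only bad_attribute should be rejected; the element-content case parses without any escaping.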
of using the apostrophe. So in my opinion, if we want to use xml, we should use unicode properly.
In my opinion, there's absolutely no benefit in using fancy characters everywhere, just because it's allowed by the XML spec.
Cheers,
-jkt

-- 
cd /local/pub && more beer > /dev/mouth