Ben de Groot wrote:
The xml header in each metadata.xml states that the content is UTF-8
encoded, and any XML parser has to be able to handle this. Also, when
used literally in xml, the 5 special characters & ' " < >  cause a
well-formedness error, as far as I know.

You're wrong, it's absolutely safe to use ' and " in element contents. You might have to escape them when in attribute value, though -- example:

        <foo bar="baz &quot; test"/>

That's because the XML parser would have no way of telling if the " is supposed to terminate the attribute value or not.

of using the apostrophe. So in my opinion, if we want to use xml, we
should use unicode properly.

In my opinion, there's absolutely no benefit in using fancy characters everywhere, just because it's allowed by the XML spec.

Cheers,
-jkt

--
cd /local/pub && more beer > /dev/mouth

Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to