Kees Bos added the comment:

OK. I got clarification from the lxml list. It's not a bug, and it's specified
in section 2.4 (http://www.w3.org/TR/REC-xml/#syntax):

The ampersand character (&) and the left angle bracket (<) MUST NOT appear in
their literal form, except when used as markup delimiters, or within a comment,
a processing instruction, or a CDATA section. If they are needed elsewhere,
they MUST be escaped using either numeric character references or the strings
"&amp;" and "&lt;" respectively. The right angle bracket (>) may be
represented using the string "&gt;", and MUST, for compatibility, be escaped
using either "&gt;" or a character reference when it appears in the string
"]]>" in content, when that string is not marking the end of a CDATA section.

In the content of elements, character data is any string of characters which
does not contain the start-delimiter of any markup and does not include the
CDATA-section-close delimiter, "]]>". In a CDATA section, character data is
any string of characters not including the CDATA-section-close delimiter,
"]]>".
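
A minimal sketch of what the quoted rule means in practice, assuming Python's
standard library (xml.etree.ElementTree and xml.sax.saxutils.escape). The
sample strings and the helper is_well_formed are made up for illustration and
are not taken from the original report or from the lxml thread:

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Hypothetical sample text containing all three special characters.
raw = "a < b & c > d"

# escape() rewrites &, < and > as entity references.
escaped = escape(raw)

# A literal ">" is allowed in element content, so this parses as-is.
parsed_gt = ET.fromstring("<root>a > b</root>").text


def is_well_formed(document):
    """Return True if the parser accepts the document, False otherwise."""
    try:
        ET.fromstring(document)
    except ET.ParseError:
        return False
    return True


# Literal "<" and "&" in content are rejected by the parser.
literal_lt_ok = is_well_formed("<root>a < b</root>")
literal_amp_ok = is_well_formed("<root>a & b</root>")

# The escaped form is accepted and parses back to the original text.
escaped_ok = is_well_formed("<root>" + escaped + "</root>")
round_trip = ET.fromstring("<root>" + escaped + "</root>").text
```

The point of the sketch is the asymmetry the spec describes: "<" and "&" have
to be escaped in content, while ">" only needs escaping when it would
otherwise form "]]>".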


Sorry for the confusion and for taking up your time with a bogus report.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue18753>
_______________________________________