On 27 Apr 2017, at 9:58, Hussein Shafie wrote:

On 04/26/2017 11:51 PM, Leif Halvard Silli wrote:
 1.

However, the parser of XMLmind XML editor does not conform to this: I have noticed that when DOCTYPE is excluded, then the document is
    rendered - within XXE - in quirks-mode.

@srcdoc is an attribute of element iframe. See https://www.w3.org/TR/html5/embedded-content-0.html#the-iframe-element

Element iframe is rendered in the styled view by embedding a "Web browser" (part of JavaFX, based on WebKit, I think). As such, we, XMLmind, have very few control on this embedded "Web browser". See http://docs.oracle.com/javase/8/javafx/api/javafx/scene/web/WebView.html

Ok. Is there an easy way to report this bug to Oracle? May be you have easy access? (Hint …)

 2.

Some might think (ahem - I am looking at myself ...) that a srcdoc document would be a XHTML document when inside a XHTML document, and a HTML document when inside a HTML document. But, actually (and of
    course: my bad), a srcdoc document is always a 'text/html' HTML
document. This means, says the HTML5.x spec, that you are allowed to skip the start and end tag of a lot of elements, including the root
    element.

    However, XMLmind XML editor does not implement this: Unless the
    <html> start tag and end tag is included in the srcdoc attriute,
    then XXe erroneously reports that the content is not HTML. By
contrast, the HTML5.x specification says that the @srcdoc does not
    need to contain any code - it could simply simply be this:

    |srcdoc="XMLmind must update its srcdoc HTML mind" |

The HTML parser will then, itself, insert all the lacking elements
    when the srcdoc document is rendered.

Our implementation is based on this spec: https://www.w3.org/TR/html5/embedded-content-0.html#attr-iframe-srcdoc

To be honest: I am not at all certain what the purpose of the HTML specification’s reference to XML’s document production is.

That being said: I question your particular reading of it. See below.

  [ ... snip ...]

Please tell us where to find a more authoritative spec and we'll have no problem changing our implementation accordingly.

You quoted HTML 5.0, however HTML 5.1 became official last year. That being said: HTML 5.1 continues to have the statement that you quoted from HTML 5.0 https://www.w3.org/TR/html/semantics-embedded-content.html#element-attrdef-iframe-srcdoc:

]] For iframe elements in XML documents, the srcdoc attribute, if present, must have a value that matches the production labeled document in the XML specification. [XML] [[

But what does it mean to «have a value that matches the production labeled document in the XML specification»?

Your interpretation of this requirement appears to be that the <html> element must be present. However, this is not the only possible interpreation. (Looking away from the escaping rules:) I would claim that srcdoc="<div>blah</div>" matches the document production (https://www.w3.org/TR/xml/#NT-document) just as well as srcdoc="<html>blah</html>" does!

But, going with your interpreation: Because a self closed <html/> matches XML’s document production (https://www.w3.org/TR/xml/#document), it would be logical to assume that XXe would accept srcdoc="<html/>". However, even then, XXe reports that the @srcdoc doesn’t contain HTML. Only if one does "<html></html> will XXe be satisfied. This not logical. And there are other illogical issues with your current implementation. For example, XXe considers it entirely OK to do all kinds of weird things, such as srcdoc="<html><p><p>blah</p></html>", which does not match XML’s document production - but which is a valid and meaningful, as a HTML syntax document.

Anway, just below that statement follows Example 62 https://www.w3.org/TR/html/semantics-embedded-content.html#example-add3a347, which, in the final paragraph, states:

]] [...] only the contents of the body element need appear literally in the syntax. The other elements are still present, but only by implication.[[

One could, of course, be tempted to jump to the conclusion that this is valid only for HTML syntax documents. But, at the very least, that is not how it has been implemented in Web browsers that support the both @srcdoc and XHTML.

I think that the purpose of the reference to XML’s document production is twofold:

Firstly, it clarifies that a srcdoc document does NOT have to be a XHTML document. Thus: Its purpose is to LOWER the authoring requirements compared with the authoring requirements of XHTML syntax documents. For eample, there does not need to be a namespace declaration (as this is not required by the document production. And also, it does not need to begin with the HTML root document, since this is also not required by the document production (if you do <div>blah</div>, then <div> is the root, as per the document production).

Secondly, the reference to the XML document production means that elements must be well-formed, which INCREASES the authoring requirements compared with HTML syntax documents. But given that @srcdoc documents are supposed to rendered as text/html documents anyway, why, at all, is that an requirement? I must confess that I am not certain. However, my guess is that the purpose is to handle ‘pure’ XML editors/parser: XML editors/parsers are probably able transform the content of the srcdoc attribute into a valid XML-document, but they might not be able to transform the document into a valid HTML document (since that would mean that they would have to embed two parsers - an XML parser and an HTML parser).

If I am correct about the second purpose, then, I must say, that I think there is reason to question the spec’s requirement to conform with the XML document production.
--
leif halvard sillli

--
XMLmind XML Editor Support List
xmleditor-support@xmlmind.com
http://www.xmlmind.com/mailman/listinfo/xmleditor-support

Reply via email to