Perfect! Thank you very very much!
On 05/05/17 at 19:28, monty wrote:
This should be fixed now. Thanks for the bug report.
Sent: Wednesday, May 03, 2017 at 4:44 PM
From: "Udo Schneider" <udo.schnei...@homeaddress.de>
To: pharo-users@lists.pharo.org
Subject: [Pharo-users] XMLHTMLParser Entity Handling oddity
All,
I'm hitting an interesting issue with XMLHTMLParser and I'm not even
sure whether this is a bug or intended behaviour. An HTML entity in a
String is either resolved or left escaped, depending on the enclosing tag
(title in the head vs. body):
doc := XMLHTMLParser parse:
'<html><head><title>&Uuml;</title></head><body>&Uuml;</body></html>'.
(doc findElementNamed: 'title') contentString. "'&Uuml;'"
(doc findElementNamed: 'body') contentString. "'Ü'"
As I understand it, and according to
https://www.w3.org/TR/html401/struct/global.html#h-7.4.2, entities are
allowed in the title tag and should IMHO be resolved.
So both should return 'Ü' in this case.
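The expected behaviour can be sketched as a small check, using only the calls shown above. This is a minimal sketch, not a definitive test: it assumes the entity in the input is &Uuml;, and that it is evaluated somewhere `assert:` is available (a Playground or a TestCase method).

```smalltalk
"Sketch only: assumes the input entity is &Uuml; and that assert: is available
 in the evaluating context (Playground or TestCase)."
| doc expected |
doc := XMLHTMLParser parse:
	'<html><head><title>&Uuml;</title></head><body>&Uuml;</body></html>'.
expected := 'Ü'.

"Both elements should yield the resolved character, not the escaped entity."
self assert: (doc findElementNamed: 'title') contentString = expected.
self assert: (doc findElementNamed: 'body') contentString = expected.
```

With the behaviour described above, the first assertion would fail and the second would pass.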
Any pointers?
CU,
Udo