This should be fixed now. Thanks for the bug report.
> Sent: Wednesday, May 03, 2017 at 4:44 PM
> From: "Udo Schneider" <udo.schnei...@homeaddress.de>
> To: pharo-users@lists.pharo.org
> Subject: [Pharo-users] XMLHTMLParser Entity Handling oddity
>
> All,
>
> I'm hitting an interesting issue with XMLHTMLParser and I'm not even
> sure if this is a bug or intended behaviour. Given an HTML Entity in a
> String it's resolved or quoted depending on the tag (header or section tag):
>
> doc := XMLHTMLParser parse:
> '<html><head><title>Ü</title></head><body>Ü</body></html>'.
> (doc findElementNamed: 'title') contentString. "'Ü'"
> (doc findElementNamed: 'body') contentString. "'Ü'"
>
> In my understanding and according to
> https://www.w3.org/TR/html401/struct/global.html#h-7.4.2 Entities in the
> title tag are allowed and should IMHO be resolved.
>
> So both should return 'Ü' in this case.
>
> Any pointers?
>
> CU,
>
> Udo
>
>
>