This should be fixed now. Thanks for the bug report.

> Sent: Wednesday, May 03, 2017 at 4:44 PM
> From: "Udo Schneider" <udo.schnei...@homeaddress.de>
> To: pharo-users@lists.pharo.org
> Subject: [Pharo-users] XMLHTMLParser Entity Handling oddity
>
> All,
> 
> I'm hitting an interesting issue with XMLHTMLParser and I'm not even 
> sure if this is a bug or intended behaviour. Given an HTML Entity in a 
> String it's resolved or quoted depending on the tag (header or section tag):
> 
> doc := XMLHTMLParser parse: 
> '<html><head><title>Ü</title></head><body>Ü</body></html>'.
> (doc findElementNamed: 'title') contentString. "'Ü'"
> (doc findElementNamed: 'body') contentString.  "'Ü'"
> 
> In my understanding and according to 
> https://www.w3.org/TR/html401/struct/global.html#h-7.4.2 Entities in the 
> title tag are allowed and should IMHO be resolved.
> 
> So both should return 'Ü' in this case.
> 
> Any pointers?
> 
> CU,
> 
> Udo
> 
> 
>

Reply via email to