Problem with minidom and special chars in HTML

2005-02-22 Thread Horst Gutmann
Hi :-) I currently have quite a big problem with minidom and special chars (for example ü) in HTML. Let's say I have following input file: -- http://www.w3.org/TR/html4/strict.dtd";> ü -- And fol

Re: Problem with minidom and special chars in HTML

2005-02-22 Thread Horst Gutmann
Fredrik Lundh wrote: umm. doesn't that doctype point to an SGML DTD? even if minidom did fetch external DTD's (I don't think it does), it would probably choke on that DTD. running your documents through "tidy -asxml -numeric" before parsing them as XML might be a good idea... http://tidy.sour

Re: Problem with minidom and special chars in HTML

2005-02-23 Thread Horst Gutmann
Jarek Zgoda wrote: Horst Gutmann napisał(a): I currently have quite a big problem with minidom and special chars (for example ü) in HTML. Let's say I have following input file: -- http://www.w3.org/TR/html4/strict.dtd";> HTML4