Chas Owens [mailto:[EMAIL PROTECTED]] wrote:
> On 21 Jun 2001 10:38:08 +0200, Morgan wrote:
> > This script is exelent but I need the script to read the
> letters "åäö"
> > and "ÅÄÖ" too.
> > Cuz this is part of my launguage (Swedish) and those
> letters are in the
> > articles.
> I am working on this, I don't understand what it is doing
> with them. If
> I add <?xml version='1.0' encoding='ISO-8859-1'?> to the start of the
> file the parser doesn't bomb any more, but it outputs "Dragkamp om
> förlusttÃ¥g" instead of "Dragkamp om förlusttåg". My current
> assumption is that it is doing something funky involving two-byte
> UNICODE chars (this is based on the observation that the funky chars
> appear to be doubled). If anyone could shed some light on
> this I would
Try
[...]encoding='ISO-8859-4'[...]
ISO-8859-1 (aka Latin-1) coveres W. Europe, ISO-8859-4 is the specific
Scandinavian character set (almost, but not quite, the same as -1).
If this does not work, have a look at using UTF-8 (but this means those
accented characters will need to be encoded appropriately).
Richard Cox
Senior Software Developer
Dell Technology Online
All opinions and statements mine and do not in any way (unless expressly
stated) imply anything at all on behalf of my employer