On 21 Mar 2007 at 20:05, Dr.Ruud wrote:

> "Beginner" schreef:
>
> > The Iconv route hasn't been too successful either. I tried
> > Text::Iconv->new('ISO8859-1','utf8');


> > Thinking that my data is currently ISO8859-1but the results were not
> > as I had hoped. Where I had MICROSCÓPIO, I got MICROSCÃPIO.
>
> I don't think you are showing all characters in the utf8-result.
>
> Also you send your email in ISO-8859-1 encoding (or gmail recoded it for
> you) so we can't check whether your utf8-result is good or bad. I think
> it was good but got lost some characters when you copy-pasted it.
>

It is difficult to verify what is happening.

My locale is set to
LANG=en_US.UTF-8
LC_CTYPE="en_US.UTF-8"

I tried the following:
>iconv -f ISO8859-1 -t UTF-8 testfile.xml > testfile-utf8.xml
>diff testfile.xml testfile-utf8.xml

...snip
>                       <keyword>MICROSCÓPIO</keyword>
341c341
<                       <keyword>MICROSCÃPIO</keyword>

Then
>iconv -f UTF-8 -t ISO8859-1 testfile-utf8.xml > andback.xml
>diff andback.xml testfile
> [nothing]

I think I am going off topic a bit but I think this means that if the
file comes as ISO8859-1, I can set the xml encoding="ISO8859-1" and
the recipent can use iconv to convert the data back to UTF-8 when
they receive it.

Fingers crossed. Thanx for all the feedback.
Dp.


--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/


Reply via email to