retitle 291735 -dump should correctly convert charsets
tags 291735 + moreinfo
thanks

Hi,

martin f krafft, le Sun 12 Nov 2006 14:28:35 +0100, a écrit :
> also sprach Samuel Thibault <[EMAIL PROTECTED]> [2006.08.21.1455 +0200]:
> > These got fixed with upload of 0.5.1-5
> 
> I don't see how #291735 got fixed:

Well, for what I was speaking about, it got fixed: w3m doesn't
"simplify" the charset of the page any more: when I get to a utf-8
encode page that contains only ascii characters and a form, w3m still
consider this as a utf-8 page, and sends data in the form appropriately.

Here you are talking about another problem:

> However, in an UTF-8 environment, they can definitely be printed,
> but:
> 
> piper:~> LC_ALL=de_CH.UTF-8 w3m -dump -T 'text/html;charset=iso8859-1' 
> test.latin1.html
> <b>???</b>
> piper:~> LC_ALL=de_CH.UTF-8 w3m -dump -T 'text/html;charset=iso8859-1' 
> test.latin1.html | file -
> /dev/stdin: ASCII text

This works for me:

€ LC_ALL=fr_FR w3m -dump -T 'text/html; charset=iso-8859-1' test.htm
<b>âêî</b>
€ LC_ALL=fr_FR.UTF-8 w3m -dump -T 'text/html; charset=iso-8859-1' test.htm
<b>âêî</b>

Do you really have the de_CH.UTF-8 locale compiled?

Samuel


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to