retitle 291735 -dump should correctly convert charsets tags 291735 + moreinfo thanks
Hi, martin f krafft, le Sun 12 Nov 2006 14:28:35 +0100, a écrit : > also sprach Samuel Thibault <[EMAIL PROTECTED]> [2006.08.21.1455 +0200]: > > These got fixed with upload of 0.5.1-5 > > I don't see how #291735 got fixed: Well, for what I was speaking about, it got fixed: w3m doesn't "simplify" the charset of the page any more: when I get to a utf-8 encode page that contains only ascii characters and a form, w3m still consider this as a utf-8 page, and sends data in the form appropriately. Here you are talking about another problem: > However, in an UTF-8 environment, they can definitely be printed, > but: > > piper:~> LC_ALL=de_CH.UTF-8 w3m -dump -T 'text/html;charset=iso8859-1' > test.latin1.html > <b>???</b> > piper:~> LC_ALL=de_CH.UTF-8 w3m -dump -T 'text/html;charset=iso8859-1' > test.latin1.html | file - > /dev/stdin: ASCII text This works for me: € LC_ALL=fr_FR w3m -dump -T 'text/html; charset=iso-8859-1' test.htm <b>âêî</b> € LC_ALL=fr_FR.UTF-8 w3m -dump -T 'text/html; charset=iso-8859-1' test.htm <b>âêî</b> Do you really have the de_CH.UTF-8 locale compiled? Samuel -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

