reopen 291735 ! thanks also sprach Samuel Thibault <[EMAIL PROTECTED]> [2006.08.21.1455 +0200]: > These got fixed with upload of 0.5.1-5
I don't see how #291735 got fixed: piper:~> cat test.latin1.html <b>äöü</b> piper:~> LC_ALL=C w3m -dump -T 'text/html;charset=iso8859-1' test.latin1.html <b>???</b> piper:~> LC_ALL=C w3m -dump -T 'text/html;charset=iso8859-1' test.latin1.html | file - /dev/stdin: ASCII text piper:~> LC_ALL=de_CH w3m -dump -T 'text/html;charset=iso8859-1' test.latin1.html | file - /dev/stdin: ISO-8859 text So the output format is determined by the locale, and this makes somewhat sense: in a C environment, latin1 characters possibly cannot be printed, so w3m converts them to ?. However, in an UTF-8 environment, they can definitely be printed, but: piper:~> LC_ALL=de_CH.UTF-8 w3m -dump -T 'text/html;charset=iso8859-1' test.latin1.html <b>???</b> piper:~> LC_ALL=de_CH.UTF-8 w3m -dump -T 'text/html;charset=iso8859-1' test.latin1.html | file - /dev/stdin: ASCII text I suggest to just use the input charset as output charset. People can use iconv if they need to translate. -- .''`. martin f. krafft <[EMAIL PROTECTED]> : :' : proud Debian developer, author, administrator, and user `. `'` http://people.debian.org/~madduck - http://debiansystem.info `- Debian - when you have better things to do than fixing systems
signature.asc
Description: Digital signature (GPG/PGP)

