Hello Mutt users, like most people, I often receive eMails from Windows users. These eMails tend to contain some of those characters from the Windows-1252 character set that are not part of the iso-8859-1 standard (aka Latin-1). You know.
Since Windows-1252 is actually a small extension to iso-8859-1, it should be easy to define a mapping to iso-8859-1 codes approximating those funny characters. Right, that's what the excellent programs iconv and recode are for, aren't they? The problem is, many Windows user send bogus Content-Type header lines, so where the line should read, Content-Type: text/plain; charset=Windows-1252 I often find, Content-Type: text/plain; charset=iso-8859-1 or even, Content-Type: text/plain; charset=us-ascii instead. A quick fix would be to filter text/plain messages through a sed or tr script doing the substitutions. Some code to trigger this could be added in ~/.mailcap or ~/.procmailrc perhaps. Is this a nonsense idea? Has anybody solved this problem already? I could not find anything about this in the list archive at Yahoo. If anybody finds this idea useful, I'll go ahead with this idea and notify the list once I get my solution working. Or should this feature be included into Mutt proper? It might still be the case that this problem is just a misconfiguration issue. Here is my configuration: o Mutt: System: Linux 2.4.4-4GB (i586) [using ncurses 5.2] Einstellungen bei der Compilierung: -DOMAIN +DEBUG -HOMESPOOL -USE_SETGID +USE_DOTLOCK -DL_STANDALONE +USE_FCNTL -USE_FLOCK -USE_POP -USE_IMAP -USE_GSS -USE_SSL -USE_SASL +HAVE_REGCOMP -USE_GNU_REGEX +HAVE_COLOR +HAVE_START_COLOR +HAVE_TYPEAHEAD +HAVE_BKGDSET +HAVE_CURS_SET +HAVE_META +HAVE_RESIZETERM +HAVE_PGP -BUFFY_SIZE -EXACT_ADDRESS -SUN_ATTACHMENT +ENABLE_NLS -LOCALES_HACK +HAVE_WC_FUNCS +HAVE_LANGINFO_CODESET ++HAVE_LANGINFO_YESEXPR +HAVE_ICONV -ICONV_NONTRANS +HAVE_GETSID -HAVE_GETADDRINFO ISPELL="/usr/bin/ispell" SENDMAIL="/usr/sbin/sendmail" MAILPATH="/var/mail" PKGDATADIR="/usr/local/share/mutt" SYSCONFDIR="/usr/local/etc" EXECSHELL="/bin/sh" -MIXMASTER o iconv: iconv (GNU libc) 2.2.2 (which knows a lot of names for the character sets I've just mentioned. Hence, I don't think any charset-hook is necessary) o locale: my shell environment contains LANG=de_DE.ISO-8859-1, and I have not set locale or charset in any Muttrc. (Setting LANG=de_DE.UTF-8 makes things even worse, even though other UTF-8 enabled apps like w3m-m17n work just fine. I'll post a separate message about my experiment with UTF-8.) o my *nix: SuSE Linux 7.2 (Linux 2.4.4-4GB) Thanks for you attention. Cristian -- }{ Cristian Pietsch }{ http://www.interling.de
msg21177/pgp00000.pgp
Description: PGP signature