-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Wed, May 11, 2016 at 06:55:31PM +0200, Michael Lange wrote: > On Wed, 11 May 2016 18:43:50 +0200 > Michael Lange <klappn...@freenet.de> wrote: > > > According to this (yet probably dated ) page: > > http://www-archive.mozilla.org/projects/intl/chardet.html > > universal_charset_detector might help if it were enabled by default. > > Just looked at the above page again, and noticed that the "japanese" > charset detector also supports UTF-8 and cp1252. So I tried to set > intl.charset.detector to "ja_parallel_state_machine" and actually at least > the minimal utf-8 text file I just tested (which contains no more than a > bunch of german umlaut characters) is now displayed correctly.
This is all pretty unfortunate: we're replacing simple, understandable mechanisms ("hey, Firefox: on my computer all text files are supposed to be utf-8, promised") by complex heuristics (which, as a very last resort are cool, mind you: "dunno what that file is: try to make a guess, please"), but *not* as the up-front decision procedure. Besides, who knows what vulnerabilities lurk in there? Hey, Mozilla! If you are listening (do you listen at all?) Where is your user's sovereignity? - -- t -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.12 (GNU/Linux) iEYEARECAAYFAlc0IhMACgkQBcgs9XrR2kYSkwCeNVxGPMtBPPOzYcOhrCqk6otf cd0An0C+T9DLagA27QP5yCSxbKKpAAKC =QSts -----END PGP SIGNATURE-----