[issue18625] ks_c-5601-1987 is used by microsoft when it really means cp949

2013-08-02 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: The alias was added by Hye-Shik Chang: http://hg.python.org/cpython-fullhistory/annotate/887ce39f95f2/Lib/encodings/aliases.py#198 I've added him to the nosy list. If the alias doesn't match, we'd have to add a codec for the mismatching encoding to maintain

[issue18625] ks_c-5601-1987 is used by microsoft when it really means cp949

2013-08-01 Thread R. David Murray
New submission from R. David Murray: When Microsoft handles Korean text, it uses its own code page, cp949, which is a superset of ks_c-5601-1987. But when talking to the rest of the world, it claims that the character set name is ks_c-5601-1987. This means that text claimed to be in ks_c-560
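
The superset relationship described in this submission can be illustrated with a short Python sketch. It assumes that Python's euc_kr codec stands in for the strict ks_c-5601-1987 character set, and the two sample syllables are picked only for illustration (they do not come from the report). The helper name decodes_as is hypothetical.

```python
def decodes_as(data: bytes, encoding: str) -> bool:
    """Return True if data can be decoded with the given codec."""
    try:
        data.decode(encoding)
    except UnicodeDecodeError:
        return False
    return True


# A syllable that is part of the base Korean set: both codecs handle it.
common = "가".encode("cp949")

# A syllable that exists only in the cp949 extension (illustrative choice).
extended = "똠".encode("cp949")

print("common via euc_kr:  ", decodes_as(common, "euc_kr"))
print("extended via euc_kr:", decodes_as(extended, "euc_kr"))
print("extended via cp949: ", decodes_as(extended, "cp949"))
```

If text labelled ks_c-5601-1987 is decoded with the stricter codec, bytes from the cp949 extension are the ones likely to fail, while decoding the same bytes as cp949 should succeed.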