[issue12742] Add support for CESU-8 encoding

2011-08-29 Thread Ezio Melotti
Ezio Melotti added the comment: I'm going to reject this. If people need it, they can always implement it using the codecs module. -- resolution: -> rejected stage: -> committed/rejected status: open -> closed ___ Python tracker

[issue12742] Add support for CESU-8 encoding

2011-08-29 Thread Adal Chiriliuc
Adal Chiriliuc added the comment: It's an internal web API at the place I work for. To be able to use it from Python in some form, I did an workaround in which I just stripped everything outside BMP: # replace characters outside BMP with 'REPLACEMENT CHARACTER' (U+FFFD) def cesu8_to_utf8(text

[issue12742] Add support for CESU-8 encoding

2011-08-26 Thread Ezio Melotti
Ezio Melotti added the comment: Can you provide some example? The page you linked says "It should be used exclusively for internal processing and never for external data exchange.", so I'm not sure why these APIs would want to use it. -- nosy: +ezio.melotti __

[issue12742] Add support for CESU-8 encoding

2011-08-12 Thread Éric Araujo
Changes by Éric Araujo : -- components: +Library (Lib) nosy: +lemburg versions: +Python 3.3 -Python 3.4 ___ Python tracker ___ ___ Pyt

[issue12742] Add support for CESU-8 encoding

2011-08-12 Thread Adal Chiriliuc
New submission from Adal Chiriliuc : CESU-8 is identical with UTF-8 except that it has a different encoding format for surrogate characters. http://en.wikipedia.org/wiki/CESU-8 It is used by some web APIs. -- components: Unicode messages: 141958 nosy: adalx priority: normal severity: