In article <b6db8982-feac-4036-8ec4-2dc720d41...@googlegroups.com>,
Ned Batchelder <n...@nedbatchelder.com> wrote:

> In Python3, repr() will return a Unicode string, and will preserve existing 
> Unicode characters in its arguments.  This has been controversial.  To get 
> the Python 2 behavior of a pure-ascii representation, there is the new 
> builtin ascii(), and a corresponding %a format string.

I'm still stuck on Python 2, and while I can understand the controversy ("It 
breaks my Python 2 code!"), this seems like the right thing to have done.  In 
Python 2, unicode is an add-on.  One of the big design drivers in Python 3 was 
to make unicode the standard.

The idea behind repr() is to provide a "just plain text" representation of an 
object.  In P2, "just plain text" means ascii, so escaping non-ascii characters 
makes sense.  In P3, "just plain text" means unicode, so escaping non-ascii 
characters no longer makes sense.

Some of us have been doing this long enough to remember when "just plain text" 
meant only a single case of the alphabet (and a subset of ascii punctuation).  
On an ASR-33, your C program would print like:

MAIN() \(
        PRINTF("HELLO, ASCII WORLD");
\)

because ASR-33's didn't have curly braces (or lower case).

Having P3's repr() escape non-ascii characters today makes about as much sense 
as expecting P2's repr() to escape curly braces (and vertical bars, and a few 
others) because not every terminal can print those.

--
Roy Smith
r...@panix.com

-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to