[issue5358] Unicode control characters are not allowed as identifiers

2009-02-27 Thread Martin v. Löwis
Martin v. Löwis added the comment: See PEP 3131 for the specification of what constitutes an identifier in Python. Closing this as "won't fix". -- resolution: -> wont fix status: open -> closed

[issue5358] Unicode control characters are not allowed as identifiers

2009-02-27 Thread Matthew Barnett
Matthew Barnett added the comment: The definition of a word in the new re module (actually targeted at Python 2.7) is currently a sequence of L&, N&, M& and Pc. I suppose ideally we want the definitions of a word and an identifier to be basically the same, except that an identifier can't start

[issue5358] Unicode control characters are not allowed as identifiers

2009-02-26 Thread Ezio Melotti
Ezio Melotti added the comment: Valid identifiers should begin with a letter or '_' and contain only letters, numbers and '_'. This probably means that only the Unicode characters that belong to the categories Ll, Lu (Letter Lower/Upper case), Nd (Number, Decimal Digit) and Pc (Punctuation, Conn
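The category-based rule suggested in this comment can be sketched with the standard `unicodedata` module. This is only an illustration of the idea as quoted: the comment is cut off in the archive, so the category sets below are an assumption, the helper name `looks_like_identifier` is hypothetical, and this is not the rule Python itself applies (that one is specified in PEP 3131).

```python
import unicodedata

# Categories named in the comment above. The archived text is truncated,
# so these sets are an assumption and may be incomplete.
START_CATEGORIES = {"Ll", "Lu"}
CONTINUE_CATEGORIES = {"Ll", "Lu", "Nd", "Pc"}


def looks_like_identifier(name):
    """Apply the simple category rule: letter or '_' first, then letters, digits, '_'."""
    if not name:
        return False
    first, rest = name[0], name[1:]
    # '_' is allowed as the first character even though its category is Pc.
    if first != "_" and unicodedata.category(first) not in START_CATEGORIES:
        return False
    # Every following character must be a letter, a decimal digit,
    # or connector punctuation such as '_'.
    return all(unicodedata.category(ch) in CONTINUE_CATEGORIES for ch in rest)
```

Under this rule a name containing U+200D is rejected, because the general category of that character is Cf, which is in neither set.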

[issue5358] Unicode control characters are not allowed as identifiers

2009-02-26 Thread Baiju M
Baiju M added the comment: I think RFC-3454 [1] can be used as a basis for selecting which control characters are valid in an identifier. [1] http://www.rfc-editor.org/rfc/rfc3454.txt

[issue5358] Unicode control characters are not allowed as identifiers

2009-02-26 Thread Baiju M
Baiju M added the comment: On a further look at this issue, I understand that Python cannot use all Unicode control characters as identifiers. But for many international languages, without some control characters like ZWJ and ZWNJ [1], it won't be possible to construct all characters with proper visua

[issue5358] Unicode control characters are not allowed as identifiers

2009-02-24 Thread Ezio Melotti
Changes by Ezio Melotti: -- nosy: +ezio.melotti

[issue5358] Unicode control characters are not allowed as identifiers

2009-02-24 Thread Martin v. Löwis
Martin v. Löwis added the comment: Why do you think this is a bug? -- nosy: +loewis

[issue5358] Unicode control characters are not allowed as identifiers

2009-02-24 Thread Baiju M
New submission from Baiju M: I tried to use the zero-width joiner (U+200D) as part of an identifier. It produces an exception like this: SyntaxError: invalid character in identifier. I have attached the Python file which produces this error. Zero-width joiner (U+200D) is a Unicode control character:
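The attached file is not part of this archive, so the following is only a minimal sketch of how the report could be examined, assuming Python 3. It looks up the character in `unicodedata` and asks the running interpreter whether a name containing it is accepted. The helper names `describe` and `accepted_as_identifier` are hypothetical. Note that the Unicode database lists U+200D with general category Cf (Format).

```python
import unicodedata

ZWJ = "\u200d"  # the character from the report


def describe(ch):
    """Return (code point, Unicode name, general category) for one character."""
    return (
        f"U+{ord(ch):04X}",
        unicodedata.name(ch, "<unnamed>"),
        unicodedata.category(ch),
    )


def accepted_as_identifier(name):
    """Ask the running interpreter whether it treats `name` as a valid identifier."""
    return name.isidentifier()


info = describe(ZWJ)
plain = accepted_as_identifier("ab")
# The answer for this name depends on the Unicode data bundled with the
# interpreter, so no particular value is assumed here.
with_zwj = accepted_as_identifier("a" + ZWJ + "b")
```

Printing `info` and `with_zwj` on a given interpreter shows how that version classifies the character and whether it would reject the identifier as described in the report.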