[issue16249] unicode name accepts a punctuation glyph

2012-10-16 Thread Ezio Melotti
Ezio Melotti added the comment: The characters with the Other_ID_Continue property are also included, i.e.: 00B7 ; Other_ID_Continue # Po MIDDLE DOT 0387 ; Other_ID_Continue # Po GREEK ANO TELEIA 1369..1371; Other_ID_Continue # No [9] ETHIOPIC DIGIT ONE..ETHIO

[issue16249] unicode name accepts a punctuation glyph

2012-10-16 Thread julien tayon
julien tayon added the comment: http://www.fileformat.info/info/unicode/char/b7/index.htm the unicode category is Po (Ponctuation). Empirically, it cannot start a variable name so according to the rules given in the lexical analyser it should be one of : Mn, Mc, Nd, Pc Which is not the case

[issue16249] unicode name accepts a punctuation glyph

2012-10-16 Thread R. David Murray
R. David Murray added the comment: The rules for python identifiers are documented here: http://docs.python.org/dev/reference/lexical_analysis.html#identifiers Are you saying that the behavior does not match the documentation? -- nosy: +r.david.murray ___

[issue16249] unicode name accepts a punctuation glyph

2012-10-16 Thread julien tayon
New submission from julien tayon: I guess unicode variable names are restricted to letters, and that symbols and punctuation shoud be ignored (except _). I have tested other dots (punctuation) they dont work. Only http://www.fileformat.info/info/unicode/char/00b7/index.htm oddly enough has wo