Ezio Melotti added the comment:
The characters with the Other_ID_Continue property are also included, i.e.:
00B7 ; Other_ID_Continue # Po MIDDLE DOT
0387 ; Other_ID_Continue # Po GREEK ANO TELEIA
1369..1371; Other_ID_Continue # No [9] ETHIOPIC DIGIT ONE..ETHIO
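For anyone who wants to reproduce this, here is a minimal sketch that checks the code points listed above with `str.isidentifier()`. The helper name `identifier_positions` and the `CANDIDATES` table are made up for this illustration, and only the first code point of the Ethiopic range is tried.

```python
# Code points taken from the Other_ID_Continue entries quoted above
# (only the first code point of the Ethiopic range is used here).
CANDIDATES = {
    "MIDDLE DOT": "\u00b7",
    "GREEK ANO TELEIA": "\u0387",
    "ETHIOPIC DIGIT ONE": "\u1369",
}


def identifier_positions(ch):
    """Return (valid_at_start, valid_after_letter) for a single character."""
    return (ch + "a").isidentifier(), ("a" + ch).isidentifier()


for name, ch in CANDIDATES.items():
    start_ok, continue_ok = identifier_positions(ch)
    print(f"U+{ord(ch):04X} {name}: start={start_ok} continue={continue_ok}")
```

A character that is accepted only after a letter, as MIDDLE DOT is, behaves like a continuation character rather than a start character.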
julien tayon added the comment:
http://www.fileformat.info/info/unicode/char/b7/index.htm
The Unicode category is Po (Punctuation).
Empirically, it cannot start a variable name, so according to the rules given in
the lexical analysis it should be in one of the categories Mn, Mc, Nd, Pc,
which is not the case.
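The mismatch can be seen with a short sketch using the standard `unicodedata` module. The helper `describe` and the set `CONTINUE_CATEGORIES` are names invented for this illustration; the set holds only the four categories mentioned in this comment, not the full rule from the documentation.

```python
import unicodedata

# Categories named in this comment as allowed for continuation characters.
CONTINUE_CATEGORIES = {"Mn", "Mc", "Nd", "Pc"}


def describe(ch):
    """Return (general category, listed as continuation category, accepted after a letter)."""
    category = unicodedata.category(ch)
    return category, category in CONTINUE_CATEGORIES, ("x" + ch).isidentifier()


print(describe("\u00b7"))  # MIDDLE DOT
print(describe("_"))       # LOW LINE, for comparison
print(describe("\u2022"))  # BULLET, another Po character
```

If the category list alone decided the outcome, the second and third fields would always agree; for MIDDLE DOT they do not.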
R. David Murray added the comment:
The rules for python identifiers are documented here:
http://docs.python.org/dev/reference/lexical_analysis.html#identifiers
Are you saying that the behavior does not match the documentation?
--
nosy: +r.david.murray
___
New submission from julien tayon:
I guess Unicode variable names are restricted to letters, and that symbols and
punctuation should be ignored (except _).
I have tested other dots (punctuation); they don't work.
Only
http://www.fileformat.info/info/unicode/char/00b7/index.htm
oddly enough has wo