Denis S. Otkidach wrote: > On 10 Feb 2005 11:49:33 -0800 > "Serge Orlov" <[EMAIL PROTECTED]> wrote: > > > This thread is about problems only with LANG=C or LANG=POSIX, it's not > > about other locales. Other locales are working as expected. > > You are not right. I have LANG=de_DE.UTF-8, and the Python test_re.py > doesn't pass.
I meant "only with C or POSIX locales" when I wrote "only with LANG=C or LANG=POSIX". My bad. > $LANG doesn't matter if I don't call setlocale. Sure. > Fortunately setting any non-C locale solves the problem for all (I > believe) unicode character: > > >>> re.compile(ur'\w+', re.U).findall(u'\xb5\xba\xe4\u0430') > [u'\xb5\xba\xe4\u0430'] I can't find the strict definition of isalpha, but I believe average C program shouldn't care about the current locale alphabet, so isalpha is a union of all supported characters in all alphabets Serge. -- http://mail.python.org/mailman/listinfo/python-list