Re: Turkic I and re

2011-09-16 Thread Oktay Safak
Well, I'm a self taught Turkish python coder. I was bitten by this first in Python 2.3 and asked the group about it then. You can find the correspondence by googling "unicode bug in turkish characters? " There are a couple of

Re: Turkic I and re

2011-09-16 Thread Steven D'Aprano
Thomas Rachel wrote: > Am 15.09.2011 15:16 schrieb Alan Plum: > >> The Turkish 'I' is a peculiarity that will probably haunt us programmers >> until hell freezes over. Meh, I don't think it's much more peculiar that any other diacritic issue. If I'm German or English, I probably want ö and O to

Re: Turkic I and re

2011-09-16 Thread Thomas Rachel
Am 15.09.2011 15:16 schrieb Alan Plum: The Turkish 'I' is a peculiarity that will probably haunt us programmers until hell freezes over. That's why it would have been nice if the Unicode guys had defined "both Turkish i-s" at separate codepoints. Then one could have the three pairs I, i ("n

Re: Turkic I and re

2011-09-15 Thread Yaşar Arabacı
Hi, I am a Turkish self-taught python user. Personally, I don't think I am in a position to discuss a issue in this scale. But in my opinion, I think pardus* developers should be invited to join to this discussion. As they are using python heavily on most of their projects** I think they would hav

Re: Turkic I and re

2011-09-15 Thread Tim Chase
On 09/15/11 09:06, MRAB wrote: It's somewhat unlikely that Unicode will become locale-dependent in Python because it would cause problems; you don't want: "i".upper() == "I" to be maybe true, maybe false. An option would be to specify whether it should be locale-dependent. There have b

Re: Turkic I and re

2011-09-15 Thread MRAB
On 15/09/2011 14:44, John-John Tedro wrote: On Thu, Sep 15, 2011 at 1:16 PM, Alan Plum mailto:m...@alanplum.com>> wrote: On 2011-09-15 15:02, MRAB wrote: The regex module at http://pypi.python.org/pypi/__regex currently uses a comp

Re: Turkic I and re

2011-09-15 Thread John-John Tedro
On Thu, Sep 15, 2011 at 1:16 PM, Alan Plum wrote: > On 2011-09-15 15:02, MRAB wrote: > >> The regex module at >> http://pypi.python.org/pypi/**regexcurrently >> uses a >> compromise, where it matches 'I' with 'i' and also 'I' with 'ı' and 'İ' >> with 'i'. >> >

Re: Turkic I and re

2011-09-15 Thread Alan Plum
On 2011-09-15 15:02, MRAB wrote: The regex module at http://pypi.python.org/pypi/regex currently uses a compromise, where it matches 'I' with 'i' and also 'I' with 'ı' and 'İ' with 'i'. I was wondering if it would be preferable to have a TURKIC flag instead ("(?T)" or "(?T:...)" in the pattern).