[issue1170] shlex have problems with parsing unicode

2007-09-17 Thread dexen deVries
Changes by dexen deVries: -- components: Library (Lib), Unicode severity: normal status: open title: shlex have problems with parsing unicode type: behavior versions: Python 2.5 __ Tracker <[EMAIL PROTECTED]> <http://bugs.python.org/

[issue1170] shlex have problems with parsing unicode

2007-09-17 Thread dexen deVries
New submission from dexen deVries: Feeding unicode to shlex object created in POSIX compat mode causes UnicodeDecodeError to be raised. It appears that shlex object defines sting .wordchars, containing latin-1 (iso8859-1) encoded characters with charcodes >=128, which is used to check whet

[issue1170] shlex have problems with parsing unicode

2007-09-17 Thread dexen deVries
dexen deVries added the comment: A quick paste to illustrate: the exception is raised only when unicode object is passed to shlex. Warning: the cStringIO module is unsuitable for use there, only the StringIO. cStringIO does not output unicode. dexen!muraena!~$ python Python 2.5.1 (r251:54863

[issue1170] shlex have problems with parsing unicode

2007-09-17 Thread dexen deVries
dexen deVries added the comment: One remark to previous message: the first time i created shlex object in non-POSIX mode (the default), in later it's in POSIX mode (due to the third parameter to shlex being True). The bug in question manifests only in POSIX mode. BTW, that so-called