Re: split lines from stdin into a list of unicode strings

2013-09-05 Thread Kurt Mueller
Am 05.09.2013 10:33, schrieb Peter Otten: > Kurt Mueller wrote: >> Am 29.08.2013 11:12, schrieb Peter Otten: >>> kurt.alfred.muel...@gmail.com wrote: On Wednesday, August 28, 2013 1:13:36 PM UTC+2, Dave Angel wrote: > On 28/8/2013 04:32, Kurt Mueller wrote: >> For some text manipulatio

Re: split lines from stdin into a list of unicode strings

2013-09-05 Thread Peter Otten
Kurt Mueller wrote: > Am 29.08.2013 11:12, schrieb Peter Otten: >> kurt.alfred.muel...@gmail.com wrote: >>> On Wednesday, August 28, 2013 1:13:36 PM UTC+2, Dave Angel wrote: On 28/8/2013 04:32, Kurt Mueller wrote: > For some text manipulation tasks I need a template to split lines > f

Re: split lines from stdin into a list of unicode strings

2013-09-05 Thread Kurt Mueller
Am 29.08.2013 11:12, schrieb Peter Otten: > kurt.alfred.muel...@gmail.com wrote: >> On Wednesday, August 28, 2013 1:13:36 PM UTC+2, Dave Angel wrote: >>> On 28/8/2013 04:32, Kurt Mueller wrote: For some text manipulation tasks I need a template to split lines from stdin into a list of str

Re: split lines from stdin into a list of unicode strings

2013-08-29 Thread Peter Otten
Kurt Mueller wrote: > I have to say that I am a bit disapointed by the chardet library. > The encoding for the single character 'ü' > is detected as {'confidence': 0.99, 'encoding': 'EUC-JP'}, > whereas "file" says: > $ echo "ü" | file -i - > /dev/stdin: text/plain; charset=utf-8 > $ > > "ü" is a

Re: split lines from stdin into a list of unicode strings

2013-08-29 Thread Kurt Mueller
Am 29.08.2013 11:12, schrieb Peter Otten: > kurt.alfred.muel...@gmail.com wrote: >> On Wednesday, August 28, 2013 1:13:36 PM UTC+2, Dave Angel wrote: >>> On 28/8/2013 04:32, Kurt Mueller wrote: For some text manipulation tasks I need a template to split lines from stdin into a list of str

Re: split lines from stdin into a list of unicode strings

2013-08-29 Thread Peter Otten
kurt.alfred.muel...@gmail.com wrote: > On Wednesday, August 28, 2013 1:13:36 PM UTC+2, Dave Angel wrote: >> On 28/8/2013 04:32, Kurt Mueller wrote: >> > For some text manipulation tasks I need a template to split lines >> > from stdin into a list of strings the way shlex.split() does it. >> > The

Re: split lines from stdin into a list of unicode strings

2013-08-28 Thread kurt . alfred . mueller
On Wednesday, August 28, 2013 1:13:36 PM UTC+2, Dave Angel wrote: > On 28/8/2013 04:32, Kurt Mueller wrote: > > For some text manipulation tasks I need a template to split lines > > from stdin into a list of strings the way shlex.split() does it. > > The encoding of the input can vary. > Does that

Re: split lines from stdin into a list of unicode strings

2013-08-28 Thread Dave Angel
On 28/8/2013 04:32, Kurt Mueller wrote: > This is a follow up to the Subject > "right adjusted strings containing umlauts" You started a new thread, with a new subject line. So presumably we're starting over with a clean slate. > > For some text manipulation tasks I need a template to split lin