Re: decode(..., errors='ignore') has no effect

2010-01-12 Thread Peter Otten
Lie Ryan wrote:
> On 01/12/10 23:50, Jens Müller wrote:
>>> To convert unicode into str you have to *encode()* it.
>>>
>>> u"...".decode(...) will implicitly convert to ASCII first, i. e. is
>>> equivalent to
>>>
>>> u"...".encode("ascii").decode(...)
>>>
>>> Hence the error message
>>
>> Ah - ye

Re: decode(..., errors='ignore') has no effect

2010-01-12 Thread Lie Ryan
On 01/12/10 23:50, Jens Müller wrote:
>> To convert unicode into str you have to *encode()* it.
>>
>> u"...".decode(...) will implicitly convert to ASCII first, i. e. is
>> equivalent to
>>
>> u"...".encode("ascii").decode(...)
>>
>> Hence the error message
>
> Ah - yes of course.
>
> And how can

Re: decode(..., errors='ignore') has no effect

2010-01-12 Thread Jens Müller
> To convert unicode into str you have to *encode()* it.
>
> u"...".decode(...) will implicitly convert to ASCII first, i. e. is
> equivalent to
>
> u"...".encode("ascii").decode(...)
>
> Hence the error message

Ah - yes of course.

And how can you use the system's default encoding with errors=ignore? The de
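
A minimal sketch of the point above, with some assumptions stated up front. The thread is about Python 2, where unicode objects still had a .decode() method; the code below is Python 3, where str has no .decode() at all, so the implicit step is imitated by a hypothetical helper, py2_style_decode, which is not part of any library. It hard-codes ASCII for the implicit step, following the explanation quoted above. The last part uses locale.getpreferredencoding() as one possible reading of "the system's default encoding"; that choice is an assumption too, and its result depends on the machine.

```python
import locale

text = 'M\xfcnchen, pronounced [\u02c8m\u028fn\xe7\u0259n]'


def py2_style_decode(s, encoding, errors='strict'):
    """Hypothetical helper imitating Python 2's unicode.decode():
    a strict implicit ASCII encode first, then the requested decode."""
    return s.encode('ascii').decode(encoding, errors)


try:
    py2_style_decode(text, 'cp1252', 'ignore')
    implicit_step_failed = False
except UnicodeEncodeError:
    # The implicit encode is strict, so errors='ignore' is never consulted.
    implicit_step_failed = True

# Text -> bytes is an encode, and here the error handler does apply:
# characters that cp1252 cannot represent are silently dropped.
encoded = text.encode('cp1252', 'ignore')
print(encoded)  # b'M\xfcnchen, pronounced [mn\xe7n]'

# Assumption: "system default encoding" taken as the locale's preferred one.
default_encoded = text.encode(locale.getpreferredencoding(False), 'ignore')
```

The handler only applies to the step it is passed to, so it has to be given to encode() rather than decode() when starting from text.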

Re: decode(..., errors='ignore') has no effect

2010-01-12 Thread Ulrich Eckhardt
Jens Müller wrote:
> I try to decode a string, e.g.
> u'M\xfcnchen, pronounced [\u02c8m\u028fn\xe7\u0259n]'.decode('cp1252',
> 'ignore')
> but even though I use errors='ignore'
> I get UnicodeEncodeError: 'charmap' codec can't encode character u'\u02c8'
> in position 21: character maps to <undefined>
>
> How c

Re: decode(..., errors='ignore') has no effect

2010-01-12 Thread Peter Otten
Jens Müller wrote:
> I try to decode a string, e.g.
> u'M\xfcnchen, pronounced [\u02c8m\u028fn\xe7\u0259n]'.decode('cp1252',
> 'ignore')
> but even though I use errors='ignore'
> I get UnicodeEncodeError: 'charmap' codec can't encode character u'\u02c8'
> in position 21: character maps to <undefined>
>
> How

decode(..., errors='ignore') has no effect

2010-01-12 Thread Jens Müller
Hi,

I try to decode a string, e.g.
u'M\xfcnchen, pronounced [\u02c8m\u028fn\xe7\u0259n]'.decode('cp1252', 'ignore')
but even though I use errors='ignore'
I get UnicodeEncodeError: 'charmap' codec can't encode character u'\u02c8' in position 21: character maps to <undefined>

How come?

Thanks,
Jens
--