Xah Lee <[EMAIL PROTECTED]> wrote:
> " It's very wasteful of space. In most texts, the majority of the
>code points are less than 127, or less than 255, so a lot of space is
>occupied by zero bytes. "
>
>Not true. In Asia, most chars has unicode number above 255. Considered
>globally, *possibly*
J. Cliff Dyer wrote:
" ...UCS-2, for example, is a fixed width, 2-byte encoding that can
handle any unicode code point up to 0x, but cannot handle the 3
and 4 byte extension sets. "
I was going to reply to say that this is a good point. But on my way i
looked up wikipedia,
http://en.wikipedia
On Mon, 10 Sep 2007 19:26:20 -0700, Xah Lee wrote:
> ・ Many Internet standards are defined in terms of textual data, and
> can't handle content with embedded zero bytes.
>
> Not sure what he mean by "can't handle content with embedded zero
> bytes". Overall i think this sentence is silly, and h
Xah Lee wrote:
> This post is about some notes and corrections to a online article
> regarding unicod and python.
>
> --
>
> by happenstance i was reading:
>
> Unicode HOWTO
> http://www.amk.ca/python/howto/unicode
>
> Here's some problems i see:
>
> ・ No conspicuous authorship. (howeve
This post is about some notes and corrections to a online article
regarding unicod and python.
--
by happenstance i was reading:
Unicode HOWTO
http://www.amk.ca/python/howto/unicode
Here's some problems i see:
・ No conspicuous authorship. (however, oddly, it has a conspicuous
ackno
Google groups seems to be stripping my quotation markslately.
Here's a retry to post my previous message.
--
Xah Lee wrote:
If i have a nested list, where the atoms are unicode strings, e.g.
# -*- coding: utf-8 -*-
ttt=[[u"→
On Sep 10, 8:12 am, Carsten Haese <[EMAIL PROTECTED]> wrote:
Xah Lee wrote:
If i have a nested list, where the atoms are unicode strings, e.g.
# -*- coding: utf-8 -*-
ttt=[[u"→",u"↑"], [u"αβγ"],...]
print ttt
how can i print it without getting the u'\u1234' notation?
i.e. i want it print just l
On Mon, 2007-09-10 at 06:59 -0700, Xah Lee wrote:
> If i have a nested list, where the atoms are unicode strings, e.g.
>
> # -*- coding: utf-8 -*-
> ttt=[[u"→",u"↑"], [u"αβγ"],...]
> print ttt
>
> how can i print it without getting the u'\u1234' notation?
> i.e. i want it print just like this: [[
If i have a nested list, where the atoms are unicode strings, e.g.
# -*- coding: utf-8 -*-
ttt=[[u"→",u"↑"], [u"αβγ"],...]
print ttt
how can i print it without getting the u'\u1234' notation?
i.e. i want it print just like this: [[u"→"], ...]
I can of course write a loop then for each string use