Re: byte count unicode string

Marc 'BlackJack' Rintsch Tue, 19 Sep 2006 23:42:56 -0700

In <[EMAIL PROTECTED]>, willie wrote:

> # What's the correct way to get the
> # byte count of a unicode (UTF-8) string?
> # I couldn't find a builtin method
> # and the following is memory inefficient.
> 
> ustr = "example\xC2\x9D".decode('UTF-8')
> 
> num_chars = len(ustr)    # 8
> 
> buf = ustr.encode('UTF-8')
> 
> num_bytes = len(buf)     # 9


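That is the correct way.  A unicode object has no byte count of its own;
the count only exists for a particular encoding, so encoding to UTF-8 and
taking len() of the result is how you get it.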
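
If the temporary copy is a worry for very large strings, here is a
minimal sketch of one way to bound it by encoding in slices.  The names
utf8_len, utf8_len_chunked and chunk_size are made up for illustration,
not builtins, and the sketch assumes a Python where every element of the
string is a whole code point (Python 3.3+, or a wide Python 2 build):

```python
def utf8_len(ustr):
    """Byte count of ustr when encoded as UTF-8 (encodes the whole string)."""
    return len(ustr.encode('utf-8'))


def utf8_len_chunked(ustr, chunk_size=4096):
    """Same count, but only one slice is encoded at a time.

    Hypothetical helper: assumes each element of ustr is a full code
    point, so cutting the string at any index cannot split a character.
    """
    total = 0
    for start in range(0, len(ustr), chunk_size):
        total += len(ustr[start:start + chunk_size].encode('utf-8'))
    return total


ustr = u"example\u009d"             # same text as in the question
print(len(ustr))                    # 8 characters
print(utf8_len(ustr))               # 9 bytes
print(utf8_len_chunked(ustr, 3))    # 9 bytes, counted three characters at a time
```

For ordinary strings the plain len(ustr.encode('UTF-8')) is simpler and
probably faster; the sliced version only trades some speed for a smaller
temporary buffer.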

Ciao,
        Marc 'BlackJack' Rintsch
-- 
http://mail.python.org/mailman/listinfo/python-list
