# What's the correct way to get the
# byte count of a unicode (UTF-8) string?
# I couldn't find a builtin method
# and the following is memory inefficient.

ustr = "example\xC2\x9D".decode('UTF-8')

num_chars = len(ustr)    # 8

buf = ustr.encode('UTF-8')

num_bytes = len(buf)     # 9


# Thanks.

-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to