Re: "convert" string to bytes without changing data (encoding)

Tim Chase Wed, 28 Mar 2012 12:03:43 -0700

On 03/28/12 13:05, Ross Ridge wrote:

Ross Ridge<rri...@csclub.uwaterloo.ca>  wr=

But a Python Unicode string might be stored in several
ways; for all you know, it might actually be stored as a sequence of
apples in a refrigerator, just as long as they can be referenced
correctly.


But it is in fact only stored in one particular way, as a series of bytes.

There's no logical Python way to turn that into a series of bytes.


Nonsense.  Play all the semantic games you want, it already is a series
of bytes.

Internally, they're a series of bytes, but they are MEANINGLESSbytes unless you know how they are encoded internally. Thosebytes could be UTF-8, UTF-16, UTF-32, or any of a number of otherpossible encodings[1]. If you get the internal byte stream,there's no way to meaningfully operate on it unless you also knowhow it's encoded (or you're willing to sacrifice the ability toreliably get the string back).


-tkc

[1]
http://docs.python.org/library/codecs.html#standard-encodings




--
http://mail.python.org/mailman/listinfo/python-list

Re: "convert" string to bytes without changing data (encoding)

Reply via email to