Re: Re: "convert" string to bytes without changing data (encoding)

Evan Driscoll Wed, 28 Mar 2012 12:25:09 -0700

On 01/-10/-28163 01:59 PM, Ross Ridge wrote:

Steven D'Aprano<steve+comp.lang.pyt...@pearwood.info>  wrote:

The right way to convert bytes to strings, and vice versa, is via
encoding and decoding operations.


If you want to dictate to the original poster the correct way to do
things then you don't need to do anything more that.  You don't need to
pretend like Chris Angelico that there's isn't a direct mapping from
the his Python 3 implementation's internal respresentation of strings
to bytes in order to label what he's asking for as being "silly".


That mapping may as well be:

  def get_bytes(some_string):
      import random
      length = random.randint(len(some_string), 5*len(some_string))
      bytes = [0] * length
      for i in xrange(length):
          bytes[i] = random.randint(0, 255)
      return bytes

Of course this is hyperbole, but it's essentially about as muchguarantee as to what the result is.

As many others have said, the encoding isn't defined, and I would guessvaries between implementations. (E.g. if Jython and IronPython use theirhost platforms' native strings, both have 16-bit chars and thus probablyuse UTF-16 encoding. I am not sure what CPython uses, but I bet it's*not* that.)

It's even guaranteed that the byte representation won't change! Ifsomething is lazily evaluated or you have a COW string or something, thebytes backing it will differ.

So yes, you can say that pretending there's not a mapping of strings tointernal representation is silly, because there is. However, there'snothing you can say about that mapping.


Evan
--
http://mail.python.org/mailman/listinfo/python-list

Re: Re: "convert" string to bytes without changing data (encoding)

Reply via email to