Re: Chardet, file, ... and the Flexible String Representation

Chris Angelico Fri, 06 Sep 2013 12:43:50 -0700

On Sat, Sep 7, 2013 at 1:46 AM, Piet van Oostrum <[email protected]> wrote:
> The FSR simply stores a Unicode string as an array[*] of ints (the Unicode 
> code points of the characters of the string). That's it. Then it uses a 
> memory-efficient way to store this array of ints. But that has nothing to do 
> with character sets. The same principle could be used for any array of ints.


Python does, in fact, store integers in different-sized blocks of
memory according to size - though not for anything smaller than
32-bit.

>>> import sys
>>> sys.getsizeof(100)
14
>>> sys.getsizeof(1000000000000000000000000000000000)
28

So why this is suddenly a bad thing for characters is a mystery none
but he can comprehend.
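
For anyone who wants to try the comparison themselves, here is a rough sketch, assuming CPython 3.3 or later. The exact byte counts differ between versions and between 32-bit and 64-bit builds, so treat only the relative ordering as meaningful. The helper name `sizes` is made up for this illustration.

```python
import sys

def sizes(values):
    """Return sys.getsizeof() for each value, in order (hypothetical helper)."""
    return [sys.getsizeof(v) for v in values]

# Integers: a larger magnitude needs more internal storage, hence more bytes.
int_sizes = sizes([100, 10**33, 10**100])

# Strings of equal length whose widest code point differs:
# U+0061 fits in one byte, U+0101 needs two, U+10000 needs four.
str_sizes = sizes(["a" * 100, "\u0101" * 100, "\U00010000" * 100])

print(int_sizes)  # grows with the magnitude of the integer
print(str_sizes)  # grows with the widest code point in the string
```

Both lists should come out in increasing order: the same idea of sizing storage to the largest value present, applied once to integers and once to characters.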

ChrisA