Re: [fpc-devel] Memory consumed by strings

Daniël Mantione Sun, 23 Nov 2008 04:31:52 -0800


Op Sun, 23 Nov 2008, schreef listmember:

On 2008-11-23 14:10, Daniël Mantione wrote:

Therefore, any other encoding is a waste of memory and does not gain you
any speed. For that reason, I don't see the compiler switch from 8-bit
processing either.


I nearly fully agree with you.

Except that, when a string constant needs to contain non-ASCI chars. What dowe do in these cases?

The common approach is to do nothing, no processing needs to be done. I.e.the compiler justs passes on the bytes one by one from the source file tothe object file.

For an IDE, this is a little bit more complicated. I.e. searching for a çin a source file needs to find both the composed and the decomposedvariant, and in the case of UTF-8, this character can be encoded in 1, 2,3 or 4 bytes which all need to be found. This is where UTF-16 and UTF-32start to make sense.

Only if you need to process characters (rather than pass them on),
UTF-32 is a lot faster and simpler.


Yes. If I knew how to write this patch, I'd be working on it right now.

Unfortunately an UTF-32 string type is not on our roadmap either, so itwould have to be an user contribution.


Daniël

_______________________________________________
fpc-devel maillist  -  [email protected]
http://lists.freepascal.org/mailman/listinfo/fpc-devel

Re: [fpc-devel] Memory consumed by strings

Reply via email to