[PHP-DEV] Re: [ZEND-ENGINE-CVS] cvs: ZendEngine2 / zend.c

Andi Gutmans Tue, 16 Aug 2005 16:53:12 -0700

I think we should make the following assumptions:

a) Being able to create and manipulate IS_UNICODE zvals whenunicode_semantics=off will be very useful to people including the exposingof the ICU extension.b) Defining Unicode identifiers like classes/properties/functions ifunicode_semantics=off does not seem useful and should be prohibited.c) People can always find ways of misusing the language & apis to reach astate which they shouldn't be reaching, For example, assuming (a) & (b)using create_function to misuse the engine and create a Unicode functionname when Unicode=off.

I don't believe we can or should enforce every possibility of misuse orwe'll bloat the code and will never reach perfection. That said, weprobably can enforce the obvious places where people try to define unicodeclasses/functions/properties when unicode_semantics=off.

btw, I'm only referring to identifiers. If unicode=off then i believethings like arrays should support IS_UNCODE keys/values in addition toIS_STRING for reasons as in (a). As per original design those two wouldn'tmatch though as they would when we're in full blown unicode mode.

Dmitry, do you thing that not allowing unicode identifiers when unicode=offwould be hard to accomplish? it would make life easier when it comes tocode that sparked this discussion (and maybe harder in other cases).

Due to (c) I'm king of worried of trying to simplify the model and we mightjust need to provide eaier to use apis to extension writers which wouldsave them effort in checking the different options. A ggood API is key inmaking sure that we get a consistent implementation and upgrade of phpfunctions.


Andi

At 03:13 PM 8/16/2005 -0700, Andrei Zmievski wrote:

It does make the engine more complicated though, because we can't justcheck for UG(unicode) and expect all identifiers to be of the same type.We would actually need to amend a lot of API functions to include passingthe identifier type along, e.g. zend_get_active_function() would need toreturn the identifier type along with the identifier itself.
-Andrei

On Aug 16, 2005, at 1:36 PM, Andi Gutmans wrote:
IIRC if unicode_semnantics=on, we agreed to use Unicode for array offsetsand properties (and do auto-conversion). however, if unicode = off, weshould not do auto conversion but allow php users to manually createunicode data. when it comes to arrays we agreed that in this case theycan use strings and unicode as they wish (makes sense for apps that can'tmake the complete move but can unicode-enable some of the app, forexample, a web service).so bottom line, i dont think we can expect class name and property to bein the same encoding unless we hard code it, but i like the flexibilityof being able to use unicode strings when unicode_semantics is off....
(this took me far too long to write :)


--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php

[PHP-DEV] Re: [ZEND-ENGINE-CVS] cvs: ZendEngine2 / zend.c

Reply via email to