[PHP-DEV] Re: [ZEND-ENGINE-CVS] cvs: ZendEngine2 / zend.c

Andrei Zmievski Wed, 17 Aug 2005 14:24:01 -0700

I agree with all of this. Anyone want to update README.UNICODE to reflect this change?


-Andrei



On Aug 16, 2005, at 4:52 PM, Andi Gutmans wrote:

I think we should make the following assumptions:
a) Being able to create and manipulate IS_UNICODE zvals when unicode_semantics=off will be very useful to people, including the exposing of the ICU extension.
b) Defining Unicode identifiers like classes/properties/functions if unicode_semantics=off does not seem useful and should be prohibited.
c) People can always find ways of misusing the language & APIs to reach a state which they shouldn't be reaching. For example, assuming (a) & (b), using create_function to misuse the engine and create a Unicode function name when Unicode=off.
I don't believe we can or should enforce every possibility of misuse or we'll bloat the code and will never reach perfection. That said, we probably can enforce the obvious places where people try to define unicode classes/functions/properties when unicode_semantics=off.
btw, I'm only referring to identifiers. If unicode=off then I believe things like arrays should support IS_UNICODE keys/values in addition to IS_STRING for reasons as in (a). As per the original design, those two wouldn't match though, as they would when we're in full-blown unicode mode.
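To make the array-key point concrete, here is a minimal sketch, not engine code. Every name in it (key_type, array_key, keys_match) is hypothetical, and it assumes binary string keys are plain ASCII so they can be widened to UTF-16 code units without a real converter. It only shows the matching rule described above: with unicode_semantics=off an IS_STRING key and an IS_UNICODE key stay distinct, while in full unicode mode they compare equal.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical tags mirroring the IS_STRING / IS_UNICODE zval types. */
typedef enum { KEY_IS_STRING, KEY_IS_UNICODE } key_type;

/* Hypothetical array key: exactly one of u / s is used, depending on type. */
typedef struct {
    key_type type;
    const unsigned short *u;  /* UTF-16 code units when KEY_IS_UNICODE */
    const char *s;            /* bytes when KEY_IS_STRING (assumed ASCII) */
    size_t len;               /* length in code units or bytes */
} array_key;

/* Read position i of a key as a 16-bit unit, widening bytes if needed. */
static unsigned short unit_at(const array_key *k, size_t i)
{
    return k->type == KEY_IS_UNICODE ? k->u[i] : (unsigned char)k->s[i];
}

/* With unicode_semantics off, keys of different types never match.
 * With it on, string keys are treated as if auto-converted to Unicode. */
bool keys_match(const array_key *a, const array_key *b, bool unicode_semantics)
{
    assert(a != NULL && b != NULL);
    if (a->len != b->len)
        return false;
    if (!unicode_semantics && a->type != b->type)
        return false;
    for (size_t i = 0; i < a->len; i++) {
        if (unit_at(a, i) != unit_at(b, i))
            return false;
    }
    return true;
}
```

The type check is the only place the mode matters here, which is the kind of detail a shared helper could hide from extension writers.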
Dmitry, do you think that not allowing unicode identifiers when unicode=off would be hard to accomplish? It would make life easier when it comes to the code that sparked this discussion (and maybe harder in other cases).
Due to (c) I'm kind of worried about trying to simplify the model, and we might just need to provide easier-to-use APIs to extension writers which would save them effort in checking the different options. A good API is key in making sure that we get a consistent implementation and upgrade of PHP functions.
Andi

At 03:13 PM 8/16/2005 -0700, Andrei Zmievski wrote:
It does make the engine more complicated though, because we can't just check for UG(unicode) and expect all identifiers to be of the same type. We would actually need to amend a lot of API functions to include passing the identifier type along, e.g. zend_get_active_function() would need to return the identifier type along with the identifier itself.
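As a rough illustration of what "passing the identifier type along" could look like, here is a minimal sketch. It is not the Zend API: ident_type, ident, function_entry and get_function_ident are all hypothetical names made up for this example. The point is only that the caller gets the type tag back together with the name instead of inferring it from a global flag.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical tags mirroring the IS_STRING / IS_UNICODE zval types. */
typedef enum { ID_IS_STRING, ID_IS_UNICODE } ident_type;

/* Hypothetical identifier: the name plus the tag that says how to read it. */
typedef struct {
    ident_type type;
    const void *name;  /* char data when ID_IS_STRING, UTF-16 units when ID_IS_UNICODE */
    size_t len;        /* length in code units, not bytes */
} ident;

/* Hypothetical function record holding its own identifier. */
typedef struct {
    ident name;
} function_entry;

/* Returns the identifier type and hands the name back through out-params,
 * so the caller never has to guess the type from a global setting. */
ident_type get_function_ident(const function_entry *fn,
                              const void **name, size_t *len)
{
    assert(fn != NULL && name != NULL && len != NULL);
    *name = fn->name.name;
    *len = fn->name.len;
    return fn->name.type;
}
```

A caller would branch on the returned tag before touching the name, which is the extra work each amended API function would push onto its users.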
-Andrei

On Aug 16, 2005, at 1:36 PM, Andi Gutmans wrote:
IIRC if unicode_semantics=on, we agreed to use Unicode for array offsets and properties (and do auto-conversion). However, if unicode = off, we should not do auto-conversion but allow PHP users to manually create unicode data. When it comes to arrays we agreed that in this case they can use strings and unicode as they wish (makes sense for apps that can't make the complete move but can unicode-enable some of the app, for example, a web service). So bottom line, I don't think we can expect class name and property to be in the same encoding unless we hard code it, but I like the flexibility of being able to use unicode strings when unicode_semantics is off....
(this took me far too long to write :)


--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php
