>>>>>> Unicode code points can be defined with \u, but PHP6 breaks >>>>>> existing octal and hex escape sequences. >>>> >>>> I don't understand what this means... >>> >>> I think I know... >>> >>> I have code like this, somewhere: >>> >>> if (preg_match("|[\xF0-\xFF]|", $data)){ >>> $data = un_microsuck($data); >>> } >>> >>> un_microsuck() basically detects and converts any of the goof-ball >>> extended ASCII from MS products (Word, Outlook, etc) to an HTML >>> equivalent character. >>> >>> But now \xF0 isn't going to be ASCII 128 anymore, is it? >> >> \xF0 never was ASCII. ASCII (ISO-646) is 7bit character set. \xF0 is >> decimal 240. It is 8bit. > > Don't tell me. > > Tell Microsoft. > > Cuz I sure as heck get a LOT of input data >> \x7f and I have to do > something reasonable with it... > > And I did say "extended ASCII" in the other paragraph, after all... > >>> Or maybe \xF0 will "work" but the octal \360 won't? >> >> Are you sure that you can't do that by setting >> unicode.something_encoding to iso-8859-1 or windows-1252? > > I dunno. > > Doesn't really matter if I can't set those in .htaccess, that's for sure.
All unicode. settings except unicode.semantics are PHP_INI_ALL. >From README.UNICODE ---- Script Encoding =============== ... If you cannot change the encoding system wide, you can use a pragma to override the INI setting in a local script: <?php declare(encoding = 'Shift-JIS'); ?> ---- -- Tomas -- PHP Internals - PHP Runtime Development Mailing List To unsubscribe, visit: http://www.php.net/unsub.php