Yes, without the JSON_UNESCAPED_UNICODE flag, all characters with Unicode values >= 0x80 are escaped. That's the default behavior.
On Tue, Jan 5, 2016 at 2:45 PM, Andrea Faulds <a...@ajf.me> wrote: > Hi Eddie, > > Eddie Kohler wrote: >> >> The U+2028 LINE SEPARATOR and U+2029 PARAGRAPH SEPARATOR >> characters are allowed unescaped in JSON strings, but *not* allowed >> unescaped >> in Javascript. This is widely considered a minor wart in the JSON >> specification. >> <https://medium.com/joys-of-javascript/json-js-42a28471221d> >> >> As a result, the JSON_UNESCAPED_UNICODE flag is dangerous to use when >> generating HTML. For example, this will generate a Javascript error >> ("Unexpected >> token ILLEGAL") in the user's browser: >> >> ``` >> $x = mb_convert_encoding('
', 'UTF-8', 'HTML-ENTITIES'); >> echo '<script>x = ', json_encode($x, JSON_UNESCAPED_UNICODE), >> ';</script>'; >> ``` >> >> The proposal is for `json_encode(..., JSON_UNESCAPED_UNICODE)` to >> escape the U+2028 and U+2029 characters as \u2028 and \u2029. A new flag, >> JSON_UNESCAPED_LINE_TERMINATORS, preserves the former behavior. >> >> It's important to note that this change *only* affects the non-default >> JSON_UNESCAPED_UNICODE flag. > > > This sounds reasonable. I'd like to ask, though, does this mean that without > that flag, U+2028 and U+2029 are always escaped? > > Thanks. > -- > Andrea Faulds > https://ajf.me/ > > -- > PHP Internals - PHP Runtime Development Mailing List > To unsubscribe, visit: http://www.php.net/unsub.php > -- PHP Internals - PHP Runtime Development Mailing List To unsubscribe, visit: http://www.php.net/unsub.php