On 14/10/14 23:48, Johannes Schlüter wrote:

On Tue, 2014-10-14 at 23:18 +0300, Aleksey Tulinov wrote:
Very good point. I'll give another example: is there a substring "s" in
string "Maße"? If it's case-sensitive search, when there is no such
substring, but if it's case-insensitive search, then "ß" folds into "ss"
and substring "s" appears.

In Unicode 5.1 there is "ẞ" U+1E9E LATIN CAPITAL LETTER SHARP S.

(The point of this post mostly is to show that there is another
dimension making this even more complicated, again - different Unicode
versions)
>

It's still in Unicode 7.0. According to Unicode character database "ß" uppercase is "SS", "ẞ" lowercase is "ß", both casefolds into "ss". Thus upper(lower("ẞ")) should produce "SS". There is another dimension indeed.

--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to