> > We need to address that, then. If we're doing > unicode, we damn well need to do it right--å is > å, regardless of whether it's composed or > decomposed.
Agreed -- on some level. But If we want to implement Larry's :u0 (bytes) and :u1 (code points) levels we need to have also the "more raw" comparisons available, somehow. (I do not remember whether Larry specified would :u2 do by default some of the Unicode normalizations, thus doing (de)compositions.) > If people want low-level binary comparisons (and > generally we *shouldn't* for most things) then > they'll need to force the string to binary. And I'm not certain whether "forcing to binary" is the right visual image or approach here. Maybe we need some sort of "pragma" support so that we can tweak the ":u level"? The default level could well be :u2, the highest we can do without picking some "language" rules.