Norbert Preining dijo [Mon, Aug 10, 2009 at 08:55:27PM +0200]: > On Mo, 10 Aug 2009, Roger Leigh wrote: > > Of course there's a penalty for certain operations. But UTF-8 is about > > as compact as an extended encoding is going to get. > > Rubbish. You know why in Japan and other Asian countries UTF8 is not > so common? Because many of their glyphs need 4 (four!) bytes, while > for example jis-2022 (AFAIR) is much more compact. > > We are not living in an ASCII world anymore.
It's not that much about the size as it is about backwards compatibility. We users of Latin-based alphabets migrate easily to UTF8, with occassional problems where we use diacritics. Eastern Asian encodings are _completely_ incompatible with UTF8, so it is just not possible to tolerate broken text every now and then. Everything just breaks completely. -- Gunnar Wolf • gw...@gwolf.org • (+52-55)5623-0154 / 1451-2244 -- To UNSUBSCRIBE, email to debian-devel-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org