Tzafrir Cohen <[EMAIL PROTECTED]> wrote on 31/7/02 17:25:
>On Wed, 31 Jul 2002, Oded >Arbel wrote: > >> BTW: I prefer storing >textual data in the database >as unicode >> (preferably utf8 to >facilitate easier display to >the web) encoded data >> in binary fields - it gives >predictable enough sorting, >and you neednot >> worry about character >sets, especially when aiming >for multi-lingual >> applications (and I do >consider english/hebrew >being multi-lingual >> enough to warrant >unicode). >> > >What if there is some English >text? case insensitive search >is quite >problematic in some cases >('aaa' comes after 'ZZZ'). true, but if you are willing to accept that (and I don't think its such a problem for most uses), then you are in the clear - no matter what encoding you choose, as long as you are being consistent about it. >Also, if the UTF8 text >includes nikud it has to be >ignored-away during the >sorting. why ? though have never tried sorting on text with nikud, but I fail to see the problem : why should it matter if alef with 'segol' sorts after alef with 'kamatz'? >IIRC proper utf collating >would do the above two (this >costs some extra >cpu cycles even when this >collating is not used, I >believe). Does mysql know about utf-8 ? -- Oded ================================================================= To unsubscribe, send mail to [EMAIL PROTECTED] with the word "unsubscribe" in the message body, e.g., run the command echo unsubscribe | mail [EMAIL PROTECTED]