Re: collate not support Unicode Variation Selector

2022-08-04 Thread Kyotaro Horiguchi
At Thu, 4 Aug 2022 19:01:33 +0900, 荒井元成 wrote in > Thank you for your reply. > > SQLServer supports Unicode Variation Selector, so I would like PostgreSQL to > support them as well. I studied the code a bit further, then found that simple comparison can ignore selectors by using nondeterministi

RE: collate not support Unicode Variation Selector

2022-08-04 Thread 荒井元成
; t...@sss.pgh.pa.us; pgsql-hackers@lists.postgresql.org Subject: Re: collate not support Unicode Variation Selector At Wed, 3 Aug 2022 20:12:53 +0900, 荒井元成 wrote in > Thank you for your reply. > > About 60,000 characters are registered in the IPAmj Mincho font designated by the

Re: collate not support Unicode Variation Selector

2022-08-04 Thread Kyotaro Horiguchi
At Wed, 3 Aug 2022 20:12:53 +0900, 荒井元成 wrote in > Thank you for your reply. > > About 60,000 characters are registered in the IPAmj Mincho font designated by > the national specifications. > It should be able to handle all characters. Yeah, it is one of that fonts. But I didn't know that MS-

RE: collate not support Unicode Variation Selector

2022-08-03 Thread 荒井元成
...@gmail.com Cc: t...@sss.pgh.pa.us; n2...@ndensan.co.jp; pgsql-hackers@lists.postgresql.org Subject: Re: collate not support Unicode Variation Selector At Wed, 3 Aug 2022 14:02:08 +1200, Thomas Munro wrote in > On Wed, Aug 3, 2022 at 12:56 PM Tom Lane wrote: > > Maybe it would help i

Re: collate not support Unicode Variation Selector

2022-08-02 Thread Kyotaro Horiguchi
At Wed, 3 Aug 2022 14:02:08 +1200, Thomas Munro wrote in > On Wed, Aug 3, 2022 at 12:56 PM Tom Lane wrote: > > Maybe it would help if you run the strings through normalize() first? > > I'm not sure if that can combine combining characters. > > I think the similarity between Latin combining cha

Re: collate not support Unicode Variation Selector

2022-08-02 Thread Thomas Munro
On Wed, Aug 3, 2022 at 12:56 PM Tom Lane wrote: > Maybe it would help if you run the strings through normalize() first? > I'm not sure if that can combine combining characters. I think the similarity between Latin combining characters and these ideographic variations might end there. I don't thi

Re: collate not support Unicode Variation Selector

2022-08-02 Thread Tom Lane
Thomas Munro writes: > So you want to match only strings that contain U&'\+003436' *not* > followed by a variation selector (as we also discussed at [1]). I'm > pretty sure that everything in PostgreSQL considers variation > selectors to be separate characters. There might be something that does

Re: collate not support Unicode Variation Selector

2022-08-02 Thread Thomas Munro
On Wed, Aug 3, 2022 at 12:09 PM 荒井元成 wrote: > D209007=# create table ivstest ( moji text collate "ja-x-icu" CONSTRAINT > firstkey PRIMARY KEY ); > D209007=# insert into ivstest (moji) values ( U&'\+003436' || U&'\+0E0101' || > U&'\+00304D'); > D209007=# insert into ivstest (moji) values ( U&'\+

RE: collate not support Unicode Variation Selector

2022-08-02 Thread 荒井元成
Hi, I hope supported Unicode Variation Selector on collate. I will resend it because there was a typo. D209007=# create table ivstest ( moji text collate "ja-x-icu" CONSTRAINT firstkey PRIMARY KEY ); D209007=# insert into ivstest (moji) values ( U&'\+003436' || U&'\+0E0101' || U&'\+00304D');

collate not support Unicode Variation Selector

2022-08-02 Thread 荒井元成
Hi, I hope supported Unicode Variation Selector on collate. D209007=# create table ivstesticu ( D209007(# moji text D209007(# ); D209007=# create table ivstest ( D209007(# moji text collate "ja-x-icu" CONSTRAINT firstkey PRIMARY KEY D209007(# ); D209007=# insert into ivstest (moji) values ( U&'