Re: Windows UTF-8, non-ICU collation trouble

2020-01-11 Thread Noah Misch
On Wed, Dec 11, 2019 at 01:54:47PM +1300, Thomas Munro wrote: > On Tue, Dec 10, 2019 at 10:29 PM Noah Misch wrote: > > This does suggest some set of CompareString* parameters is free from the > > problem. If that's right, we could offer collations based on that. (I'm > > not > > sure it would b

Re: Windows UTF-8, non-ICU collation trouble

2019-12-10 Thread Thomas Munro
On Tue, Dec 10, 2019 at 10:29 PM Noah Misch wrote: > On Tue, Dec 10, 2019 at 03:41:15PM +1300, Thomas Munro wrote: > > I ran a variation of your program on Appveyor's Studio/Server 2019 > > image, and the result was the same: it thinks that cmp(s1, s2) == 0, > > cmp(s2, s3) == 0, but cmp(s1, s3) =

Re: Windows UTF-8, non-ICU collation trouble

2019-12-10 Thread Noah Misch
On Tue, Dec 10, 2019 at 03:41:15PM +1300, Thomas Munro wrote: > On Fri, Dec 6, 2019 at 8:33 PM Noah Misch wrote: > > On Fri, Dec 06, 2019 at 07:56:08PM +1300, Thomas Munro wrote: > > > On Fri, Dec 6, 2019 at 7:34 PM Noah Misch wrote: > > > > We use system UTF-16 collation to implement UTF-8 colla

Re: Windows UTF-8, non-ICU collation trouble

2019-12-09 Thread Thomas Munro
On Fri, Dec 6, 2019 at 8:33 PM Noah Misch wrote: > On Fri, Dec 06, 2019 at 07:56:08PM +1300, Thomas Munro wrote: > > On Fri, Dec 6, 2019 at 7:34 PM Noah Misch wrote: > > > We use system UTF-16 collation to implement UTF-8 collation on Windows. > > > The > > > PostgreSQL security team received a

Re: Windows UTF-8, non-ICU collation trouble

2019-12-05 Thread Noah Misch
On Fri, Dec 06, 2019 at 07:56:08PM +1300, Thomas Munro wrote: > On Fri, Dec 6, 2019 at 7:34 PM Noah Misch wrote: > > We use system UTF-16 collation to implement UTF-8 collation on Windows. The > > PostgreSQL security team received a report, from Timothy Kuun, that this > > collation does not upho

Re: Windows UTF-8, non-ICU collation trouble

2019-12-05 Thread Thomas Munro
On Fri, Dec 6, 2019 at 7:34 PM Noah Misch wrote: > We use system UTF-16 collation to implement UTF-8 collation on Windows. The > PostgreSQL security team received a report, from Timothy Kuun, that this > collation does not uphold the "symmetric law" and "transitive law" that we > require for btre

Windows UTF-8, non-ICU collation trouble

2019-12-05 Thread Noah Misch
We use system UTF-16 collation to implement UTF-8 collation on Windows. The PostgreSQL security team received a report, from Timothy Kuun, that this collation does not uphold the "symmetric law" and "transitive law" that we require for btree operator classes. The attached test program demonstrate