Re: UTF-8 NFC/NFD paths issue

2011-02-10 Thread C. Michael Pilato
On 09/20/2010 10:38 AM, Erik Huelsmann wrote: > To recap, the change I'm proposing is that we check pathnames with NFC/D > aware comparison routines upon add_file() / add_directory() inside > libsvn_repos or libsvn_fs_* - of which I suspect it's easier to handle > inside the latter. In my proposal,
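
The snippet above is cut off, so the details of the proposal are not visible here. As a rough illustration only of what a normalization-aware check at add time could look like, here is a minimal Python sketch. The `Directory` class, the `add_entry` method and the `NormalizationCollision` exception are hypothetical names invented for this example; they are not part of libsvn_repos or libsvn_fs_*. Keeping the name exactly as the caller supplied it is also an assumption.

```python
import unicodedata


class NormalizationCollision(Exception):
    """Raised when a new name matches an existing one after normalization."""


class Directory:
    """Hypothetical stand-in for a directory node; not a Subversion API."""

    def __init__(self):
        # Maps the NFC form of each name to the name as it was originally given.
        self._by_key = {}

    def add_entry(self, name):
        # Compare on the NFC form so that composed and decomposed spellings
        # of the same name are treated as one entry.
        key = unicodedata.normalize("NFC", name)
        existing = self._by_key.get(key)
        if existing is not None:
            raise NormalizationCollision(
                f"{name!r} collides with existing entry {existing!r}"
            )
        # Assumption: the name is stored unchanged; only the lookup key
        # is normalized.
        self._by_key[key] = name

    def names(self):
        return sorted(self._by_key.values())
```

With this sketch, adding "caf\u00e9" and then "cafe\u0301" to the same `Directory` raises `NormalizationCollision`, while names that differ after normalization are accepted as usual.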

Re: UTF-8 NFC/NFD paths issue

2010-09-20 Thread Erik Huelsmann
Sorry to have left the discussion running so long without contributing to it myself. The reason I started about changing the repository / fs is because it is where we store the dataset that we'll need to support forever: working copies get destroyed and checked out over and over every hour, every d

Re: UTF-8 NFC/NFD paths issue

2010-09-20 Thread Branko Čibej
On 17.09.2010 13:22, Greg Stein wrote: > On Thu, Sep 16, 2010 at 19:26, Daniel Shahaf wrote: >> Greg Stein wrote on Thu, Sep 16, 2010 at 00:59:59 -0400: >>> On Wed, Sep 15, 2010 at 23:35, Daniel Shahaf >>> wrote: >> ... If yes, then we infer that no two in-repository paths (which are

Re: UTF-8 NFC/NFD paths issue

2010-09-19 Thread Daniel Shahaf
Greg Stein wrote on Sat, Sep 18, 2010 at 15:55:57 -0400: > On Sat, Sep 18, 2010 at 04:42, Daniel Shahaf wrote: > > Greg Stein wrote on Fri, Sep 17, 2010 at 07:22:12 -0400: > >> On Thu, Sep 16, 2010 at 19:26, Daniel Shahaf > >> wrote: > >> > Greg Stein wrote on Thu, Sep 16, 2010 at 00:59:59 -0400

Re: UTF-8 NFC/NFD paths issue

2010-09-18 Thread Greg Stein
On Sat, Sep 18, 2010 at 04:42, Daniel Shahaf wrote: > Greg Stein wrote on Fri, Sep 17, 2010 at 07:22:12 -0400: >> On Thu, Sep 16, 2010 at 19:26, Daniel Shahaf wrote: >> > Greg Stein wrote on Thu, Sep 16, 2010 at 00:59:59 -0400: >> >> On Wed, Sep 15, 2010 at 23:35, Daniel Shahaf >> >> wrote: >>

Re: UTF-8 NFC/NFD paths issue

2010-09-18 Thread Daniel Shahaf
Greg Stein wrote on Fri, Sep 17, 2010 at 07:22:12 -0400: > On Thu, Sep 16, 2010 at 19:26, Daniel Shahaf wrote: > > Greg Stein wrote on Thu, Sep 16, 2010 at 00:59:59 -0400: > >> On Wed, Sep 15, 2010 at 23:35, Daniel Shahaf > >> wrote: > >... > >> > If yes, then we infer that no two in-repository

Re: UTF-8 NFC/NFD paths issue

2010-09-17 Thread Greg Stein
On Thu, Sep 16, 2010 at 19:26, Daniel Shahaf wrote: > Greg Stein wrote on Thu, Sep 16, 2010 at 00:59:59 -0400: >> On Wed, Sep 15, 2010 at 23:35, Daniel Shahaf wrote: >... >> > If yes, then we infer that no two in-repository paths (which are >> > bytewise different) canonicalize to the same byte s

Re: UTF-8 NFC/NFD paths issue

2010-09-16 Thread Daniel Shahaf
Greg Stein wrote on Thu, Sep 16, 2010 at 00:59:59 -0400: > On Wed, Sep 15, 2010 at 23:35, Daniel Shahaf wrote: > > Erik Huelsmann wrote on Wed, Sep 15, 2010 at 23:20:06 +0200: > >> Yesterday, I was talking to CMike about our long-standing issue with UTF-8 > >> strings designating a certain path no

Re: UTF-8 NFC/NFD paths issue

2010-09-15 Thread Greg Stein
On Wed, Sep 15, 2010 at 23:35, Daniel Shahaf wrote: > Erik Huelsmann wrote on Wed, Sep 15, 2010 at 23:20:06 +0200: >> Yesterday, I was talking to CMike about our long-standing issue with UTF-8 >> strings designating a certain path not necessarily being equal to other >> strings designating the sa

Re: UTF-8 NFC/NFD paths issue

2010-09-15 Thread Daniel Shahaf
Erik Huelsmann wrote on Wed, Sep 15, 2010 at 23:20:06 +0200: > Yesterday, I was talking to CMike about our long-standing issue with UTF-8 > strings designating a certain path not necessarily being equal to other > strings designating the same path. The issue has to do with NFC (composed) > and NFD

UTF-8 NFC/NFD paths issue

2010-09-15 Thread Erik Huelsmann
Yesterday, I was talking to CMike about our long-standing issue with UTF-8 strings designating a certain path not necessarily being equal to other strings designating the same path. The issue has to do with NFC (composed) and NFD (decomposed) representation of Unicode characters. CMike nicely call
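
To make the NFC/NFD mismatch described in the original post concrete, here is a small Python sketch using the standard `unicodedata` module. The example name "café" and the helper `same_path` are chosen for illustration and do not come from the thread.

```python
import unicodedata

# The same visible name in two Unicode representations.
composed = "caf\u00e9"     # NFC: "é" is the single code point U+00E9
decomposed = "cafe\u0301"  # NFD: "e" followed by combining acute accent U+0301


def same_path(a, b):
    """Compare two names after bringing both to NFC (illustrative helper)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# The UTF-8 byte sequences differ, so a bytewise comparison sees two names.
bytes_differ = composed.encode("utf-8") != decomposed.encode("utf-8")

# After normalization the two spellings compare equal.
equal_after_normalizing = same_path(composed, decomposed)
```

Here `bytes_differ` and `equal_after_normalizing` are both true: the two strings are distinct as UTF-8 byte sequences yet designate the same name once normalized, which is the situation the post describes.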