> On 22 Mar 2017, at 09:05, Alastair Houghton <alast...@alastairs-place.net> 
> wrote:
> 
> In the context of filesystems (and specifically filenames), the phrases “bag 
> of bytes” and “bunch of bytes” have a fairly specific meaning.  The point is 
> that the filesystem doesn’t inspect the bytes it’s given, and doesn’t care 
> what they represent (about the only exception is that it probably doesn’t 
> support embedded NULs).  It isn’t suggesting that the names are treated as an 
> unordered set of bytes (that’d just be silly).  It’s just expressing the fact 
> that the filesystem doesn’t care what they are - it may compare them, and if 
> it does so, it will use binary ordering (not some other collation sequence) 
> and won’t worry about things like case or encoding at all.

That doesn’t sound sensible at all. It means you can create a filename with a 
byte sequence that isn’t valid UTF-8 and which likely then cannot be accessed 
by MacOS/iOS processes.

It means that you could create multiple files with the “same" name, and that 
doesn’t sound like a win either. e.g. Aandi’s examples of LATIN SMALL LETTER E 
(U+0065)
COMBINING ACUTE ACCENT (U+0301) and LATIN SMALL LETTER E WITH ACUTE (U+00E9)

How can a “next gen” filesystem avoid using Unicode rules when handling 
filenames?

Chris
_______________________________________________

Cocoa-dev mailing list (Cocoa-dev@lists.apple.com)

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
https://lists.apple.com/mailman/options/cocoa-dev/archive%40mail-archive.com

This email sent to arch...@mail-archive.com

Reply via email to