On Dec 1, 2013, at 8:36 AM, Graham Cox <graham....@bigpond.com> wrote:
> Scanning my entire hard drive (excluding hidden files), which took several > hours, sure I had plenty of collisions - but absolutely no false ones - they > all turned out to be genuine duplicates of existing files. This is using the > FNV-1a 64-bit hash + length approach. I have a drive sitting here that has a few *million* image files; I'd be willing to bet zero collisions. Maybe some time or other I'll try it out with my stripped down murmur hash. What would be interesting, if you have the time, is figure out how much you can shorten the hash without collisions... BTW, IIRC, one of the weaknesses with FNV has to do with strings of 0s, and as long as your image data is compressed, it will never contain long strings of 0s. -- Scott Ribe scott_r...@elevated-dev.com http://www.elevated-dev.com/ (303) 722-0567 voice _______________________________________________ Cocoa-dev mailing list (Cocoa-dev@lists.apple.com) Please do not post admin requests or moderator comments to the list. Contact the moderators at cocoa-dev-admins(at)lists.apple.com Help/Unsubscribe/Update your Subscription: https://lists.apple.com/mailman/options/cocoa-dev/archive%40mail-archive.com This email sent to arch...@mail-archive.com