Re: [Pharo-users] Hash collision

Martin McClure Mon, 19 Dec 2016 13:26:44 -0800

On 12/19/2016 01:27 AM, Andres Valloud wrote:

At first glance, that the failure code only sees "two" things when it should see "eight" seems to be problematic.
Perhaps the primitive insists on hashing byte objects, and there is a distinction between "byte" objects and "word" objects (whatever "word" means, presumably a constant width integer across all platforms). I haven't looked at the code.
From my perspective... back in the day that primitive used to hash bytes, and from what I saw here the failure code is hashing multi-byte things. If all of these observations are correct, then I'd say the failure code isn't doing what the primitive is doing, and in doing so it's introducing a lot of collisions that I'd like to believe the intended hash function wouldn't produce.


Ah, I see your concern.

As far as I can see, all classes that are using the StringHash primitive are actually byte objects, so things are, I believe, working as designed.

The only problem is that Ben did an experiment to see whether Float's hashing would be improved by using the StringHash primitive. It was not, because Float is not a byte object.

We could use an equivalent primitive to hash word objects, but I haven't found one.

We could also use a primitive to retrieve the bytes of a word object, and I haven't found one of those either. There are places that are converting the words to large integers and then hashing those, which would work for Floats.
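
To make the "convert the words to a large integer, then hash its bytes" idea concrete, here is a rough sketch in Python rather than Pharo. Everything in it is an assumption for illustration: the function names are hypothetical, the two-word big-endian layout is just one way to view a 64-bit float, and FNV-1a is only a stand-in for a byte-oriented hash. It is not the StringHash primitive or its failure code.

```python
import struct
from typing import Iterable, Tuple


def float_words(x: float) -> Tuple[int, int]:
    """Return the two 32-bit words (high, low) of an IEEE 754 double."""
    high, low = struct.unpack(">II", struct.pack(">d", x))
    return high, low


def words_to_large_integer(words: Iterable[int]) -> int:
    """Concatenate 32-bit words into one non-negative integer."""
    n = 0
    for w in words:
        n = (n << 32) | w
    return n


def byte_hash(data: bytes) -> int:
    """32-bit FNV-1a, used here only as a stand-in byte hash."""
    h = 0x811C9DC5  # FNV-1a 32-bit offset basis
    for b in data:
        h ^= b
        h = (h * 0x01000193) & 0xFFFFFFFF  # FNV 32-bit prime
    return h


def float_hash(x: float) -> int:
    """Hash all eight bytes of a float via its large-integer form."""
    n = words_to_large_integer(float_words(x))
    return byte_hash(n.to_bytes(8, "big"))
```

The point of going through the integer is that the byte hash then sees all eight bytes of the Float, not just two word-sized elements.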


-Martin
