Tread carefully here ... by forcing localilty ... you will sacrifice high
availability by algorithmically creating a bias and a single point of
failure in the cluster.

Cassandra made this easier to do ... and then there were many questions on
the user list about:

"why is my cluster is unbalanced?"
'why is one of my nodes is falling over when the others are not?"
etc...

On Thu, Nov 10, 2011 at 4:25 AM, Elias Levy <fearsome.lucid...@gmail.com>wrote:

> On Wed, Nov 9, 2011 at 12:00 PM, <riak-users-requ...@lists.basho.com>wrote:
>
>> From: Nate Lawson <n...@root.org>
>>
>>
>> We have been looking into ways to cluster keys to benefit from the
>> LevelDB backend's prefix compression. If we were doing a batch of lookups
>> and the keys from a given document could be grouped on a partition, they
>> could be read with less disk IO. However, consistent hashing in Riak
>> currently spreads keys randomly, based on SHA-1.
>>
>> What I think we could really use is a per-bucket configurable prefix
>> length for consistent hashing. We would then create keys of the form
>> "DocumentID:<key>" and tell Riak to only hash DocumentID to get the
>> partition. This way, keys from the same document would be clustered on the
>> same partition.
>>
>> Is this on your roadmap already? It seems like it wouldn't require too
>> many changes to Riak.
>>
>
> If you look at the bucket properties by fetching them, you'll notice that
> there is already an entry for specifying a key hashing function.  Its not
> documented, but you can read the Erlang code for the existing function
> (pretty much just a call to SHA1), and create your own to do whatever
> partitioning you want.
>
>
>
> _______________________________________________
> riak-users mailing list
> riak-users@lists.basho.com
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>
>
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to