Can you extrapolate what you mean by appending the keys? Is that specifying a bucket as storing multiple values, or is there some other way to model a list of things with fast read/write times?
-Anthony On Thu, Jan 13, 2011 at 10:37:18AM -0500, nx wrote: > I think you can avoid listing all keys in a bucket by maintaining a > separate object that contains a list of the current keys. I usually > append the keys to a "/bucket/_collection" object. > > On Thu, Jan 13, 2011 at 9:27 AM, Sean Cribbs <s...@basho.com> wrote: > >> > >> Unfortunately, even if additional nodes yield linear performance > >> gains, the m/r overhead seems very large -- if I'm getting 1.5 seconds > >> to process 1,000 items on one node, it seems apparent that I should > >> get roughtly 1.5 seconds to process 3,000 items on 3 nodes, which > >> still is awfully slow. > >> > >> Do you know how Riak compares to HBase, MongoDB or Cassandra for large > >> dataset processing and analysis with m/r, when talking hundreds of > >> millions, or even billions of keys? It would seem that key traversal > >> performance would preventing Riak from competing in that space. Maybe > >> you could do something with Riak Search, but I'm not sure if it would > >> comparable. > > > > To be fair, you can't do a microbenchmark and then try to extrapolate it to > > large datasets; things change at scale. Also, key-listing has been a known > > limitation of Riak for a long time, and one we have been quite vocal about. > > There have been improvements recently, but it's still an O(N) computation > > where N is the total number of keys stored in the cluster. Therefore, it's > > important to structure your data such that you limit the use of key lists. > > Compare performance after you have done that, and run your benchmark on > > something other than a single node (4 or more in a cluster is best), with a > > dataset that approximates the target size. > > > > Sean Cribbs <s...@basho.com> > > Developer Advocate > > Basho Technologies, Inc. > > http://basho.com/ > > > > > > _______________________________________________ > > riak-users mailing list > > riak-users@lists.basho.com > > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com > > > > _______________________________________________ > riak-users mailing list > riak-users@lists.basho.com > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com -- ------------------------------------------------------------------------ Anthony Molinaro <antho...@alumni.caltech.edu> _______________________________________________ riak-users mailing list riak-users@lists.basho.com http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com