Hi Dan,
Thanks for the explanation. I hadn't looked at the MapReduce wiki
page for a while, so I missed that point about link walking. Also,
thanks for clarifying that the reduce phase is performed on the
initiating node. Although I'm not using reduce functionality for now,
it's an importa
Hi Dmitry,
Regarding MapReduce query performance in a cluster, map phases are run in
parallel, so adding more machines to the cluster means more map functions can
be run simultaneously. Reduce phases currently run only on the node that
initiated the query, so additional machines will not affect the
On Fri, Jun 25, 2010 at 8:25 PM, Ryan Tilder wrote:
> Hi, Dmitry. There are some gaps in the information you included here that
> might help clarify what's going on so I'm going to just rattle off some
> questions for clarification.
> Is your test driver only making requests of a single EC2 insta
Hi, Dmitry. There are some gaps in the information you included here that
might help clarify what's going on so I'm going to just rattle off some
questions for clarification.
Is your test driver only making requests of a single EC2 instance? Or are
you querying all 7 nodes directly in some sort of
Greetings.
I tried running Riak with the Bitcask backend on 7 Amazon EC2 standard
large instances (7.5 GB RAM, 4 EC2 CPU units) and performed some
tests.
For comparison, I built up the following Riak clusters:
7 physical node ring
1 physical node ring (on one of the 7 instances, but I ran the tests