Just wanted to follow up on this.

We were never able to achieve throughput scaling in the cloud.  We were able to 
verify that many of our cluster nodes and test servers were collocated on the 
same physical hardware (thanks Stu for the tip on the Rackspace REST API), and 
that performance on collocated nodes rose and fell in concert.  Ultimately we 
moved to dedicated hardware and throughput scaled as expected with additional 
nodes.

Thanks for everyone's help on this.  We had to move on in the interest of 
moving our project along, but I'd still be interested to see benchmarks from 
successful cloud installations.  Maybe with the node routing in 0.7 and larger 
cluster sizes, the cloud might become a more viable option for highly available 
high read throughput applications.

Best,

  -- Oren


On Jul 20, 2010, at 3:18 PM, Peter Schuller wrote:

>> (I'm hoping to have time to run my test on EC2 tonight; will see.)
> 
> Well, I needed three c1.xlarge EC2 instances running py_stress to even
> saturate more than one core on the c1.xlarge instance running a single
> cassandra node (at roughly 21k reqs/second)... Depending on how
> reliable vmstat/top is on EC2 to begin with.
> 
> Would have to revisit this later if I'm to get any sensible numbers
> that are even remotely trustworthy...
> 
> -- 
> / Peter Schuller

Reply via email to