On Mar 20, 2010, at 2:53 AM, Lenin Gali wrote:

> 1. Eventual consistency: Given a volume of 5K writes / sec and roughly 1500 
> writes are Updates per sec while the rest are inserts, what kind of latency 
> can be expected in eventual consistency?

Depending on the size of the cluster you're not looking at much latency at all. 
On the order of 10's of ms's.

> 2. Performance: Are there any bench marks on how many writes /sec and 
> reads/sec cassandra supports on an "n node" cluster? a Node can be of 
> variable size and would like to know the hardware/software details of the 
> cluster as well. 

Cassandra's performance is impressive. We had one node spike at 103,000 reads a 
second with a load of only about 6, which is high, but not alarmingly so.

> 3. EC2: Has any one implemented cassandra on EC2 and what kind transaction 
> volume are they using it for and how is their experience with cassandra on 
> EC2?.

We have a 15 node cluster on EC2. We have a patch that is a rack aware strategy 
specifically for EC2 zones where it replicates keys in a manner so that you 
have one key in each AZ. We run Cassandra across 3 AZ's on large instances with 
the ephemeral drives in a RAID0 setup with XFS.

You might also be interested in this:
http://stu.mp/2009/12/disk-io-and-throughput-benchmarks-on-amazons-ec2.html

> 4. Overhead and issues: What are typical nightmare scenario's one could face 
> when using Cassandra for heavy write / read intensive systems?

We haven't ran into any, but when we do find hot spots in the cluster we 
bootstrap a new node into the cluster with a token range that will alleviate 
the hot spot. This is rather painless in our experiences. 

> 5. Backups : If there is a  4 or 5 TB cassandra cluster what do you recommend 
> the backup scenario's could be?

There isn't one that I know of. This is what the replication factor is for. We 
keep three copies of each key in three different datacenters. That's our backup 
strategy. 

> Also, Does cassandra support counters?

Not yet, but there's active work happening in this area.

> Digg's article said they are going to contribute their work to open source 
> any idea when that would be?

Chris Goffinet is a committer to Cassandra. Digg's contributions are 
contributed back in an almost daily fashion.

--Joe

Reply via email to