Hi, We are evaluating Cassandra 2.1 for our new production system. The following are the requirements:
1. 15K writes/sec with 5 KB blob in a single column of a column family, 2. This is followed by immediate Reads by multiple consumer threads, the read requires us to return entire Row and not only the recently updated column. 3. Around 1B unique keys. So I am assuming for the reads the data can be fetched from both Memtable (if it is not flushed) and Key-cache. (Row-cache is disabled) How can we optimize for higher Read throughput at the cost of Writes? Machine configuration, 10 Node cluster. - 24 core/machine - 64 GB RAM - 2TB*5 HDD per machine - 10G NIC Till now, for optimization we have done the following: 1. We have provided 4G keycache. 2. Created parition-key and clustering key in such a way that, for every new event a new cell is created, we never update a record. 3. No light-weight transactions 4. ReplicationFactor 3 5. Write quorum 2, Read quorum 2 How can we further optimize for the write/read patterns explained above? Thanks, Pracheer