Re: Which of these VPS configurations would perform better for Cassandra ?

2013-08-04 Thread Radim Kolar
with 2 GB RAM be prepared to expect crashes because it hardly can handle peaks with increased memory consumption by compaction, validation, etc. KVM works good only if you are using recent version and virtio drivers and provider is not overselling memory. At shared hosting you will not be able

Re: Which of these VPS configurations would perform better for Cassandra ?

2013-08-04 Thread Rajkumar Gupta
okay, so what should a workable VPS configuration to start with & minimum how many nodes to start with(2 ok?) ? Seriously I cannot afford the tensions of colocation setup. My hosting provider provides SSD drives with KVM virtualization.

Re: Which of these VPS configurations would perform better for Cassandra ?

2013-08-04 Thread David Schairer
Of course -- my point is simply that if you're looking for speed, SSD+KVM, especially in a shared tenant situation, is unlikely to perform the way you want to. If you're building a pure proof of concept that never stresses the system, it doesn't matter, but if you plan an MVP with any sort of s

Re: Which of these VPS configurations would perform better for Cassandra ?

2013-08-04 Thread Radim Kolar
workable configuration depends on your requirements. You need to develop own testing procedure. How much data will have whats 95 percentile response time target size of rows number of columns per row data grow rate data rewrite rate ttl expiration used never aim for "minimum". Cassandra has hug

Re: org.apache.cassandra.io.sstable.CorruptSSTableException

2013-08-04 Thread Keith Wright
Re-sending hoping to get some help. Any ideas would be much appreciated! From: Keith Wright mailto:kwri...@nanigans.com>> Date: Friday, August 2, 2013 3:01 PM To: "user@cassandra.apache.org" mailto:user@cassandra.apache.org>> Subject: org.apache.cassandra.io.sst

Re: Which of these VPS configurations would perform better for Cassandra ?

2013-08-04 Thread Ben Bromhead
If you want to get a rough idea of how things will perform, fire up YCSB (https://github.com/brianfrankcooper/YCSB/wiki) and run the tests that closest match how you think your workload will be (run the test clients from a couple of beefy AWS spot-instances for less than a dollar). As you are a

Better to have lower or greater cardinality for partition key in CQL3?

2013-08-04 Thread David Ward
Hello, Was curious what people had found to be better for structuring/modeling data into C*? With my data I have two primary keys, one 64 bit int thats 0 - 50 million ( its unlikely to go higher then 70 million ever ) and another 64 bit that's probably close to hitting a trillion in the next