Re: How Cassandra determines the splits

2012-05-02 Thread Patrik Modesto
Hi, I had a similar problem with Cassandra 0.8.x. It occurred when Cassandra was configured with rpc_address: 0.0.0.0 and the Hadoop job was started from outside the Cassandra cluster. With version 1.0.x the problem is gone. You can debug the splits with Thrift. This is a copy&paste part of my s

How Cassandra determines the splits

2012-05-01 Thread Filippo Diotalevi
Hi, I'm having problems in my Cassandra/Hadoop (1.0.8 + cdh3u3) cluster related to how Cassandra splits the data to be processed by Hadoop. I'm currently testing a MapReduce job, starting from a CF of roughly 1500 rows, with cassandra.input.split.size 10 and cassandra.range.batch.size 1, but what
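
For reference, the arithmetic behind those settings can be sketched as follows. This is only a rough estimate and assumes every split holds about cassandra.input.split.size rows. Cassandra works from its own row-count estimates, so the number of splits it actually reports may differ. The helper name expected_split_count is hypothetical and not part of Cassandra or Hadoop.

```python
import math


def expected_split_count(estimated_rows: int, split_size: int) -> int:
    """Rough number of input splits, assuming about split_size rows per split.

    Hypothetical helper for illustration only; it is not a Cassandra API.
    """
    if split_size <= 0:
        raise ValueError("split_size must be positive")
    return math.ceil(estimated_rows / split_size)


rows = 1500      # approximate CF size quoted in the post
split_size = 10  # value of cassandra.input.split.size in the post
print(expected_split_count(rows, split_size))  # prints 150
```

Comparing this figure with the number of map tasks Hadoop actually launches is one quick way to see whether the split calculation is behaving as expected.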