Hi,
I had a simillar problem with Cassandra 0.8.x and the problem was when
configured Cassandra with rpc_address: 0.0.0.0 and starting Hadoop job
from outside the Cassandra cluster. But with version 1.0.x the problem
is gone.
You can debug the splits with thrift. This is a copy&paste part of my
s
Hi,
I'm having problems in my Cassandra/Hadoop (1.0.8 + cdh3u3) cluster related to
how cassandra splits the data to be processed by Hadoop.
I'm currently testing a map reduce job, starting from a CF of roughly 1500
rows, with
cassandra.input.split.size 10
cassandra.range.batch.size 1
but what