First I would try to simplify your architecture. Get everything onto the same
OS.
Then change the topology so you have 1 job tracker, and 4 nodes that ran both
Cassandra and Hadoop tasks. So that reading and mapping the data is happening
on the same nodes. Reads from cassandra happen as range
Hello Cassandra users,
I am trying to read and process data in Cassandra using Hadoop. I have a 4-node
Cassandra cluster, and an 8-node Hadoop cluster:- 1 Namenode/Jobtracker- 7
Datanodes/Tasktrackers (4 of them are also hosting Cassandra)
I am using Cassandra 1.2 beta, Hadoop 0.20.2, java 1.6_u