Re: Map-Reduce on top of cassandra

Jeremy Hanna Mon, 14 Mar 2011 08:35:07 -0700

Can you go into the #cassandra channel and ask your question?  See if 
jeromatron or driftx are around.  That way there can be a back and forth about 
settings and things.


http://webchat.freenode.net/?channels=#cassandra

On Mar 14, 2011, at 10:06 AM, Or Yanay wrote:

> Hi All,
>  
> I am trying to write some map-reduce tasks so I can find out stuff like – how 
> many records have X status?
> I am using 0.7.0 and have 5 nodes with ~100G of data on each node.
>  
> I have written the code based on the word_count example and the map-reduce is 
> running successfully BUT is extremely slow (about 2 hours for the simplest 
> key count).
>  
> I am now looking to track down the slowness and tune my process, or explore 
> alternative ways to achieve the same goal.
>  
> Can anyone point me to a way to tune my map-reduce job?
> Does anyone have any experience exploring Cassandra data with Hadoop cluster 
> configuration? ( As suggested 
> inhttp://wiki.apache.org/cassandra/HadoopSupport#ClusterConfig)
>  
> Thanks,
> Orr
>

Re: Map-Reduce on top of cassandra

Reply via email to