Re: Updating (as opposed to just setting) Cassandra data via Hadoop

Ian Kallen Fri, 07 May 2010 06:54:58 -0700

On 5/6/10 3:26 PM, Stu Hood wrote:

Ian: I think that as get_range_slice gets faster, the approach that Mark was 
heading toward may be considerably more efficient than reading the old value in 
the OutputFormat.

Interesting, I'm trying to understand the performance impact of thedifferent ways to do this. Under Mark's approach, the prior values arepulled out of Cassandra in the mapper in bulk, then merged and writtenback to Cassandra in the reducer; the get_range_slice is faster than theindividual row fetches that my approach does in the reducer. Is thatwhat you mean or are you referring to something else?

thanks!
-Ian


--
Ian Kallen
blog: http://www.arachna.com/roller/spidaman
tweetz: http://twitter.com/spidaman
vox: 925.385.8426

Re: Updating (as opposed to just setting) Cassandra data via Hadoop

Reply via email to