On 5/6/10 3:26 PM, Stu Hood wrote:
> Ian: I think that as get_range_slice gets faster, the approach that Mark was
> heading toward may be considerably more efficient than reading the old value in
> the OutputFormat.
Interesting. I'm trying to understand the performance impact of the different ways to do this. Under Mark's approach, the prior values are pulled out of Cassandra in bulk in the mapper, then merged and written back to Cassandra in the reducer. The idea, as I understand it, is that a single get_range_slice is faster than the individual row fetches that my approach does in the reducer. Is that what you mean, or are you referring to something else?
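
To check that I'm describing the two approaches the same way you are, here is a rough sketch with a plain dict standing in for Cassandra. Everything in it is made up for illustration: the function names, the sample rows, and the "merge by summing" rule are assumptions, and none of it is the real Thrift or Hadoop API.

```python
from collections import defaultdict

# Hypothetical stand-in for a column family: row key -> stored value.
store = {"a": 1, "b": 2, "c": 3}
# Hypothetical new values produced by the job.
new_values = [("a", 10), ("b", 20), ("d", 40)]


def range_scan(table):
    """Stand-in for a bulk range read: one pass over every row."""
    return list(table.items())


def merge_with_bulk_read(table, updates):
    """Bulk approach: prior values enter the job as ordinary map input,
    and the reduce step merges everything that shares a key."""
    grouped = defaultdict(list)
    for key, value in range_scan(table) + list(updates):  # "map" side
        grouped[key].append(value)
    # "reduce" side: merging is assumed to be a simple sum.
    return {key: sum(values) for key, values in grouped.items()}


def merge_with_point_reads(table, updates):
    """Per-row approach: the reduce step looks up each prior value itself."""
    grouped = defaultdict(list)
    for key, value in updates:
        grouped[key].append(value)
    merged = dict(table)
    reads = 0
    for key, values in grouped.items():
        prior = table.get(key, 0)  # one individual fetch per distinct key
        reads += 1
        merged[key] = prior + sum(values)
    return merged, reads


bulk_result = merge_with_bulk_read(store, new_values)
point_result, point_reads = merge_with_point_reads(store, new_values)
```

Both paths end with the same merged rows; the difference is one range scan up front versus one lookup per distinct key in the reducer.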
thanks!
-Ian

--
Ian Kallen
blog: http://www.arachna.com/roller/spidaman
tweetz: http://twitter.com/spidaman
vox: 925.385.8426

