FYI: http://techblog.netflix.com/2012/02/aegisthus-bulk-data-pipeline-out-of.html
-Jeremiah On Aug 30, 2013, at 9:21 AM, "Hiller, Dean" <dean.hil...@nrel.gov> wrote: > is there a SSTableInput for Map/Reduce instead of ColumnFamily (which uses > thrift)? > > We are not worried about repeated reads since we are idempotent but would > rather have the direct speed (even if we had to read from a snapshot, it > would be fine). > > (We would most likely run our M/R on 4 nodes of the 12 nodes we have since we > have RF=3 right now). > > Thanks, > Dean