Re: Multiple input column families in Cassandra Hadoop mapreduce

2011-07-15 Thread Jeremy Hanna
+1 - We do a lot of this with Pig - joining over several column families. Pig makes it just work. I think Hive does something similar. Unless you really need that much control over your process, I would really use one of those two. On Jul 15, 2011, at 5:28 PM, Jonathan Ellis wrote: > The eas

Re: Multiple input column families in Cassandra Hadoop mapreduce

2011-07-15 Thread Jonathan Ellis
The easy answer is "use something like Pig or Hive that does these joins for you under the hood." Not actually sure what the hard answer is. :) On Fri, Jul 15, 2011 at 1:34 AM, Markus Mock wrote: > Hello, > with org.apache.cassandra.hadoop.ConfigHelper.setInputColumnFamily I can set > up the map

Multiple input column families in Cassandra Hadoop mapreduce

2011-07-15 Thread Markus Mock
Hello, with org.apache.cassandra.hadoop.ConfigHelper.setInputColumnFamily I can set up the map phase to read from one column family. Is it possible to have multiple mapper classes each mapping over their own column family so that data from multiple column families can be "joined" in the reduce pha