Re: hadoop map join with ColumnFamilyInputFormat

2012-03-01 Thread Jeremy Hanna

I haven't used that in particular, but it's pretty trivial to do with Pig, and I would imagine it would just do the right thing under the covers. It's a simple join with Pig. We use pygmalion to get data from the Cassandra bag. A simple example would be: DEFINE FromCassandraBag org.pygmal

hadoop map join with ColumnFamilyInputFormat

2012-03-01 Thread Benoit Mathieu

Hi all, I want to write a MapReduce job with a Map task taking its data from 2 CFs. Those 2 CFs have the same row keys and are in the same keyspace, so they are partitioned the same way across my cluster, and it would be nice if the Map task read both column families locally. In hadoop package o
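
To make the idea in the question concrete, here is a minimal sketch in plain Python of joining two column families by row key inside a single map task. It only illustrates the join logic and is not Hadoop or Cassandra API code: the function `map_side_join` and the sample data `users_split` and `orders_split` are hypothetical, and each dict stands in for the rows of one column family that a map task would read from the same local token range.

```python
from typing import Dict, Iterator, Tuple

# Hypothetical stand-in for one row: column name -> column value.
Columns = Dict[str, str]


def map_side_join(
    cf_a: Dict[str, Columns], cf_b: Dict[str, Columns]
) -> Iterator[Tuple[str, Columns, Columns]]:
    """Yield (row_key, columns_a, columns_b) for every row key present in both inputs.

    Assumes both inputs cover the same token range, so a matching row key
    is always found locally and no shuffle/reduce step is needed.
    """
    for row_key, columns_a in cf_a.items():
        columns_b = cf_b.get(row_key)
        if columns_b is not None:
            yield row_key, columns_a, columns_b


# Hypothetical sample data for one input split.
users_split = {
    "alice": {"email": "alice@example.com"},
    "bob": {"email": "bob@example.com"},
}
orders_split = {
    "alice": {"last_order": "2012-02-28"},
    "carol": {"last_order": "2012-02-27"},
}

joined = list(map_side_join(users_split, orders_split))
```

Because both column families are partitioned by the same row keys, the lookup never has to leave the node; rows whose key appears in only one column family (here `bob` and `carol`) are simply dropped, as in an inner join.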