Request for suggestions

srinivasrajagopalan Mon, 26 Nov 2012 11:04:11 -0800

Hi,


We have a scenario where we want a single Hadoop job to run multiple mapper 
tasks, with each mapper task querying a subset of the columns of a relational 
database table. We looked into DataDrivenDBInputFormat, but it only seems to 
support partitioning by rows, where each mapper task queries a subset of the 
rows of the table.
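
To make the column-wise split concrete, here is a minimal sketch in plain Java of the partitioning we have in mind. It is only an illustration and uses no Hadoop classes: the class name ColumnSplitPlanner, the table "customers", the key column "id", and the column names are all hypothetical. Each generated query would presumably have to be handed to one mapper by some custom input format, and that wiring is not shown here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: plans one SELECT statement per mapper, each covering
// a different subset of columns. Not part of Hadoop or any existing library.
public class ColumnSplitPlanner {

    // Divide the columns into at most numMappers contiguous groups whose
    // sizes differ by at most one.
    static List<List<String>> partitionColumns(List<String> columns, int numMappers) {
        if (numMappers < 1) {
            throw new IllegalArgumentException("numMappers must be >= 1");
        }
        List<List<String>> groups = new ArrayList<>();
        int groupCount = Math.min(numMappers, columns.size());
        if (groupCount == 0) {
            return groups; // no columns, nothing to plan
        }
        int base = columns.size() / groupCount;
        int extra = columns.size() % groupCount; // first 'extra' groups get one more column
        int start = 0;
        for (int i = 0; i < groupCount; i++) {
            int size = base + (i < extra ? 1 : 0);
            groups.add(new ArrayList<>(columns.subList(start, start + size)));
            start += size;
        }
        return groups;
    }

    // Build the query for one mapper. The key column is included in every
    // query so that the column subsets can be joined back together later.
    static String buildQuery(String table, String keyColumn, List<String> columns) {
        List<String> selected = new ArrayList<>();
        selected.add(keyColumn);
        selected.addAll(columns);
        return "SELECT " + String.join(", ", selected) + " FROM " + table;
    }

    public static void main(String[] args) {
        // Assumed example schema; replace with the real table and columns.
        List<String> columns = Arrays.asList("name", "email", "city", "balance", "status");
        for (List<String> group : partitionColumns(columns, 2)) {
            System.out.println(buildQuery("customers", "id", group));
        }
    }
}
```

With five columns and two mappers, the first mapper would get three columns and the second would get two, each together with the key column.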

I am not sure if Pig can help us in this case.

We would appreciate any suggestions in this regard.

Thanks
srinivas
