Re: Merge rows into csv

2015-12-08 Thread ayan guha
reduceByKey would be a perfect fit for you On Wed, Dec 9, 2015 at 4:47 AM, Krishna wrote: > Hi, > > what is the most efficient way to perform a group-by operation in Spark > and merge rows into csv? > > Here is the current RDD > - > ID STATE > ---

Merge rows into csv

2015-12-08 Thread Krishna
Hi, what is the most efficient way to perform a group-by operation in Spark and merge rows into csv? Here is the current RDD - ID STATE - 1 TX 1NY 1FL 2CA 2OH - This is the required output