Re: Merge rows into csv
reduceByKey would be a perfect fit for you On Wed, Dec 9, 2015 at 4:47 AM, Krishna wrote: > Hi, > > what is the most efficient way to perform a group-by operation in Spark > and merge rows into csv? > > Here is the current RDD > - > ID STATE > ---
Merge rows into csv
Hi, what is the most efficient way to perform a group-by operation in Spark and merge rows into csv? Here is the current RDD - ID STATE - 1 TX 1NY 1FL 2CA 2OH - This is the required output