Re: aggregateByKey on PairRDD

2016-03-30 Thread write2sivakumar@gmail
e, (book,tech))(eBay, (book1,tech))
Thanks,
Sivakumar
Original message
From: Daniel Haviv
Date: 30/03/2016 18:58 (GMT+08:00)
To: Akhil Das
Cc: Suniti Singh, u...@spark.apache.org, dev
Subject: Re: aggregateByKey on PairRDD
Hi, shouldn't groupByKey

Re: aggregateByKey on PairRDD

2016-03-30 Thread Akhil Das
Isn't it what tempRDD.groupByKey does?
Thanks
Best Regards
On Wed, Mar 30, 2016 at 7:36 AM, Suniti Singh wrote:
> Hi All,
>
> I have an RDD having the data in the following form:
>
> tempRDD: RDD[(String, (String, String))]
>
> (brand, (product, key))
>
> ("amazon",("book1","tech"))
>
> ("eB

aggregateByKey on PairRDD

2016-03-29 Thread Suniti Singh
Hi All,
I have an RDD having the data in the following form:
tempRDD: RDD[(String, (String, String))]
(brand, (product, key))
("amazon",("book1","tech"))
("eBay",("book1","tech"))
("barns&noble",("book","tech"))
("amazon",("book2","tech"))
I would like to group the data by Brand and wou
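The question is cut off in this listing, so the exact output wanted is not known. As a rough sketch, assuming the goal is only to collect the (product, key) pairs under each brand, the two approaches mentioned in the thread could look like this. The object name `GroupByBrand`, the helper `groupWithAggregate`, and the local-mode `SparkContext` setup are illustrative assumptions, not something taken from the original messages.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

// Hypothetical wrapper object, named here only for illustration.
object GroupByBrand {

  // aggregateByKey variant: builds a List of (product, key) pairs per brand.
  // The zero value is an empty list, seqOp prepends one value within a
  // partition, and combOp concatenates the partial lists across partitions.
  def groupWithAggregate(
      rdd: RDD[(String, (String, String))]
  ): RDD[(String, List[(String, String)])] =
    rdd.aggregateByKey(List.empty[(String, String)])(
      (acc, value) => value :: acc,
      (left, right) => left ::: right
    )

  def main(args: Array[String]): Unit = {
    // Local-mode context, assumed here so the sketch is self-contained.
    val conf = new SparkConf().setAppName("group-by-brand").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      // Sample rows copied from the question.
      val tempRDD: RDD[(String, (String, String))] = sc.parallelize(Seq(
        ("amazon", ("book1", "tech")),
        ("eBay", ("book1", "tech")),
        ("barns&noble", ("book", "tech")),
        ("amazon", ("book2", "tech"))
      ))

      // groupByKey variant, as suggested in the replies:
      // yields RDD[(String, Iterable[(String, String)])].
      tempRDD.groupByKey().collect().foreach { case (brand, items) =>
        println(s"$brand -> ${items.toList}")
      }

      // aggregateByKey variant; the order of brands and of items within a
      // brand is not guaranteed.
      groupWithAggregate(tempRDD).collect().foreach(println)
    } finally {
      sc.stop()
    }
  }
}
```

For plain grouping like this, both calls shuffle every value, so `groupByKey` is the simpler choice. `aggregateByKey` tends to pay off when the per-key result is smaller than the input (a count, a set of distinct products, and so on), because `seqOp` can reduce the data inside each partition before the shuffle.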