Can you describe your use case a bit more ? Since the row keys are not sorted in your example, there is a chance that you get indeterministic results when you aggregate on groups of two successive rows.
Thanks On Mon, Mar 28, 2016 at 9:21 AM, sujeet jog <sujeet....@gmail.com> wrote: > Hi, > > I have a RDD like this . > > [ 12, 45 ] > [ 14, 50 ] > [ 10, 35 ] > [ 11, 50 ] > > i want to aggreate values of first two rows into 1 row and subsequenty the > next two rows into another single row... > > i don't have a key to aggregate for using some of the aggregate pyspark > functions, how to achieve it ? > > >