Jerry, thank you very much for further clarifications!
best,
Mahmoud
From: Shao, Saisai [saisai.s...@intel.com]
Sent: Sunday, June 29, 2014 8:17 PM
To: user@spark.apache.org
Subject: RE: Sorting Reduced/Groupd Values without Explicit Sorting
Yes, the current
he order of elements within
partitions after shuffle, so you have to sort them yourself.
Thanks
Saisai.
From: Parsian, Mahmoud [mailto:mpars...@illumina.com]
Sent: Monday, June 30, 2014 11:08 AM
To: user@spark.apache.org
Subject: RE: Sorting Reduced/Groupd Values without Explicit Sorting
Hi Jerry,
Th
(4,0)
There is a partitioner, but it does not preserve the order of RDD elements.
Thanks again,
best,
Mahmoud
From: Shao, Saisai [saisai.s...@intel.com]
Sent: Sunday, June 29, 2014 6:41 PM
To: user@spark.apache.org
Subject: RE: Sorting Reduced/Groupd Values w
ilto:mpars...@illumina.com]
Sent: Monday, June 30, 2014 9:00 AM
To: user@spark.apache.org
Subject: Sorting Reduced/Groupd Values without Explicit Sorting
Given the following time series data:
name, time, value
x,2,9
x,1,3
x,3,6
y,2,5
y,1,7
y,3,1
z,3,7
z,4,0
z,1,4
z,2,8
we want to generate the following (the reduced/grouped values are sorted by
time).
x => [(1,3), (2,9), (3,6)]
y => [(1,7), (2,5), (3,1)]
z => [(1,4), (2,8), (3,7), (4,0)]
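For illustration, here is a minimal sketch of the "group, then sort each group yourself" approach on the sample data above. It is plain Python rather than Spark, and the helper name group_and_sort is hypothetical, not something from this thread. It only shows the explicit per-key sort that becomes necessary when grouping gives no ordering guarantee.

```python
from collections import defaultdict

# Sample records from the question: (name, time, value)
records = [
    ("x", 2, 9), ("x", 1, 3), ("x", 3, 6),
    ("y", 2, 5), ("y", 1, 7), ("y", 3, 1),
    ("z", 3, 7), ("z", 4, 0), ("z", 1, 4), ("z", 2, 8),
]

def group_and_sort(rows):
    """Group (time, value) pairs by name, then sort each group by time.

    Hypothetical helper for illustration only.
    """
    groups = defaultdict(list)
    for name, time, value in rows:
        groups[name].append((time, value))
    # Explicit per-key sort on the time field: grouping alone does not
    # guarantee any order of the values within a key.
    return {name: sorted(pairs, key=lambda p: p[0])
            for name, pairs in groups.items()}

result = group_and_sort(records)
for name in sorted(result):
    print(name, "=>", result[name])
```

In PySpark the same idea would presumably be a groupByKey on (name, (time, value)) pairs followed by a mapValues that sorts each group, but that mapping is my assumption and not something confirmed in this thread.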
One obv