Hi, I have analytics data with timestamps on each element. I'd like to analyze consecutive elements using Spark, but haven't figured out how to do this.
Essentially what I'd want is a transform from a sorted RDD [A, B, C, D, E]
to an RDD [(A,B), (B,C), (C,D), (D,E)]. (Or some other way to analyze
time-related elements.)
How can this be achieved?
* Sampo Niskanen*
*Lead developer / Wellmo*
[email protected]
+358 40 820 5291
