Hi all,

I see that pivot functionality is being added to Spark DataFrames from
1.6 onward.

Is there also a Spark SQL syntax available for pivoting? For comparison,
slide 11 of [1] lists the following equivalents:

pandas (Python) - pivot_table(df, values='D', index=['A', 'B'],
columns=['C'], aggfunc=np.sum)

reshape2 (R) - dcast(df, A + B ~ C, sum)

Oracle 11g - SELECT * FROM df PIVOT (sum(D) FOR C IN ('small', 'large')) p
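
To make the expected result concrete, here is a minimal pure-Python sketch of what the three expressions above compute: group by A and B, spread the distinct values of C into columns, and sum D in each cell. The sample rows and the helper name pivot_sum are made up for illustration; they are not taken from the slides or from any Spark API.

```python
from collections import defaultdict

# Hypothetical sample rows using the column names A, B, C, D from the examples above.
rows = [
    {"A": "foo", "B": "one", "C": "small", "D": 1},
    {"A": "foo", "B": "one", "C": "large", "D": 2},
    {"A": "foo", "B": "one", "C": "large", "D": 2},
    {"A": "foo", "B": "two", "C": "small", "D": 3},
    {"A": "bar", "B": "one", "C": "small", "D": 5},
    {"A": "bar", "B": "one", "C": "large", "D": 4},
]


def pivot_sum(rows, index, column, value):
    """Group rows by the index columns and sum `value` per distinct `column` entry."""
    table = defaultdict(lambda: defaultdict(int))
    for row in rows:
        key = tuple(row[name] for name in index)
        table[key][row[column]] += row[value]
    # Convert the nested defaultdicts to plain dicts for easier inspection.
    return {key: dict(cells) for key, cells in table.items()}


pivoted = pivot_sum(rows, index=["A", "B"], column="C", value="D")
for key in sorted(pivoted):
    print(key, pivoted[key])
```

In this sketch a missing (A, B, C) combination is simply absent from the inner dict, whereas a real pivot would typically show it as a null or NaN cell.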


Best

[1]
http://www.slideshare.net/SparkSummit/pivoting-data-with-sparksql-by-andrew-ray

-- 
Niranda Perera
@n1r44 <https://twitter.com/N1R44>
+94 71 554 8430
https://www.linkedin.com/in/niranda
https://pythagoreanscript.wordpress.com/
