Re: comparaing row in pyspark data frame

2016-06-08 Thread Jacek Laskowski
On Wed, Jun 8, 2016 at 2:05 PM, pseudo oduesp wrote: > how we can compare columns to get max of row not columns and get name of > columns where max it present ? First thought - a UDF. Pozdrawiam, Jacek Laskowski https://medium.com/@jaceklaskowski/ Mastering Apache Spark http://bit.ly/maste

Re: comparaing row in pyspark data frame

2016-06-08 Thread Ted Yu
Do you mean returning col3 and 0.4 for the example row below ? > On Jun 8, 2016, at 5:05 AM, pseudo oduesp wrote: > > Hi, > how we can compare multiples columns in datframe i mean > > if df it s dataframe like that : > >df.col1 | df.col2 | df.col3 >

comparaing row in pyspark data frame

2016-06-08 Thread pseudo oduesp
Hi, how we can compare multiples columns in datframe i mean if df it s dataframe like that : df.col1 | df.col2 | df.col3 0.2 0.3 0.4 how we can compare columns to get max of row not columns and get name of columns where max it