Hi, I have an rdd with n rows and m columns... but most of them are 0 .. its as sparse matrix..
I would like to only get the non zero entries with their index? Any equivalent python code would be for i,x in enumerate(matrix): for j,y in enumerate(x): if y: print i,j,y Now, what I would like to do is save i,j,y entries? How do I do this in pyspark. Thanks