Hi,
I have an RDD with n rows and m columns, but most of the entries are 0.
It is a sparse matrix.
I would like to get only the non-zero entries, together with their indices.
The equivalent plain Python code would be:
    for i, x in enumerate(matrix):
        for j, y in enumerate(x):
            if y:  # skip zero entries
                print(i, j, y)
What I would like to do is save the (i, j, y) entries instead of printing them.
How do I do this in PySpark?
Thanks
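
One possible approach, as a rough sketch only: it assumes each element of the RDD is one row of the matrix (a list or tuple of numbers) and that the RDD order is the row order. The helper names nonzero_entries, sparse_entries and save_sparse_entries are made up for illustration; zipWithIndex, flatMap, map and saveAsTextFile are standard RDD methods.

```python
def nonzero_entries(indexed_row):
    """Yield (i, j, y) for every non-zero value y in a single row."""
    row, i = indexed_row  # zipWithIndex produces (element, index) pairs
    for j, y in enumerate(row):
        if y:  # skip zero entries
            yield (i, j, y)


def sparse_entries(rdd):
    """Return an RDD of (i, j, y) triples for the non-zero entries.

    Assumes `rdd` holds one matrix row per element.
    """
    # Attach the row index, then flatten each row into its non-zero triples.
    return rdd.zipWithIndex().flatMap(nonzero_entries)


def save_sparse_entries(rdd, output_path):
    """Write the triples as "i,j,y" text lines under output_path."""
    lines = sparse_entries(rdd).map(lambda t: ",".join(str(v) for v in t))
    lines.saveAsTextFile(output_path)
```

With an existing SparkContext sc, this could be called as save_sparse_entries(sc.parallelize(matrix), path), where path is whatever output directory you choose. If the triples are small enough to fit on the driver, sparse_entries(rdd).collect() returns them as a Python list instead.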
