Hi, I am trying to create an RDD out of a large matrix. sc.parallelize suggests using broadcast, but when I do
sc.broadcast(data), I get this error:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/common/usg/spark/1.0.2/python/pyspark/context.py", line 370, in broadcast
    pickled = pickleSer.dumps(value)
  File "/usr/common/usg/spark/1.0.2/python/pyspark/serializers.py", line 279, in dumps
    def dumps(self, obj): return cPickle.dumps(obj, 2)
SystemError: error return without exception set

Help?