Hi,
   I am trying to create an rdd out of large matrix.... sc.parallelize
suggest to use broadcast
But when I do

sc.broadcast(data)
I get this error:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/common/usg/spark/1.0.2/python/pyspark/context.py", line 370,
in broadcast
    pickled = pickleSer.dumps(value)
  File "/usr/common/usg/spark/1.0.2/python/pyspark/serializers.py", line
279, in dumps
    def dumps(self, obj): return cPickle.dumps(obj, 2)
SystemError: error return without exception set
Help?

Reply via email to