Re: zip two RDD in pyspark

2014-07-29 Thread Nick Pentreath
ead.run(Thread.java:744) > > > As I see it, the job is completed, but I don't understand what's > happening > > to 'String cannot be cast to [B'. I tried to zip two > parallelCollectionRDD > > and it works fine. But here I have a MappedRDD at textFi

Re: zip two RDD in pyspark

2014-07-29 Thread Davies Liu
s error? Maybe there are some corner cases during serialization. > Also, why Python does not have ZipWithIndex()? The features in PySpark are much less than Spark, hopefully it will catch up in next two releases. > > Thanks for any help. > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/zip-two-RDD-in-pyspark-tp10806.html > Sent from the Apache Spark User List mailing list archive at Nabble.com.

zip two RDD in pyspark

2014-07-28 Thread lllll
? Thanks for any help. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/zip-two-RDD-in-pyspark-tp10806.html Sent from the Apache Spark User List mailing list archive at Nabble.com.