subject:"Re\: Creating a python port for a Scala Spark Projeect"

Re: Creating a python port for a Scala Spark Projeect

2016-06-22 Thread Daniel Imberman

Thank you Holden, I look forward to watching your talk! On Wed, Jun 22, 2016 at 7:12 PM Holden Karau wrote: > PySpark RDDs are (on the Java side) are essentially RDD of pickled objects > and mostly (but not entirely) opaque to the JVM. It is possible (by using > some internals) to pass a PySpark

Re: Creating a python port for a Scala Spark Projeect

2016-06-22 Thread Holden Karau

PySpark RDDs are (on the Java side) are essentially RDD of pickled objects and mostly (but not entirely) opaque to the JVM. It is possible (by using some internals) to pass a PySpark DataFrame to a Scala library (you may or may not find the talk I gave at Spark Summit useful https://www.youtube.com