Hoon Park created ZEPPELIN-1883:
-----------------------------------

             Summary: Can't import packages requested by SPARK_SUBMIT_OPTION in pyspark
                 Key: ZEPPELIN-1883
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1883
             Project: Zeppelin
          Issue Type: Bug
          Components: pySpark
            Reporter: Hoon Park
             Fix For: 0.7.0


The Zeppelin pyspark interpreter can't import packages submitted through {{SPARK_SUBMIT_OPTIONS}}. 
For example, with the following configuration:

{code}
# conf/zeppelin-env.sh
...

# Use $HOME rather than ~ here: the shell does not expand ~ inside double quotes.
export SPARK_HOME="$HOME/github/apache-spark/1.6.2-bin-hadoop2.6"
export SPARK_SUBMIT_OPTIONS="--packages com.datastax.spark:spark-cassandra-connector_2.10:1.6.2,TargetHolding:pyspark-cassandra:0.3.5 --exclude-packages org.slf4j:slf4j-api"

...
{code}

Then try to import the {{pyspark_cassandra}} module in the Zeppelin pyspark 
interpreter:

{code}
import pyspark_cassandra


Traceback (most recent call last):
  File "/var/folders/lr/8g9y625n5j39rz6qhkg8s6640000gn/T/zeppelin_pyspark-5266742863961917074.py", line 267, in <module>
    raise Exception(traceback.format_exc())
Exception: Traceback (most recent call last):
  File "/var/folders/lr/8g9y625n5j39rz6qhkg8s6640000gn/T/zeppelin_pyspark-5266742863961917074.py", line 265, in <module>
    exec(code)
  File "<stdin>", line 1, in <module>
ImportError: No module named pyspark_cassandra
{code}
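
A possible stopgap, not a fix for the issue itself: the sketch below puts the resolved package jar on the driver's {{sys.path}} by hand so the import can succeed. It assumes the jars resolved by {{--packages}} land in Spark's default Ivy directory ({{~/.ivy2/jars}}) and that the jar bundles its Python sources at the archive root. The helper name {{add_package_jars_to_path}} is hypothetical and only used for illustration.

```python
import glob
import os
import sys


def add_package_jars_to_path(jar_dir, name_fragment):
    """Append jars in jar_dir whose file name contains name_fragment to sys.path.

    Python can import modules straight from a zip archive, and a jar is a
    zip archive, so a jar that ships .py files becomes importable once it
    is listed on sys.path. Returns the jar paths that were newly added.
    """
    added = []
    pattern = os.path.join(os.path.expanduser(jar_dir), "*.jar")
    for jar in sorted(glob.glob(pattern)):
        if name_fragment in os.path.basename(jar) and jar not in sys.path:
            sys.path.append(jar)
            added.append(jar)
    return added


# Intended use inside a %pyspark paragraph (kept as comments because the
# directory below is an assumption about the local Spark setup):
#
#   add_package_jars_to_path("~/.ivy2/jars", "pyspark-cassandra")
#   import pyspark_cassandra
```

This only changes the import path of the driver-side Python process. If the executors need the module as well, calling {{sc.addPyFile}} with each returned jar path may also be required.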



