I'm trying to set up IPython notebook on an edge node with port forwarding so
that I can run PySpark from my laptop's browser. I've mostly been following
the Cloudera guide here:
http://blog.cloudera.com/blog/2014/08/how-to-use-ipython-notebook-with-apache-spark/
I got this working on one cluster running Spark 1.0. On Spark 1.3 (with
Python 2.7 and Java 7), however, I get the error below when I run
/opt/cloudera/parcels/CDH/lib/spark/python/pyspark/shell.py, at this line:
    sc = SparkContext(appName="PySparkShell", pyFiles=add_files)
Before showing the error, I'll note that running "pyspark --master
yarn-client" DOES work, so PySpark itself runs fine on YARN. It looks like
IPython notebook starts Spark through a different path, and that path
produces the error. Any ideas?
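One way to narrow this down is to compare the environment of the working
"pyspark --master yarn-client" shell with the environment the notebook kernel
sees. The sketch below is only a diagnostic aid. The helper name
spark_env_report is made up for illustration, and the list of variables is an
assumption about which settings might matter here; nothing in the traceback
confirms which one, if any, differs.

```python
import os

# Environment variables commonly consulted when PySpark starts a JVM.
# Which of these actually matters on this cluster is an assumption.
SPARK_ENV_VARS = (
    "SPARK_HOME",
    "SPARK_CONF_DIR",
    "HADOOP_CONF_DIR",
    "YARN_CONF_DIR",
    "PYSPARK_SUBMIT_ARGS",
    "MASTER",
)


def spark_env_report(environ=None):
    """Return {name: value or None} for each variable in SPARK_ENV_VARS."""
    if environ is None:
        environ = os.environ
    return {name: environ.get(name) for name in SPARK_ENV_VARS}


if __name__ == "__main__":
    # Print one line per variable so two runs can be diffed by eye.
    for name, value in sorted(spark_env_report().items()):
        print("%-20s %s" % (name, value if value is not None else "<unset>"))
```

Running this once from the working pyspark shell and once from a notebook
cell, then comparing the two outputs, should show whether the notebook is
missing a setting that the shell has.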
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/opt/cloudera/parcels/CDH/lib/spark/python/pyspark/context.py", line 111, in __init__
    conf, jsc, profiler_cls)
  File "/opt/cloudera/parcels/CDH/lib/spark/python/pyspark/context.py", line 159, in _do_init
    self._jsc = jsc or self._initialize_context(self._conf._jconf)
  File "/opt/cloudera/parcels/CDH/lib/spark/python/pyspark/context.py", line 212, in _initialize_context
    return self._jvm.JavaSparkContext(jconf)
  File "/opt/cloudera/parcels/CDH/lib/spark/python/lib/py4j-0.8.2.1-src.zip/py4j/java_gateway.py", line 701, in __call__
  File "/opt/cloudera/parcels/CDH/lib/spark/python/lib/py4j-0.8.2.1-src.zip/py4j/protocol.py", line 300, in get_return_value
py4j.protocol.Py4JJavaError: An error occurred while calling None.org.apache.spark.api.java.JavaSparkContext.
: java.io.FileNotFoundException: /user/spark/applicationHistory/application_1438611042507_0055.inprogress (No such file or directory)
    at java.io.FileOutputStream.open(Native Method)
    at java.io.FileOutputStream.<init>(FileOutputStream.java:221)
    at java.io.FileOutputStream.<init>(FileOutputStream.java:110)
    at org.apache.spark.scheduler.EventLoggingListener.start(EventLoggingListener.scala:117)
    at org.apache.spark.SparkContext.<init>(SparkContext.scala:399)
    at org.apache.spark.api.java.JavaSparkContext.<init>(JavaSparkContext.scala:61)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
    at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
    at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:234)
    at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:379)
    at py4j.Gateway.invoke(Gateway.java:214)
    at py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:79)
    at py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:68)
    at py4j.GatewayConnection.run(GatewayConnection.java:207)
    at java.lang.Thread.run(Thread.java:745)
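
The Java frames show the event log being opened with java.io.FileOutputStream,
which writes to the local filesystem, so the path under
/user/spark/applicationHistory appears to have been resolved locally rather
than on HDFS. A quick check of whether a configured directory carries an
explicit URI scheme can be sketched as follows. The function name
event_log_scheme is hypothetical, and the "hdfs:///..." value is only an
illustration, not this cluster's actual setting.

```python
try:
    from urllib.parse import urlparse  # Python 3
except ImportError:
    from urlparse import urlparse  # Python 2.7, as used in this post


def event_log_scheme(path):
    """Return the URI scheme of an event log directory, or None if absent."""
    scheme = urlparse(path).scheme
    return scheme or None


# Directory taken from the FileNotFoundException above.
print(event_log_scheme("/user/spark/applicationHistory"))
# Hypothetical value with an explicit scheme, for comparison only.
print(event_log_scheme("hdfs:///user/spark/applicationHistory"))
```

If the configured directory has no scheme, it is presumably resolved against
whatever default filesystem the JVM was started with, which would explain why
the same setting behaves differently in the pyspark shell and in the notebook.
That explanation is a guess based on the stack trace, not something the trace
proves.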
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Error-when-running-pyspark-shell-py-to-set-up-iPython-notebook-tp24188.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.