Github user yywwd commented on the issue:

https://github.com/apache/zeppelin/pull/2231

@zjffdu Sorry for the slow response; I was trying to reproduce the bugs. I thought they might be caused by my own code, so I tried the official PySpark example, but the bugs still occurred. This is the code I used:

https://github.com/apache/incubator-livy/blob/master/examples/src/main/python/pi_app.py

**Note:** I commented out the last line, `client.stop(True)`, because I don't want to close the session after submitting just one job.

The details of this bug are as follows:

1. When I use the default yarn mode for the Spark master, that is "yarn", the official PySpark example and the programmatic API work well. Using the Livy interpreter in Zeppelin, however, throws an exception: `NoSuchMethodError: org.apache.spark.ui.SparkUI.appUIAddress()`.

2. Then I changed the yarn mode to "yarn-cluster" as SivaKaviyappa suggested, and Zeppelin works well, although the log for the statement contains a warning: `Warning: Master yarn-cluster is deprecated since 2.0. Please use master "yarn" with specified deploy mode instead.` However, using the programmatic API in this mode runs into the following bugs:

2.1 I deleted all Livy sessions and ran pi_app.py. It threw this exception:

```
ReadTimeout: HTTPConnectionPool(host='172.31.5.251', port=8998): Read timed out. (read timeout=10)
Traceback (most recent call last):
  File "/home/ec2-user/wandongwu/livy_test_9/pi_app.py", line 35, in <module>
    pi = client.submit(pi_job).result()
  File "/usr/local/lib/python2.7/site-packages/concurrent/futures/_base.py", line 462, in result
    return self.__get_result()
  File "/usr/local/lib/python2.7/site-packages/concurrent/futures/_base.py", line 414, in __get_result
    raise exception_type, self._exception, self._traceback
TypeError: raise: arg 3 must be a traceback or None
```

But I found that it had started a new Livy session, so I changed the configuration parameter to `'http://<livy-server>:8998/sessions/0'`.

2.2 It then threw another exception:

```
Traceback (most recent call last):
  File "/home/ec2-user/wandongwu/livy_test_9/pi_app.py", line 35, in <module>
    pi = client.submit(pi_job).result()
  File "/usr/local/lib/python2.7/site-packages/concurrent/futures/_base.py", line 462, in result
    return self.__get_result()
  File "/usr/local/lib/python2.7/site-packages/concurrent/futures/_base.py", line 414, in __get_result
    raise exception_type, self._exception, self._traceback
Exception: org.apache.livy.repl.PythonJobException: Client job error:Traceback (most recent call last):
  File "/mnt/yarn/usercache/livy/appcache/application_1528945006613_0302/container_1528945006613_0302_01_000001/tmp/4991895008696585180", line 159, in processBypassJob
    deserialized_job = pickle.loads(serialized_job)
  File "/usr/lib64/python2.7/pickle.py", line 1388, in loads
    return Unpickler(file).load()
  File "/usr/lib64/python2.7/pickle.py", line 864, in load
    dispatch[key](self)
  File "/usr/lib64/python2.7/pickle.py", line 1096, in load_global
    klass = self.find_class(module, name)
  File "/usr/lib64/python2.7/pickle.py", line 1130, in find_class
    __import__(module)
ImportError: No module named cloudpickle.cloudpickle
```

3. Then I changed the yarn mode back to the default, "yarn". The programmatic API works well again, but Zeppelin still does not work.
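
For anyone trying to reproduce this, the workaround of pointing the client at the session that was created despite the read timeout could be scripted roughly as below. This is only a sketch under assumptions: the helper names (`session_url`, `newest_session_id`, `find_existing_session_url`) are hypothetical, it assumes Livy's REST `GET /sessions` endpoint returns a JSON object with a `sessions` list whose entries carry an `id`, the 60-second timeout is an arbitrary choice, and it uses the Python 3 standard library rather than the Python 2.7 environment shown in the tracebacks.

```python
import json
from urllib.request import urlopen


def session_url(livy_url, session_id):
    """Build the URL of an existing Livy session, e.g. <base>/sessions/0."""
    return "{}/sessions/{}".format(livy_url.rstrip("/"), session_id)


def newest_session_id(listing):
    """Return the highest session id in a parsed GET /sessions response, or None."""
    ids = [s["id"] for s in listing.get("sessions", [])]
    return max(ids) if ids else None


def find_existing_session_url(livy_url, timeout=60):
    """Query the Livy server and return the URL of its newest session, if any.

    The timeout is deliberately longer than the 10 s read timeout seen in the
    first traceback; the exact value is an assumption.
    """
    with urlopen(livy_url.rstrip("/") + "/sessions", timeout=timeout) as resp:
        listing = json.loads(resp.read().decode("utf-8"))
    sid = newest_session_id(listing)
    return None if sid is None else session_url(livy_url, sid)
```

The returned URL would then be passed to pi_app.py in place of the bare server URL. Note that this only automates reusing the session; it does not address the `cloudpickle` import error that follows.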
---