Github user yywwd commented on the issue:

    https://github.com/apache/zeppelin/pull/2231
  
    @zjffdu Sorry for the slow response; I was trying to reproduce the bugs. 
I thought they might be caused by my own code, so I tried the official PySpark 
example, and the bugs still occurred. This is the code I used: 
https://github.com/apache/incubator-livy/blob/master/examples/src/main/python/pi_app.py
    **Note:** I commented out the last line, `client.stop(True)`, because I 
don't want to close the session after submitting just one job. The details of 
the bugs are as follows:
    
    1. When I use the default master, that is "yarn", the official PySpark 
example and the programmatic API work well, but the Livy interpreter in 
Zeppelin throws an exception for the Spark master: 
`NoSuchMethodError: org.apache.spark.ui.SparkUI.appUIAddress()`.
    
    2. Then I changed the master to "yarn-cluster", as SivaKaviyappa 
suggested, and Zeppelin works well, although the logs of the statement contain 
a warning: `Warning: Master yarn-cluster is deprecated since 2.0. Please use 
master "yarn" with specified deploy mode instead.`
    However, with this setting the programmatic API fails as follows:
    
      2.1 I deleted all Livy sessions and ran pi_app.py. It threw this 
exception:
    ```
    ReadTimeout: HTTPConnectionPool(host='172.31.5.251', port=8998): Read timed out. (read timeout=10)
    Traceback (most recent call last):
      File "/home/ec2-user/wandongwu/livy_test_9/pi_app.py", line 35, in <module>
        pi = client.submit(pi_job).result()
      File "/usr/local/lib/python2.7/site-packages/concurrent/futures/_base.py", line 462, in result
        return self.__get_result()
      File "/usr/local/lib/python2.7/site-packages/concurrent/futures/_base.py", line 414, in __get_result
        raise exception_type, self._exception, self._traceback
    TypeError: raise: arg 3 must be a traceback or None
    ```
    However, I found that it had still started a new Livy session, so I 
changed the script arguments to `'http://<livy-server>:8998/sessions/0' 2` 
to reuse that session. It then threw another exception:
    ```
    Traceback (most recent call last):
      File "/home/ec2-user/wandongwu/livy_test_9/pi_app.py", line 35, in <module>
        pi = client.submit(pi_job).result()
      File "/usr/local/lib/python2.7/site-packages/concurrent/futures/_base.py", line 462, in result
        return self.__get_result()
      File "/usr/local/lib/python2.7/site-packages/concurrent/futures/_base.py", line 414, in __get_result
        raise exception_type, self._exception, self._traceback
    Exception: org.apache.livy.repl.PythonJobException: Client job error:Traceback (most recent call last):
      File "/mnt/yarn/usercache/livy/appcache/application_1528945006613_0302/container_1528945006613_0302_01_000001/tmp/4991895008696585180", line 159, in processBypassJob
        deserialized_job = pickle.loads(serialized_job)
      File "/usr/lib64/python2.7/pickle.py", line 1388, in loads
        return Unpickler(file).load()
      File "/usr/lib64/python2.7/pickle.py", line 864, in load
        dispatch[key](self)
      File "/usr/lib64/python2.7/pickle.py", line 1096, in load_global
        klass = self.find_class(module, name)
      File "/usr/lib64/python2.7/pickle.py", line 1130, in find_class
        __import__(module)
    ImportError: No module named cloudpickle.cloudpickle
    ```
    
    3. Then I changed the master back to the default, that is "yarn". The 
programmatic API works well again, but Zeppelin still does not work.
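
    For reference, the `ImportError` at the end of the second traceback in 2.1 
can be reproduced without a cluster. My understanding is that a pickled job 
stores classes and functions by module path, so `pickle.loads` fails when the 
Python that unpickles it (here, the one inside the YARN container) cannot import 
that module. This is only a minimal sketch of the mechanism, not Livy's code: 
`try_unpickle` and the module name `hypothetical_missing_module_xyz` are made up 
for illustration, and the module is assumed not to be installed.

```python
import pickle


def try_unpickle(payload):
    """Return (True, obj) on success, or (False, message) if a module is missing."""
    try:
        return True, pickle.loads(payload)
    except ImportError as exc:  # ModuleNotFoundError on Python 3 is a subclass
        return False, str(exc)


# Hand-written protocol-0 pickle: the GLOBAL opcode ("c") names a class by
# module and attribute. The module name is hypothetical and assumed to be
# absent, which mimics cloudpickle missing from the container's Python.
MISSING_MODULE_PAYLOAD = b"chypothetical_missing_module_xyz\nSomeClass\n."

# A payload that only uses builtin types unpickles anywhere.
PLAIN_PAYLOAD = pickle.dumps({"slices": 2})

if __name__ == "__main__":
    print(try_unpickle(MISSING_MODULE_PAYLOAD))
    print(try_unpickle(PLAIN_PAYLOAD))
```

If this is what happens in my case, it would mean that cloudpickle has to be 
importable by the Python used on the cluster nodes, and not only on the machine 
that runs pi_app.py, but I have not verified this.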

