Jungtaek Lim created ZEPPELIN-539:
-------------------------------------

             Summary: RemoteInterpreter Heartbeat
                 Key: ZEPPELIN-539
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-539
             Project: Zeppelin
          Issue Type: New Feature
          Components: zeppelin-interpreter
            Reporter: Jungtaek Lim


In ZEPPELIN-535, I noted that a remote interpreter can become unable to 
respond under some circumstances, for example after an OutOfMemoryError (OOME).

There are two scenarios: in one, the remote interpreter is not killed but 
does not respond; in the other, the remote interpreter is killed.

https://gist.github.com/HeartSaVioR/26190a71df6815aa62b8
https://gist.github.com/HeartSaVioR/d5049e7464534c48b4bf

Telling these scenarios apart requires analyzing Zeppelin's logs, which is not 
easy for ordinary users and cannot be done from within the Zeppelin UI.
Furthermore, to get back to normal, users have to restart the interpreter 
by hand.

It would help users if Zeppelin determined the remote interpreter's status 
and let them know when it is unable to respond.
It would be even better if Zeppelin restored the remote interpreter when possible.
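
To illustrate the idea, here is a minimal sketch of how a heartbeat check 
might tell the two scenarios apart. It is not Zeppelin code: the class 
HeartbeatSketch, the Status enum, the classify method and the 30 second 
timeout are all hypothetical names and values chosen for this example. It 
assumes the server can observe two things: whether the interpreter process 
is still alive, and how long ago the last heartbeat reply arrived.

```java
public class HeartbeatSketch {

    // Hypothetical status values; not part of Zeppelin's API.
    enum Status { HEALTHY, UNRESPONSIVE, DEAD }

    /**
     * Classifies a remote interpreter from two observations.
     *
     * @param processAlive           whether the interpreter process still exists
     * @param millisSinceLastReply   time elapsed since the last heartbeat reply
     * @param timeoutMillis          assumed limit before the interpreter counts as hung
     */
    static Status classify(boolean processAlive,
                           long millisSinceLastReply,
                           long timeoutMillis) {
        if (!processAlive) {
            // Scenario 2: the remote interpreter was killed.
            return Status.DEAD;
        }
        if (millisSinceLastReply > timeoutMillis) {
            // Scenario 1: the process exists but does not respond.
            return Status.UNRESPONSIVE;
        }
        return Status.HEALTHY;
    }

    public static void main(String[] args) {
        long timeoutMillis = 30_000L; // assumed value for illustration

        System.out.println(classify(true, 1_000L, timeoutMillis));
        System.out.println(classify(true, 120_000L, timeoutMillis));
        System.out.println(classify(false, 1_000L, timeoutMillis));
    }
}
```

With a result like this, the UI could show the status to the user, and an 
automatic restart could be attempted for the UNRESPONSIVE and DEAD cases.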



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
