I think if you have a shared storage for notebooks (for example, NFS
mounted from a third server),
and a load-balancer that supports sticky sessions (like F5) on top, it
should be possible to have HA without
any code change in Zeppelin. Am I missing something?



-- 
Ruslan Dautkhanov

On Fri, Jun 30, 2017 at 5:54 PM, Alexander Filipchik <afilipc...@gmail.com>
wrote:

> Honestly,  HA requires more than just active stand by.
> It should be able to scale without major surgeries, which is not possible
> right now. For example, if you start too many interpreters, zeppelin box
> will simply run out of memory.
>
> Alex
>
> On Thu, Jun 29, 2017 at 10:59 PM, wenxing zheng <wenxing.zh...@gmail.com>
> wrote:
>
>> at first, I would think GIT storage is a good option and we can push and
>> pull the changes regularly.
>>
>> With multiple zeppelin instances, maybe we need a new component or
>> service to act as a distributed scheduler: dispatch the Job to and manage
>> the Jobs on the Zeppelin instances.
>>
>> On Fri, Jun 30, 2017 at 1:26 PM, Vinay Shukla <vinayshu...@gmail.com>
>> wrote:
>>
>>> Here is what I think should be part of HA consideration:
>>>
>>>    1. Have multiple Zeppelin Instances
>>>    2. Have the notebooks storage backed by something like an NFS so all
>>>    notebooks are visible across all Zeppelin instances
>>>    3. Put multiple load balancers infront of Zeppelin to route requests.
>>>
>>> Consider that HA needs scalability, which depends on which interpreter
>>> you plan to use. So you might need to consider HA at both Zeppelin and
>>> interpreter level. For example if you were using Z + Livy + Spark, you will
>>> need to consider scalability + HA needs of Z + Livy interpreter + Livy
>>> Server + Spark (on Cluster manager).
>>>
>>> On Thu, Jun 29, 2017 at 10:04 PM, wenxing zheng <wenxing.zh...@gmail.com
>>> > wrote:
>>>
>>>> and do we have any architecture doc for reference? Because we need to
>>>> add the HA capability as soon as possible, hope we can figure it out.
>>>>
>>>> On Fri, Jun 30, 2017 at 12:33 PM, wenxing zheng <
>>>> wenxing.zh...@gmail.com> wrote:
>>>>
>>>>> Thanks to Jeff and Moon.
>>>>>
>>>>> So currently the active-active model doesn't work on GIT storage, am I
>>>>> right?
>>>>>
>>>>> On Fri, Jun 30, 2017 at 12:16 PM, moon soo Lee <m...@apache.org>
>>>>> wrote:
>>>>>
>>>>>> Basically active-(hot)standby model would work.
>>>>>> Two or more Zeppelin instance can be started and pointing the same
>>>>>> notebook storage, if only one Zeppelin instance (active) change notebook 
>>>>>> at
>>>>>> any given time.
>>>>>>
>>>>>> In case of the active instance fails, one of rest instance can take
>>>>>> over the role by refreshing notebook list and start make change.
>>>>>>
>>>>>> But all these fail over is not provided by Zeppelin and need to
>>>>>> depends on external script or HA software (like Heartbeat).
>>>>>>
>>>>>> Like Jeff mentioned, community does not have concrete plan for having
>>>>>> HA built-in at this moment.
>>>>>>
>>>>>> Hope this helps,
>>>>>>
>>>>>> Thanks,
>>>>>> moon
>>>>>>
>>>>>> On Fri, Jun 30, 2017 at 1:01 PM Jeff Zhang <zjf...@gmail.com> wrote:
>>>>>>
>>>>>>>
>>>>>>> No concrete plan for that. There're other higher priority things
>>>>>>> need to be done. At least it would not be available in 0.8, maybe after 
>>>>>>> 1.0
>>>>>>>
>>>>>>>
>>>>>>> wenxing zheng <wenxing.zh...@gmail.com>于2017年6月30日周五 上午11:47写道:
>>>>>>>
>>>>>>>> Thanks to Jianfeng.
>>>>>>>>
>>>>>>>> Do you  know any plan on this?
>>>>>>>>
>>>>>>>>
>>>>>>>> On Fri, Jun 30, 2017 at 11:32 AM, Jianfeng (Jeff) Zhang <
>>>>>>>> jzh...@hortonworks.com> wrote:
>>>>>>>>
>>>>>>>>> HA is not supported, there’s still  lots of configuration files
>>>>>>>>> stored in local file system.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Best Regard,
>>>>>>>>> Jeff Zhang
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> From: wenxing zheng <wenxing.zh...@gmail.com>
>>>>>>>>> Reply-To: "users@zeppelin.apache.org" <users@zeppelin.apache.org>
>>>>>>>>> Date: Friday, June 30, 2017 at 9:40 AM
>>>>>>>>> To: "users@zeppelin.apache.org" <users@zeppelin.apache.org>
>>>>>>>>> Subject: Query about the high availability of Zeppelin
>>>>>>>>>
>>>>>>>>> Hello all,
>>>>>>>>>
>>>>>>>>> I still didn't find any docs on this topic? Appreciated if anyone
>>>>>>>>> can shed some lights on how to get the Zeppelin into a cluster with
>>>>>>>>> shared/centralized storage
>>>>>>>>>
>>>>>>>>> Regards, Wenxing
>>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>
>>>>
>>>
>>
>

Reply via email to