Hi Dafeng, Actually the purpose of binding notebook with one specific instance is because cached notebooks in memory are not synchronized across other Zeppelin instances even if they're sharing a single notebook storage. That's why i thought a request for specific notebook should go to the specific Zeppelin instance.
Happy new year! Best, moon On Wed, Dec 30, 2015 at 7:19 PM Dafeng Wang <dafe...@microsoft.com> wrote: > Hi Moon, > > > > Thanks again for the answer. Your idea looks great! > > The purpose of binding notebook with one specific instance is just to save > the loading time, right? I’ll do a little bit try with this solution, then > probably will bring more questions: such as how to map one ZeppelinServer > to different clusters, and so onJ > > > > And Happy new year! > > > > Regards, > > Dafeng > > > > *From:* moon soo Lee [mailto:m...@apache.org] > *Sent:* Thursday, December 31, 2015 9:57 AM > > > *To:* users@zeppelin.incubator.apache.org > *Subject:* Re: How to support multiple notebooks in Zeppelin > > > > Hi Dafeng, > > > > Right, all informations are cached in memory once loaded. To serve large > amount of notebooks, we'll need to modify Zeppelin a bit to not keep them > in the memory. > > > > For #2, Multiple ZeppelinServer can be configured to use shared notebook > storage. And loadbalancer in front of them can distribute the REST API and > websocket traffic. > > If loadbalancer redirect the traffic based on notebook id in the path > (e.g. '2BA6X4HHM' from path 'http://localhost:8080/#/notebook/2BA6X4HHM > <http://localhost:9000/#/notebook/2BA6X4HHM>'), > > and let the same notebook id stick to the same Zeppelin instance, > ZeppelinServer will able to scale out for many users. > > > > Will this fits for your use case? Let me know if you have different idea. > > > > Thanks, > > moon > > > > On Wed, Dec 30, 2015 at 5:37 PM Dafeng Wang <dafe...@microsoft.com> wrote: > > Hi Moon, > > > > Thanks for your quick response. > > 1. For first question, got your answer. that means all meta info, > paragraph, and result of one notebook will be cached in memory once loaded, > right? > > 2. For #2, my real question will be how to serve millions of users > for Zeppelin. Per my current understanding REST API and websocket > connection handling will be coupled together within one server, if it’s > true, then my original question will become to “how to scale out the server > that servers REST API, Websocket connection”, or not, what’s the > differences of scaling them when I wanna support more users? > > > > Regards, > > Dafeng > > > > *From:* moon soo Lee [mailto:m...@apache.org] > *Sent:* Thursday, December 31, 2015 12:11 AM > *To:* users@zeppelin.incubator.apache.org > *Subject:* Re: How to support multiple notebooks in Zeppelin > > > > Hi Dafeng, > > > > Zeppelin at the moment keeps every notebook in memory once it's been > loaded. So number of notebooks supported by instance will be limited by the > memory on the system. > > > > Scale out Zeppelin-Server you mean scale out the server that serves REST > API, Websocket connection? > > > > Thanks, > > moon > > On Wed, Dec 30, 2015 at 1:57 AM Dafeng Wang <dafe...@microsoft.com> wrote: > > Hi All, > > > > I had tried with Zeppelin today, it works perfectly in stand-alone mode, > my questions now are: > > 1. Capacity limitation of Zeppelin-Server, in other words, how many > notebook instance one server can support > > 2. If we want to scale out Zepplein-server, is that possible? If > so, then how? Let’s say the environment it’s yarn + spark > > > > Regards, > > Dafeng > >