Hi guys,
I create an umbrella ticket for multiple user support for zeppelin as I can
see more and more requirement of this from community, feel free to add sub
task that I miss.
https://issues.apache.org/jira/browse/ZEPPELIN-1337
On Tue, Aug 9, 2016 at 9:20 AM, Alexander Bezzubov wrote:
> Hi
Hi Egor,
let me share two approaches that we used to archive Apache Zeppelin working
in muliteanant environment with Apache Spark:
- run a separate container with Zeppelin per-user, from the small cluster
of Docker machines (so a single machine runs just 2-3 containers \w
SparkContexts)
This work
, 2016 at 5:11 AM
To: "users@zeppelin.apache.org<mailto:users@zeppelin.apache.org>"
mailto:users@zeppelin.apache.org>>
Subject: Re: Multiuser support of Zeppelin.
One zeppelin per user in mesos container on datanode type server is fine for
me. An Ansible script configure each instance w
One zeppelin per user in mesos container on datanode type server is fine
for me. An Ansible script configure each instance with user specifities and
launch it in Marathon. A service discovery (basic shell script) update an
apache server with basic auth and route each user to his instance. Mesos
als
I need to build a chart for 10 days for all countries(200) for several
products by some dimensions. I would need at least 4-6 gb per zeppelin for
it.
2016-08-05 12:31 GMT-07:00 Mohit Jaggi :
> put your big results somewhere else not in Z’s memory?
>
> On Aug 5, 2016, at 12:26 PM, Egor Pahomov wr
put your big results somewhere else not in Z’s memory?
> On Aug 5, 2016, at 12:26 PM, Egor Pahomov wrote:
>
> - Use spark driver in “cluster mode” where driver runs on a worker instead of
> the node running Z
>
> Even without driver Z is heavy process. You need a lot of RAM to keep big
> resu
>
> - Use spark driver in “cluster mode” where driver runs on a worker instead
> of the node running Z
Even without driver Z is heavy process. You need a lot of RAM to keep big
results from job. And most of all - zeppelin 0.5.6 does not support cluster
mode and I'm not ready to move to 0.6.
2016
Egor,
Running a scale out system like Spark with multiple users is always tricky.
Operating systems are designed to let multiple users share a single machine.
But for “big data” a single user requires the use of several machines which is
the exact opposite. Having said that I would suggest the f
Hi, I'd like to discuss best practices for using zeppelin in the
multi-user environment. There are several naive approaches, I've tried for
at least couple month each and not a single one worked:
*All users on one zeppelin.*
- One spark context - people really break sc and when they are all i