+1 to the idea of separating the "code" from the configuration, data, logs
and notebooks at a high level directory.

BTW, 1.2GB doesn't seem too large. But you could perhaps leave out the
interpreters you don't want to use.

On Tue, Sep 6, 2016 at 11:39 AM, John Omernik <j...@omernik.com> wrote:

> Hey all,
>
> I know folks are using Docker with Zeppelin, and I am trying to find a
> nice balance between size of images and usefulness of the image.
>
> Basically, when I build Zeppelin, the resulting directory is quite large,
> 1.2GB.   So the first question is, what is actually needed in the Docker
> Image? How can I make it smaller?
>
> At the same time, I am trying to find ways to ensure things like the Conf
> directory, the Logs directory etc can exist outside the container for
> persistence.
>
> Simple right?  Well I am using some ENV variables as such
>
> "ZEPPELIN_CONF_DIR":"/conf",
>
> "ZEPPELIN_NOTEBOOK_DIR":"/notebooks",
>
> "ZEPPELIN_HOME":"/zeppelin",
>
> "ZEPPELIN_LOG_DIR":"/logs",
>
>
> and having the root live at Zeppelin, and having conf, notebooks, and logs
> be volumes mounted to external persistent storage.
>
>
> This is working, but what if I want to create a directory for "custom"
> jars, interpreters, etc, what would the easier way to do that be?   I know
> in the Apache Drill project, they've added the concept of a "site"
> directory that is a bit more holistic than a "conf" directory as it allows
> them to add libs and keep that directory complete separated from the
> released jars  (https://drill.apache.org/docs/apache-drill-1-8-0-
> release-notes/) Is there anything like this in Zeppelin? Is there an easy
> way to keep customized things in a directory that is separate?
>
>
> Any thoughts on how you've optimized Dockerized setups for Zeppelin would
> be welcome! Thanks!
>
>
> John
>
>
>

Reply via email to