I like the 'zeppelin-bin-netinst’ idea too. Hopefully it would be easy to 
configure it to work with a proxy for users behind a corporate firewall.
Thanks, Lucas.

From: Mohit Jaggi [mailto:mohitja...@gmail.com]
Sent: 17 June 2016 18:06
To: users@zeppelin.apache.org
Subject: EXT: Re: Ask opinion regarding 0.6.0 release package

sure…that is possible. one can also make a build from source and customize as 
needed. but not having to do that makes things easier. i do believe that for 
the vast majority of cases a minimal build with spark (and possibly other small 
items like shell, jdbc, python) will be quite valuable, imho.
is there a lot of overhead involved in having multiple binaries available? i am 
happy to volunteer some time to help with this if needed.

On Jun 17, 2016, at 9:45 PM, moon soo Lee 
<m...@apache.org<mailto:m...@apache.org>> wrote:

In case of no internet access, how about

a. download 'zeppelin-bin-netinst' and run 'bin/install-interpreter.sh', and 
then copy the package to production env.
b. download 'zeppelin-bin-all' and copy the package to production env.

?

Thanks,
moon


On Fri, Jun 17, 2016 at 9:10 AM Mohit Jaggi 
<mohitja...@gmail.com<mailto:mohitja...@gmail.com>> wrote:
Many production environments have no internet access. A script like  this can 
be useful to some but it should not replace the proposed min binary.

Sent from my iPhone

On Jun 17, 2016, at 9:20 PM, moon soo Lee 
<m...@apache.org<mailto:m...@apache.org>> wrote:
Hi,

Thanks for bringing this discussion.
it's great idea minimize binary package size.

Can we set a policy to decide which interpreter goes to 'zeppelin-bin-min', 
which is not?

One alternative is, instead of making 'zeppelin-bin-min', we can make 
'zeppelin-bin-netinst'.
We can provide a shell script such as, 'bin/install-interpreter.sh' and the 
script will download interpreters and their dependencies from maven repository 
and store under /interpreter dir. By leveraging DependencyResolver[1], i think 
we can make this feature in couple of hours.

Only spark interpreter can not be installed in simple way, while it requires 
some python and R packages under /interpreter dir and they're not available on 
maven repository, so it'll need special treatment, but all other interpreters 
can be installed in the simple way.

Then, 'zeppelin-bin-netinst' version can have minimal package size, and still 
gives easy way to install all the interpreters.
Also 'bin/install-interpreter.sh' will still useful even if we have dynamic 
interpreter loading feature [2], to build offline package.

what do you think?

[1] 
https://github.com/apache/zeppelin/blob/master/zeppelin-interpreter/src/main/java/org/apache/zeppelin/dep/DependencyResolver.java<https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_apache_zeppelin_blob_master_zeppelin-2Dinterpreter_src_main_java_org_apache_zeppelin_dep_DependencyResolver.java&d=CwMFaQ&c=IV_clAzoPDE253xZdHuilRgztyh_RiV3wUrLrDQYWSI&r=c1CCNND4PG-Q_V2AJWDWrugZAXQ8Y3EE_f_mAHcpXcs&m=5yX9TVM8vp2oYgFtB4gACTyCQL3FWTK2OoSXVzsJpdg&s=b48EeMu0glDkXuGn72ZTy8ZteEiVBzmpbTqELmhgsRc&e=>
[2] 
https://issues.apache.org/jira/browse/ZEPPELIN-598<https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jira_browse_ZEPPELIN-2D598&d=CwMFaQ&c=IV_clAzoPDE253xZdHuilRgztyh_RiV3wUrLrDQYWSI&r=c1CCNND4PG-Q_V2AJWDWrugZAXQ8Y3EE_f_mAHcpXcs&m=5yX9TVM8vp2oYgFtB4gACTyCQL3FWTK2OoSXVzsJpdg&s=MK9lpcZjSIlgFO0CVk6kMWB1bCPqpWK_0qhSjOQ5FzA&e=>


On Fri, Jun 17, 2016 at 1:02 AM mina lee 
<mina...@apache.org<mailto:mina...@apache.org>> wrote:
Hi all!

Zeppelin just started release process. Prior to creating release candidate I 
want to ask users' opinion about how you want it to be packaged.

For the last release(0.5.6), we have released one binary package which includes 
all interpreters.
The concern with providing one type of binary package is that package size will 
be quite big(~600MB).
So I am planning to provide two binary packages:
  - zeppelin-0.6.0-bin-all.tgz (includes all interpreters)
  - zeppelin-0.6.0-bin-min.tgz (includes only most used interpreters)

I am thinking about putting spark(pyspark, sparkr, sql), python, jdbc, shell, 
markdown, angular in minimized package.
Could you give your opinion on whether these sets are enough, or some of them 
are ok to be excluded?

Community's opinion will be helpful to make decision not only for 0.6.0 but 
also for 0.7.0 release since we are planning to provide only minimized package 
from 0.7.0 release. From the 0.7.0 version, interpreters those are not included 
in binary package will be able to use dynamic interpreter feature [1] which is 
in progress under [2].

Thanks,
Mina

[1] 
http://zeppelin.apache.org/docs/0.6.0-SNAPSHOT/manual/dynamicinterpreterload.html<https://urldefense.proofpoint.com/v2/url?u=http-3A__zeppelin.apache.org_docs_0.6.0-2DSNAPSHOT_manual_dynamicinterpreterload.html&d=CwMFaQ&c=IV_clAzoPDE253xZdHuilRgztyh_RiV3wUrLrDQYWSI&r=c1CCNND4PG-Q_V2AJWDWrugZAXQ8Y3EE_f_mAHcpXcs&m=5yX9TVM8vp2oYgFtB4gACTyCQL3FWTK2OoSXVzsJpdg&s=4zHncvKGMfOlq-dTmD3m23Rv0jjkaqwWEnowkaJSHks&e=>
[2] 
https://github.com/apache/zeppelin/pull/908<https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_apache_zeppelin_pull_908&d=CwMFaQ&c=IV_clAzoPDE253xZdHuilRgztyh_RiV3wUrLrDQYWSI&r=c1CCNND4PG-Q_V2AJWDWrugZAXQ8Y3EE_f_mAHcpXcs&m=5yX9TVM8vp2oYgFtB4gACTyCQL3FWTK2OoSXVzsJpdg&s=TB3EaiWKtliKYXmXWHJLyZK4Kti6Ev97GVBJFfhCcVw&e=>

Reply via email to