Line feed ( \r ) behind %pyspark command causes interpreter not found

2016-04-11 Thread TITTMANN, Lucas (Allianz en France)
Hi, I have the following problem: I develop code in PyCharm and copy + paste to Zeppelin nodes. I'm on Windows 7, so pasting automatically results in line breaks being "\r\n" instead of simply "\n". However, this results in "Error" - sometimes without error message, sometimes with "pyspark interp

RE: Line feed ( \r ) behind %pyspark command causes interpreter not found

2016-04-11 Thread Joel Van Veluwen
I've raised this issue here. I don't have the time or talent to fix the issue... https://issues.apache.org/jira/browse/ZEPPELIN-577 Joel Van Veluwen QUANTIUM Level 25, 8 Chifley 8-12 Chifley Square Sydney NSW 2000 T: +61 2 8224 8981 M: +61 403 153 265 F: +61 2 9292 6444 W: quantium.com.au

Tons of jetty directories in /tmp?

2016-04-11 Thread Chris Miller
Hi, I'm running Zeppelin from the latest EMR distribution. For some reason, Zeppelin is creating *many* "jetty" directories in /tmp -- so much so that it filled up the disk on my namenode. Example: ** drwxr-xr-x 3 zeppelin zeppelin 19 Mar 31 12:14 jetty-0.0.0.0-8890-zeppelin-

Zeppelin on EMR doesn't start?

2016-04-11 Thread Chris Miller
Hi. I'm running Zeppelin on the latest EMR distribution. Recently the Zeppelin service will not start. I'm not sure what's going on... here is what I see in the logs: WARN [2016-04-11 11:36:29,512] ({main} ZeppelinConfiguration.java[create]:95) - Failed to load configuration,

Re: Zeppelin server hangs due to websocket deadlock bug in Jetty 8

2016-04-11 Thread Alexander Bezzubov
Thank you for pointing LIST_NOTES broadcasting to every client, I'm not sure that that's what was meant to happen in such case. I have never seen the behavior you describe and it looks like a race condition on a run note message. Did you have a chance to try applying only the first part of the cha

pyspark and elasticsearch - accessing ES fileds

2016-04-11 Thread Oren Shani
Hi All, I connected pyspark under Zeppelin to my Elasticsearch DB and I am able to do this: %pyspark es_rdd = sc.newAPIHadoopRDD( inputFormatClass="org.elasticsearch.hadoop.mr.EsInputFormat", keyClass="org.apache.hadoop.io.NullWritable", valueClass="org.elasticsearch.hadoop.mr.Linked

Re: Zeppelin on EMR doesn't start?

2016-04-11 Thread Hyung Sung Shim
Hello. It seems like one of your notebook json file has problem. Could you check your notebook files are valid? and What zeppelin and EMR version are you using? 2016-04-11 20:41 GMT+09:00 Chris Miller : > Hi. I'm running Zeppelin on the latest EMR distribution. Recently the > Zeppelin service wil

Interpreter Alias

2016-04-11 Thread John Omernik
So I copied the %jdbc interpreter. I am looking to create a specific connection to Apache Drill, and would like to use the jdbc interpreter, but invoke it by calling %drill rather than %jdbc, is this possible? I tried create interpreter and did a new name "drill" with interpreter type jdbc, but I j

Re: HA for Zeppelin

2016-04-11 Thread Johnny W.
John & Vincent, I am interested in the per instance per user approach. I have some questions about this approach: -- 1. how long will it take to launch a Zeppelin instance (and initialize SparkContext) when user log in? 2. will the instance be destroyed when user log out? if not, how do you deal wi

Re: HA for Zeppelin

2016-04-11 Thread John Omernik
1. Things launch pretty fast for me, however, it depends if the docker container I am running Zeppelin in is cached on the node mesos wants to run it on. If not, it pulls from a local docker registry, so worst case, up to a minute to get things running if the image isn't cached. 2. No, if the user

Re: HA for Zeppelin

2016-04-11 Thread Johnny W.
Thanks John for your insights. For 2., one solution we have experimented is spark dynamic resource allocation. We could define a timer to scale down. Hope that helps. J. On Mon, Apr 11, 2016 at 4:24 PM, John Omernik wrote: > 1. Things launch pretty fast for me, however, it depends if the docke

Re: Interpreter Alias

2016-04-11 Thread Jongyoul Lee
Hi John, Zeppelin doesn't support that feature for now. Regards, Jongyoul On Tue, Apr 12, 2016 at 3:26 AM, John Omernik wrote: > So I copied the %jdbc interpreter. I am looking to create a specific > connection to Apache Drill, and would like to use the jdbc interpreter, but > invoke it by cal

Re: Interpreter Alias

2016-04-11 Thread moon soo Lee
Hi John, Here's a related thread you might also interested. http://apache-zeppelin-users-incubating-mailing-list.75479.x6.nabble.com/Multiple-spark-interpreters-in-the-same-Zeppelin-instance-tp2171.html Thanks, moon On Tue, Apr 12, 2016 at 7:26 AM Jongyoul Lee wrote: > Hi John, > > Zeppelin do

Re: HA for Zeppelin

2016-04-11 Thread vincent gromakowski
1. I am using ansible to deploy zeppelin on all slaves and to launch zeppelin instance for one user. So if zeppelin binaries are already deployed, the launch is very quick through marathon (1 or 2 sec). ooking for velocity solution (based on jfrog) on Mesos to manage binaries and artifacts with ver