Github user rawkintrevo closed the pull request at:
https://github.com/apache/zeppelin/pull/1323
---
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1323
closing this- lol at "hotfix"
---
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1758
@AlexanderShoshin I think what @zjffdu and @bzz meant was that you could
possibly call out this notebook in some where like
[`docs/interpreter/flink.md`](https://github.com/apache/zeppelin
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1772
Since this updates docs only, I feel all fails are flaky and unrelated,
however:
[2](https://travis-ci.org/apache/zeppelin/jobs/184305296) Issue with
dependency loader
GitHub user rawkintrevo opened a pull request:
https://github.com/apache/zeppelin/pull/1772
[ZEPPELIN-1798][DOCS] Update docs with benv instead of env in Flink eâ¦
### What is this PR for?
Several Flink examples reference `env` instead of `benv` which was changed
per
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1606
lgtm. +1 to merge
again, awesome add @Leemoonsoo
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1606
@Leemoonsoo still not working for me, but it could be on my end- I have a
hell of a time with git
Did a fetch /hard reset within the last hour.
```bash
$ git fetch
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Profile 2 failed.
```
16/11/06 14:18:51 INFO PySparkInterpreter: File
/tmp/zeppelin_pyspark-1526538107058607857.py created
Tests run: 1, Failures: 1, Errors: 0, Skipped: 0
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1606
@Leemoonsoo awesome work! so happy to see this coming through!
I am running into the same problem as @minahlee Thoughts on why `mvn` build
wouldn't make this come out right?
-
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@felixcheung thanks. Apache Mahout is a kind of special case, in that
really we're a set of libraries that can run on currently Spark, Flink, H20
runners, and possible additionals ones
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz,
- I could see usecases either way. So i've added the
`--overwrite_existing` flag. If that flag is called it will delete current
`%sparkMahout` and `%flinkMahout` interpr
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
- [x] Fail fasts: changed `call` to
[`check_call`](https://docs.python.org/2/library/subprocess.html#check_call)
where it is appropriate to fail and exit.
- [x] Slang in standard out
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
I can address the 3 bullet points in the next day or two.
Re; the disk space- yea Mahout adds another 300MB or so. I recall seeing
some chat on the mailing list that Zeppelin was
GitHub user rawkintrevo opened a pull request:
https://github.com/apache/zeppelin/pull/1552
[HOTFIX] bump flink to 1.1.3
### What is this PR for?
A few sentences describing the overall goals of the pull request's commits.
First time? Check out the contributing guide -
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
bump- status or thoughts on this?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Hey @bzz, did you get a chance to check this out? Anything I can do to help?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/418
Yea I think I merged it or something. I fixed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/418
@corneadoug done and done.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Refactored as previously discussed. Added script that sets up two new
terps, one for mahout on flink, one for mahout on spark. Added docs and a
robust tutorial that shows useage.
---
If
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz, thanks! That is exactly the idea- and I can do the same thing for
Flink.
The cool thing there is, the exact same code* will run on %mahoutSpark and
%mahoutFlink, so that will
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1409
@Leemoonsoo all systems go on re-triggered build.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1409
Flink Tests all passing now.
[Profile3](https://travis-ci.org/apache/zeppelin/jobs/158273244) Cassandra
Test Timeout **Unrelated**
[Profile8](https://travis-ci.org/apache
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz and @Leemoonsoo -
Was talking to some other about this- I think we should take a new tack on
this as Mahout currently only supports Spark v1.5 officially, but also 1.6 in
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
*bump* @bzz should we continue work on this or pursue another route?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1409
[Profile 2](https://travis-ci.org/apache/zeppelin/jobs/157948630)
[Profile 3](https://travis-ci.org/apache/zeppelin/jobs/157948631)
[Profile 4](https://travis-ci.org/apache/zeppelin
GitHub user rawkintrevo opened a pull request:
https://github.com/apache/zeppelin/pull/1409
[ZEPPELIN-1461] Update Flink with latest version 1.1.2
### What is this PR for?
Flink has had two releases since 1.0.3, we are now on 1.1.2
This includes new functionality for
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/1323
@cloverhearts
Not going to pretend I'm an expert on HDFS- here's my use case, and I am
happy to offer further explanation.
I have a [BigInsights
Cloud](http://www-
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
for what it's worth (probably not much) this passed travis CI on my branch-
[https://travis-ci.org/rawkintrevo/incubator-zeppelin/builds/154183905](https://travis-ci.org/rawkin
GitHub user rawkintrevo opened a pull request:
https://github.com/apache/zeppelin/pull/1323
[HOTFIX] Add logins authentication for hdfs file interpreter
### What is this PR for?
When attempting to login to a webhdfs file system with credentials as in:
`https
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz I was really hoping to get it in by 0.6.1 but it's whatever.
[Profile
1](https://s3.amazonaws.com/archive.travis-ci.org/jobs/150945703/log.txt) is
the only failur
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
[Build 6281](https://travis-ci.org/apache/zeppelin/builds/149199571)
Profiles
[1](https://s3.amazonaws.com/archive.travis-ci.org/jobs/149199572/log.txt),
[2](https://s3.amazonaws.com
GitHub user rawkintrevo reopened a pull request:
https://github.com/apache/zeppelin/pull/928
[ZEPPELIN-116] Add Apache Mahout Interpreter
### What is this PR for?
This PR adds Mahout functionality for the Spark Interpreter.
### What type of PR is it
Github user rawkintrevo closed the pull request at:
https://github.com/apache/zeppelin/pull/928
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
## CI Failures in This PR
Profile
[2](https://s3.amazonaws.com/archive.travis-ci.org/jobs/148308638/log.txt)
Hangs a la
[ZEPPELIN-862](https://issues.apache.org/jira/browse
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz only difference between this and last was squash of commit. all fails
are related to downloading dependencies still.
---
If your project is set up for it, you can reply to this email
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz and @jongyoul
OK, so two things I think have happened. (In case some thing like this
happens again).
When I rebased, some version of ZeppelinClusterTest.java that had the
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
I resubmitted this a few times, there is something going awry with finding
`org.apache.apache.resources:apache-jar-resource-bundle:jar:1.5-SNAPSHOT`
The build fails on random
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Currently failing to download deps.
```
[ERROR] Failed to execute goal
org.apache.maven.plugins:maven-remote-resources-plugin:1.4:process (default) on
project zeppelin: Error
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
All failures due to Mahout doesn't support scala 2.11 yet. Adding logic to
detect, similar to detecting spark v < 1.5, and adding to testing suite.
@bzz, pyspark issues s
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/418
@bzz or whoever is in charge of this, failed on flakey test. Can we do a
merge on this?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/418
Yea, I had intentionally left in 'getting started' as it was an install
tutorial, but I just moved it to the 'more section' per your comment.
---
If your project is se
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/418
Good eye @AhyoungRyu , the navigation made more sense in December, I
didn't look at it again since the rebase.
Thanks!
---
If your project is set up for it, you can reply to
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/418
@bzz took advantage of the break in the action on 116 to finish this off.
Should be GTG now.
Updated to Flink 1.0, Maven 3.3.9, Spark 1.6. Also added some points on
just using Spark
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz, I can't recreate the build failure.
I can say
- Spark, pySpark, and Mahout notebooks and paragraphs run as expected.
- Spark and pySpark tests pass. Also, integr
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/418
@bzz so much has changed since I wrote this I want to go through it once
and make sure everything is still accurate. I'm f/t trying to get
[ZEPPELIN-116](https://github.com/apache/zep
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
last commit was to get around that web application fail.
Added my own logging to shed some light on the situation:
`21:59:21,200 INFO
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@Bzz, Re: the three Pyspark related tests in Zeppelin Spark Cluster-
Similar scenarios would always work on my machine, and I am unable to reproduce
the test failure locally. (Have been the
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz I've rebased to master twice now. The failures are on:
- Zeppelin Server: Connection Refused
- Mahout : Out of Memory
`mvn clean verify` works for me locally un
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Also OOM errors in the actual Mahout interpretter...
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Also OOM errors in the actual Mahout interpretter...
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Merged Master- still failing on Zeppelin Server at
```
15:27:30,479 ERROR org.apache.zeppelin.rest.AbstractTestRestApi:251 -
Exception in AbstractTestRestApi while
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz I can imagine. Builds are still failling at random,and I'm not sure
what to make of it. mvn clean verify works well on my system. Let me know if
there is anymore I can do to test
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Lot's of connection test failures when building Zeppelin-Server?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your pr
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz and @Leemoonsoo
A big part of the refactor was introducing no new dependencies- instead
loading from maven or MAHOUT_HOME at interpretter start up via dependency
resolver
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
@bzz not quite done. A little more testing and I realized, jars aren't
being properly loaded when Spark is in cluster mode. Think you could take a
peek and try to give me a hint why
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Per this:
http://stackoverflow.com/questions/32498891/spark-read-and-write-to-parquet-leads-to-outofmemoryerror-java-heap-space
and this:
http://spark.apache.org/docs/latest
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
As I am playing with this, things seem to stop/start working at random...
The Thrift Server error in the Zepplin context, with Java Heap Space errors
related to the kryo serializer in the logs
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
UPDATE:
Sorry for the quick one-two punch. But the above error only occurs in Spark
cluster mode, not in Spark local mode. Leading me to believe jars aren't
getting load
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
Consider the code
```scala
%mahout
val mxRnd = Matrices.symmetricUniformView(5000, 2, 1234)
val drmRand = drmParallelize(mxRnd)
val drmSin = drmRand.mapBlock
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
This appears to be working, but there is a bug when doing OLS regarding the
thift server. It is the same error message one normally gets when trying to use
incompatible versoin of spark. Is
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
If someone could help me out I'd appreciate it...
First of all, this works fine as expected in the notebooks (either way).
In MahoutSparkInterpreter.java line 89, there
Github user rawkintrevo commented on the issue:
https://github.com/apache/zeppelin/pull/928
This says no conflicts but didn't try to rebuild...? I merged with master
hoping to retrigger a build. Not sure what I did wrong here but obviously
something...
---
If your project is s
Github user rawkintrevo commented on the issue:
https://github.com/apache/incubator-zeppelin/pull/928
Total refactor- extends spark interpreter, (not done before as far as I
know, but make more sense imho). Keeps settings local to mahout, not
interfering with main spark terp
Github user rawkintrevo commented on the issue:
https://github.com/apache/incubator-zeppelin/pull/928
my thought is only support spark. If you want to use pyspark/spark/ blah -
use the spark interpreter.
We'll only support additional langs as Mahout introduces interpreter
Github user rawkintrevo commented on the issue:
https://github.com/apache/incubator-zeppelin/pull/928
yea, thats what I'm thinking, then we'll have mahout.spark , mahout.flink,
mahout.h20
---
If your project is set up for it, you can reply to this email and have
Github user rawkintrevo commented on the issue:
https://github.com/apache/incubator-zeppelin/pull/928
@Leemoonsoo definitely the latter... *sigh*. Beginning work on a refactor
(actually already begun bc I had reached the same conclusion).
---
If your project is set up for it, you
Github user rawkintrevo commented on the issue:
https://github.com/apache/incubator-zeppelin/pull/928
Thoughts on why this isn't building?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user rawkintrevo commented on the issue:
https://github.com/apache/incubator-zeppelin/pull/928
Good call out @Leemoonsoo I have plans to put together a nice documentation
page, once we settle on the approach/API/settings (which I believe we now have
done? feedback welcome). I
Github user rawkintrevo commented on the issue:
https://github.com/apache/incubator-zeppelin/pull/928
And intellij took a lot of liberties refactoring my imports... going to
change those back and re push
---
If your project is set up for it, you can reply to this email and have your
Github user rawkintrevo commented on the issue:
https://github.com/apache/incubator-zeppelin/pull/928
Refactored the dependencies out of pom.xml
Shouldn't bee any conflicts now.
It is on the use to specify the directory which contain the nessecary
Mahout
Github user rawkintrevo commented on the pull request:
https://github.com/apache/incubator-zeppelin/pull/928
@bzz Thanks for the comments, It builds fine on my machine, I wanted to see
how it did against CI. Been out of town for a wedding over the holiday weekend
(in US Monday was a
GitHub user rawkintrevo opened a pull request:
https://github.com/apache/incubator-zeppelin/pull/928
[ZEPPELIN-116] Add Mahout Support for Spark Interpreter
### What is this PR for?
This PR adds Mahout functionality for the Spark Interpreter.
### What type of PR is
72 matches
Mail list logo