Re: [VOTE] Release Apache Spark 1.5.0 (RC3)

2015-09-03 Thread Denny Lee
+1 Distinct count test is blazing fast - awesome!, On Thu, Sep 3, 2015 at 8:21 PM Krishna Sankar wrote: > +? > > 1. Compiled OSX 10.10 (Yosemite) OK Total time: 26:09 min > mvn clean package -Pyarn -Phadoop-2.6 -DskipTests > 2. Tested pyspark, mllib > 2.1. statistics (min,max,mean,Pearson,

Re: [VOTE] Release Apache Spark 1.5.0 (RC3)

2015-09-03 Thread Krishna Sankar
+? 1. Compiled OSX 10.10 (Yosemite) OK Total time: 26:09 min mvn clean package -Pyarn -Phadoop-2.6 -DskipTests 2. Tested pyspark, mllib 2.1. statistics (min,max,mean,Pearson,Spearman) OK 2.2. Linear/Ridge/Laso Regression OK 2.3. Decision Tree, Naive Bayes OK 2.4. KMeans OK Center And S

Re: Code generation for GPU

2015-09-03 Thread Reynold Xin
See responses inline. On Thu, Sep 3, 2015 at 1:58 AM, kiran lonikar wrote: > Hi, > >1. I found where the code generation > > > happens >in

Re: [VOTE] Release Apache Spark 1.5.0 (RC3)

2015-09-03 Thread saurfang
+1. Compiled on Windows with YARN and Hive. Tested Tungsten aggregation and observed similar (good) performance comparing to 1.4 with unsafe on. Ran a few workloads and tested SparkSQL thrift server -- View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/VOTE-

Re: [VOTE] Release Apache Spark 1.5.0 (RC3)

2015-09-03 Thread Davies Liu
+1, built 1.5 from source and ran TPC-DS locally and clusters, ran performance benchmark for aggregation and join with difference scales, all worked well. On Thu, Sep 3, 2015 at 10:05 AM, Michael Armbrust wrote: > +1 Ran TPC-DS and ported several jobs over to 1.5 > > On Thu, Sep 3, 2015 at 9:57 A

Re: [VOTE] Release Apache Spark 1.5.0 (RC3)

2015-09-03 Thread Michael Armbrust
+1 Ran TPC-DS and ported several jobs over to 1.5 On Thu, Sep 3, 2015 at 9:57 AM, Burak Yavuz wrote: > +1. Tested complex R package support (Scala + R code), BLAS and DataFrame > fixes good. > > Burak > > On Thu, Sep 3, 2015 at 8:56 AM, mkhaitman > wrote: > >> Built and tested on CentOS 7, Hado

Re: [VOTE] Release Apache Spark 1.5.0 (RC3)

2015-09-03 Thread Burak Yavuz
+1. Tested complex R package support (Scala + R code), BLAS and DataFrame fixes good. Burak On Thu, Sep 3, 2015 at 8:56 AM, mkhaitman wrote: > Built and tested on CentOS 7, Hadoop 2.7.1 (Built for 2.6 profile), > Standalone without any problems. Re-tested dynamic allocation specifically. > > "L

EOFException on History server reading in progress lz4

2015-09-03 Thread andrew.rowson
I'm trying to solve a problem of the history server spamming my logs with EOFExceptions when it tries to read a history file from HDFS that is both lz4 compressed and incomplete. The actual exception is: java.io.EOFException: Stream ended prematurely at net.jpountz.lz4.LZ4BlockInputStream.

Re: [VOTE] Release Apache Spark 1.5.0 (RC3)

2015-09-03 Thread mkhaitman
Built and tested on CentOS 7, Hadoop 2.7.1 (Built for 2.6 profile), Standalone without any problems. Re-tested dynamic allocation specifically. "Lost executor" messages are still an annoyance since they're expected to occur with dynamic allocation, and shouldn't WARN/ERROR as they do now, however

Re: [VOTE] Release Apache Spark 1.5.0 (RC3)

2015-09-03 Thread Tom Graves
+1. Tested on Yarn with Hadoop 2.6.  A few of the things tested: pyspark, hive integration, aux shuffle handler, history server, basic submit cli behavior, distributed cache behavior, cluster and client mode... Tom On Tuesday, September 1, 2015 3:42 PM, Reynold Xin wrote: Please v

Re: [HELP] Spark 1.4.1 tasks take ridiculously long time to complete

2015-09-03 Thread robineast
I would suggest you move this to the Spark User list, this is the development list for discussion on development of Spark. It would help if you could give some more information about what you are trying to do e.g. what code you are running, how you submitted the job (spark-shell, spark-submit) and