Re: Spark JIRA Report

2014-12-13 Thread Andrew Ash
The goal of increasing visibility on open issues is a good one. How is this different from just a link to Jira though? Some might say this adds noise to the mailing list and doesn't contain any information not already available in Jira. The idea seems good but the formatting leaves a little to b

Governance of the Jenkins whitelist

2014-12-13 Thread Andrew Ash
Jenkins is a really valuable tool for increasing quality of incoming patches to Spark, but I've noticed that there are often a lot of patches waiting for testing because they haven't been approved for testing. Certain users can instruct Jenkins to run on a PR, or add other users to a whitelist. Ho

Spark JIRA Report

2014-12-13 Thread Nicholas Chammas
What do y’all think of a report like this emailed out to the dev list on a monthly basis? The goal would be to increase visibility into our open issues and encourage developers to tend to our issue tracker more frequently. Nick There are 1,236 unresolved issues

Re: Nabble mailing list mirror errors: "This post has NOT been accepted by the mailing list yet"

2014-12-13 Thread Yana Kadiyska
Since you mentioned this, I had a related quandary recently -- it also says that the forum archives "u...@spark.incubator.apache.org" / "d...@spark.incubator.apache.org" respectively, yet the "Community page" clearly says to email the @spark.apache.org list (but the Nabble archive is linked right

Nabble mailing list mirror errors: "This post has NOT been accepted by the mailing list yet"

2014-12-13 Thread Josh Rosen
I've noticed that several users are attempting to post messages to Spark's user / dev mailing lists using the Nabble web UI ( http://apache-spark-user-list.1001560.n3.nabble.com/). However, there are many posts in Nabble that are not posted to the Apache lists and are flagged with "This post has N

Re: [VOTE] Release Apache Spark 1.2.0 (RC2)

2014-12-13 Thread Sean McNamara
+1 tested on OS X and deployed+tested our apps via YARN into our staging cluster. Sean > On Dec 11, 2014, at 10:40 AM, Reynold Xin wrote: > > +1 > > Tested on OS X. > > On Wednesday, December 10, 2014, Patrick Wendell wrote: > >> Please vote on releasing the following candidate as Apache

Re: [VOTE] Release Apache Spark 1.2.0 (RC2)

2014-12-13 Thread slcclimber
I am building and testing using sbt. I get a lot of "Job aborted due to stage failure: Master removed our application: FAILED" did not contain "cancelled", and "Job aborted due to stage failure: Master removed our application: FAILED" did not contain "killed" errors trying to run tests. (JobCance

Re: [VOTE] Release Apache Spark 1.2.0 (RC2)

2014-12-13 Thread Nick Pentreath
+1 — Sent from Mailbox On Sat, Dec 13, 2014 at 3:12 PM, GuoQiang Li wrote: > +1 (non-binding). Tested on CentOS 6.4 > -- Original -- > From: "Patrick Wendell";; > Date: Thu, Dec 11, 2014 05:08 AM > To: "dev@spark.apache.org"; > Subject: [VOTE] Release Apac

Re: [VOTE] Release Apache Spark 1.2.0 (RC2)

2014-12-13 Thread GuoQiang Li
+1 (non-binding). Tested on CentOS 6.4 -- Original -- From: "Patrick Wendell";; Date: Thu, Dec 11, 2014 05:08 AM To: "dev@spark.apache.org"; Subject: [VOTE] Release Apache Spark 1.2.0 (RC2) Please vote on releasing the following candidate as Apache Sp

Re: [VOTE] Release Apache Spark 1.2.0 (RC2)

2014-12-13 Thread Tom Graves
+1 built and tested on Yarn on Hadoop 2.x cluster. Tom On Saturday, December 13, 2014 12:48 AM, Denny Lee wrote: +1 Tested on OSX Tested Scala 2.10.3, SparkSQL with Hive 0.12 / Hadoop 2.5, Thrift Server, MLLib SVD On Fri Dec 12 2014 at 8:57:16 PM Mark Hamstra wrote: > +1 > > On

Re: one hot encoding

2014-12-13 Thread Sandy Ryza
Hi Lochana, We haven't yet added this in 1.2. https://issues.apache.org/jira/browse/SPARK-4081 tracks adding categorical feature indexing, which one-hot encoding can be built on. https://issues.apache.org/jira/browse/SPARK-1216 also tracks a version of this prior to the ML pipelines work. -Sandy
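For readers unfamiliar with the relationship mentioned above, here is a minimal plain-Python sketch of how one-hot encoding can be layered on top of a category index. It is not Spark or MLlib code and does not reflect the design in either JIRA ticket; the helper names `build_index` and `one_hot` and the sample data are hypothetical, chosen only for illustration.

```python
def build_index(values):
    """Assign each distinct category an integer index, in order of first appearance."""
    index = {}
    for v in values:
        if v not in index:
            index[v] = len(index)
    return index


def one_hot(value, index):
    """Return a vector with 1.0 at the category's index and 0.0 elsewhere."""
    vec = [0.0] * len(index)
    vec[index[value]] = 1.0
    return vec


# Hypothetical categorical column.
colors = ["red", "green", "red", "blue"]

# Step 1: categorical feature indexing.
index = build_index(colors)

# Step 2: one-hot encoding built on that index.
encoded = [one_hot(c, index) for c in colors]
```

The indexing step fixes the vector length and the position of each category, which is why the encoder needs it first.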