subject:"\[VOTE\] Release Apache Spark 1.3.0 \(RC3\)"

[RESULT] [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-10 Thread Patrick Wendell

20 > 收件人: "GuoQiang Li"; > 主题: Re: [VOTE] Release Apache Spark 1.3.0 (RC3) > > Thanks! But please e-mail the dev list and not just me personally :) > > On Mon, Mar 9, 2015 at 11:08 PM, GuoQiang Li wrote: >> +1 (non-binding) >> >> Test on Mac OS X 10.10.2

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Corey Nolet

+1 (non-binding) - Verified signatures - Built on Mac OS X and Fedora 21. On Mon, Mar 9, 2015 at 11:01 PM, Krishna Sankar wrote: > Excellent, Thanks Xiangrui. The mystery is solved. > Cheers > > > > On Mon, Mar 9, 2015 at 3:30 PM, Xiangrui Meng wrote: > > > Krishna, I tested your linear regre

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Krishna Sankar

Excellent, Thanks Xiangrui. The mystery is solved. Cheers On Mon, Mar 9, 2015 at 3:30 PM, Xiangrui Meng wrote: > Krishna, I tested your linear regression example. For linear > regression, we changed its objective function from 1/n * \|A x - > b\|_2^2 to 1/(2n) * \|Ax - b\|_2^2 to be consistent

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Joseph Bradley

+1 Tested on Mac OS X On Mon, Mar 9, 2015 at 3:30 PM, Xiangrui Meng wrote: > Krishna, I tested your linear regression example. For linear > regression, we changed its objective function from 1/n * \|A x - > b\|_2^2 to 1/(2n) * \|Ax - b\|_2^2 to be consistent with common least > squares formulati

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Xiangrui Meng

Krishna, I tested your linear regression example. For linear regression, we changed its objective function from 1/n * \|A x - b\|_2^2 to 1/(2n) * \|Ax - b\|_2^2 to be consistent with common least squares formulations. It means you could re-produce the same result by multiplying the step size by 2.

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

2015-03-09 Thread Andrew Ash

Does the Apache project team have any ability to measure download counts of the various releases? That data could be useful when it comes time to sunset vendor-specific releases, like CDH4 for example. On Mon, Mar 9, 2015 at 5:34 AM, Mridul Muralidharan wrote: > In ideal situation, +1 on removi

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Sean Owen

I'm +1 as I have not heard of any one else seeing the Hive test failure, which is likely a test issue rather than code issue anyway, and not a blocker. On Fri, Mar 6, 2015 at 9:36 PM, Sean Owen wrote: > Although the problem is small, especially if indeed the essential docs > changes are following

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Kostas Sakellis

+1 on RC3 I agree that this should not block the release. Once we have a fix for it, putting it in a double dot release sounds like a good plan. Kostas On Mon, Mar 9, 2015 at 11:27 AM, Patrick Wendell wrote: > Hey All, > > Today there was a JIRA posted with an observed regression around Spar

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Patrick Wendell

Hey All, Today there was a JIRA posted with an observed regression around Spark Streaming during certain recovery scenarios: https://issues.apache.org/jira/browse/SPARK-6222 My preference is to go ahead and ship this release (RC3) as-is and if this issue is isolated resolved soon, we can make a

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Denny Lee

+1 (non-binding) Spark Standalone and YARN on Hadoop 2.6 on OSX plus various tests (MLLib, SparkSQL, etc.) On Mon, Mar 9, 2015 at 9:18 AM Tom Graves wrote: > +1. Built from source and ran Spark on yarn on hadoop 2.6 in cluster and > client mode. > Tom > > On Thursday, March 5, 2015 8:53 PM

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Tom Graves

+1. Built from source and ran Spark on yarn on hadoop 2.6 in cluster and client mode. Tom On Thursday, March 5, 2015 8:53 PM, Patrick Wendell wrote: Please vote on releasing the following candidate as Apache Spark version 1.3.0! The tag to be voted on is v1.3.0-rc2 (commit 4aaf48d4

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-09 Thread Sean McNamara

+1 Ran local tests and tested our spark apps on a spark+yarn cluster. Cheers, Sean > On Mar 8, 2015, at 11:51 PM, Sandy Ryza wrote: > > +1 (non-binding, doc and packaging issues aside) > > Built from source, ran jobs and spark-shell against a pseudo-distributed > YARN cluster. > > On Sun,

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

2015-03-09 Thread Mridul Muralidharan

In ideal situation, +1 on removing all vendor specific builds and making just hadoop version specific - that is what we should depend on anyway. Though I hope Sean is correct in assuming that vendor specific builds for hadoop 2.4 are just that; and not 2.4- or 2.4+ which cause incompatibilities for

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

2015-03-09 Thread Sean Owen

Yes, you should always find working bits at Apache no matter what -- though 'no matter what' really means 'as long as you use Hadoop distro compatible with upstream Hadoop'. Even distros have a strong interest in that, since the market, the 'pie', is made large by this kind of freedom at the core.

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-08 Thread Sandy Ryza

+1 (non-binding, doc and packaging issues aside) Built from source, ran jobs and spark-shell against a pseudo-distributed YARN cluster. On Sun, Mar 8, 2015 at 2:42 PM, Krishna Sankar wrote: > Yep, otherwise this will become an N^2 problem - Scala versions X Hadoop > Distributions X ... > > May

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

2015-03-08 Thread Matei Zaharia

Yeah, my concern is that people should get Apache Spark from *Apache*, not from a vendor. It helps everyone use the latest features no matter where they are. In the Hadoop distro case, Hadoop made all this effort to have standard APIs (e.g. YARN), so it should be easy. But it is a problem if we'

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

2015-03-08 Thread Sean Owen

Yeah it's not much overhead, but here's an example of where it causes a little issue. I like that reasoning. However, the released builds don't track the later versions of Hadoop that vendors would be distributing -- there's no Hadoop 2.6 build for example. CDH4 is here, but not the far-more-used

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

2015-03-08 Thread Patrick Wendell

I think it's important to separate the goals from the implementation. I agree with Matei on the goal - I think the goal needs to be to allow people to download Apache Spark and use it with CDH, HDP, MapR, whatever... This is the whole reason why HDFS and YARN have stable API's, so that other projec

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

2015-03-08 Thread Matei Zaharia

Our goal is to let people use the latest Apache release even if vendors fall behind or don't want to package everything, so that's why we put out releases for vendors' versions. It's fairly low overhead. Matei > On Mar 8, 2015, at 5:56 PM, Sean Owen wrote: > > Ah. I misunderstood that Matei w

Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

2015-03-08 Thread Sean Owen

Ah. I misunderstood that Matei was referring to the Scala 2.11 tarball at http://people.apache.org/~pwendell/spark-1.3.0-rc3/ and not the Maven artifacts. Patrick I see you just commented on SPARK-5134 and will follow up there. Sounds like this may accidentally not be a problem. On binary tarball

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-08 Thread Krishna Sankar

Yep, otherwise this will become an N^2 problem - Scala versions X Hadoop Distributions X ... May be one option is to have a minimum basic set (which I know is what we are discussing) and move the rest to spark-packages.org. There the vendors can add the latest downloads - for example when 1.4 is r

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-08 Thread Patrick Wendell

We probably want to revisit the way we do binaries in general for 1.4+. IMO, something worth forking a separate thread for. I've been hesitating to add new binaries because people (understandably) complain if you ever stop packaging older ones, but on the other hand the ASF has complained that we

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-08 Thread Sean Owen

Yeah, interesting question of what is the better default for the single set of artifacts published to Maven. I think there's an argument for Hadoop 2 and perhaps Hive for the 2.10 build too. Pros and cons discussed more at https://issues.apache.org/jira/browse/SPARK-5134 https://github.com/apache/

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-08 Thread Matei Zaharia

+1 Tested it on Mac OS X. One small issue I noticed is that the Scala 2.11 build is using Hadoop 1 without Hive, which is kind of weird because people will more likely want Hadoop 2 with Hive. So it would be good to publish a build for that configuration instead. We can do it if we do a new RC

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Patrick Wendell

For now, I'll just put this as critical. We can discuss the documentation stuff offline or in another thread. On Fri, Mar 6, 2015 at 1:36 PM, Sean Owen wrote: > Although the problem is small, especially if indeed the essential docs > changes are following just a couple days behind the final relea

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Sean Owen

Although the problem is small, especially if indeed the essential docs changes are following just a couple days behind the final release, I mean, why the rush if they're essential? wait a couple days, finish them, make the release. Answer is, I think these changes aren't actually essential given t

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Tathagata Das

To add to what Patrick said, the only reason that those JIRAs are marked as Blockers (at least I can say for myself) is so that they are at the top of the JIRA list signifying that these are more *immediate* issues than all the Critical issues. To make it less confusing for the community voting, we

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Patrick Wendell

Sean, The docs are distributed and consumed in a fundamentally different way than Spark code itself. So we've always considered the "deadline" for doc changes to be when the release is finally posted. If there are small inconsistencies with the docs present in the source code for that release tag

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Krishna Sankar

+1 (non-binding, of course) 1. Compiled OSX 10.10 (Yosemite) OK Total time: 13:55 min mvn clean package -Pyarn -Dyarn.version=2.6.0 -Phadoop-2.4 -Dhadoop.version=2.6.0 -Phive -DskipTests -Dscala-2.11 2. Tested pyspark, mlib - running as well as compare results with 1.1.x & 1.2.x pyspark wo

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Marcelo Vanzin

+1 (non-binding, doc issues aside) Ran batch of tests against yarn and standalone, including tests for rc2 blockers, all looks fine. On Thu, Mar 5, 2015 at 6:52 PM, Patrick Wendell wrote: > Please vote on releasing the following candidate as Apache Spark version > 1.3.0! > > The tag to be voted

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Sean Owen

Given the title and tagging, it sounds like there could be some must-have doc changes to go with what is being released as 1.3. It can be finished later, and published later, but then the docs source shipped with the release doesn't match the site, and until then, 1.3 is released without some "must

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Patrick Wendell

Hey Sean, > SPARK-5310 Update SQL programming guide for 1.3 > SPARK-5183 Document data source API > SPARK-6128 Update Spark Streaming Guide for Spark 1.3 For these, the issue is that they are documentation JIRA's, which don't need to be timed exactly with the release vote, since we can update the

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Sean Owen

There are still three JIRAs marked as blockers for 1.3.0: SPARK-5310 Update SQL programming guide for 1.3 SPARK-5183 Document data source API SPARK-6128 Update Spark Streaming Guide for Spark 1.3 As a matter of hygiene, let's either mark them resolved if they're resolved, or push them / depriorit

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-06 Thread Patrick Wendell

I'll kick it off with a +1. On Thu, Mar 5, 2015 at 6:52 PM, Patrick Wendell wrote: > Please vote on releasing the following candidate as Apache Spark version > 1.3.0! > > The tag to be voted on is v1.3.0-rc2 (commit 4aaf48d4): > https://git-wip-us.apache.org/repos/asf?p=spark.git;a=commit;h=4aaf

[VOTE] Release Apache Spark 1.3.0 (RC3)

2015-03-05 Thread Patrick Wendell

Please vote on releasing the following candidate as Apache Spark version 1.3.0! The tag to be voted on is v1.3.0-rc2 (commit 4aaf48d4): https://git-wip-us.apache.org/repos/asf?p=spark.git;a=commit;h=4aaf48d46d13129f0f9bdafd771dd80fe568a7dc The release files, including signatures, digests, etc. ca

[RESULT] [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

Re: Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

Release Scala version vs Hadoop version (was: [VOTE] Release Apache Spark 1.3.0 (RC3))

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

Re: [VOTE] Release Apache Spark 1.3.0 (RC3)

[VOTE] Release Apache Spark 1.3.0 (RC3)

35 matches

Site Navigation

Mail list logo

Footer information