Re: Decimals

2017-12-13 Thread Reynold Xin
Responses inline On Tue, Dec 12, 2017 at 2:54 AM, Marco Gaido wrote: > Hi all, > > I saw in these weeks that there are a lot of problems related to decimal > values (SPARK-22036, SPARK-22755, for instance). Some are related to > historical choices, which I don't know, thus please excuse me if I

Hinge Gradient

2017-12-13 Thread Debasish Das
Hi, I looked into the LinearSVC flow and found the gradient for hinge as follows: Our loss function with {0, 1} labels is max(0, 1 - (2y - 1) (f_w(x))) Therefore the gradient is -(2y - 1)*x max is a non-smooth function. Did we try using ReLu/Softmax function and use that to smooth the hinge los

Re: Leveraging S3 select

2017-12-13 Thread Steve Loughran
On 8 Dec 2017, at 17:05, Andrew Duffy mailto:adu...@palantir.com>> wrote: Hey Steve, Happen to have a link to the TPC-DS benchmark data w/random S3 reads? I've done a decent amount of digging, but all I've found is a reference in a slide deck Is that one of mine? We haven't done any benchma

Re: [RESULT][VOTE] Spark 2.2.1 (RC2)

2017-12-13 Thread Sean Owen
Looks like Maven artifacts are up, site's up -- what about the Python and R artifacts? I can also move the spark.apache/docs/latest link to point to 2.2.1 if it's pretty ready. We should announce the release officially too then. On Wed, Dec 6, 2017 at 5:00 PM Felix Cheung wrote: > I saw the svn

Re: [RESULT][VOTE] Spark 2.2.1 (RC2)

2017-12-13 Thread Shivaram Venkataraman
The R artifacts have some issue that Felix and I are debugging. Lets not block the announcement for that. Thanks Shivaram On Wed, Dec 13, 2017 at 5:59 AM, Sean Owen wrote: > Looks like Maven artifacts are up, site's up -- what about the Python and > R artifacts? > I can also move the spark.apac