Re: [build system] emergency jenkins restart in ~10 minutes

2017-05-22 Thread shane knapp
this is done, and we're building again. On Mon, May 22, 2017 at 1:45 PM, shane knapp wrote: > last night i accidentally upgraded a bunch of plugins, which ended up > breaking alluxio's release pipeline. > > to that end, i've downgraded the artifactory plugin and need to do an > emergency restart.

[build system] emergency jenkins restart in ~10 minutes

2017-05-22 Thread shane knapp
last night i accidentally upgraded a bunch of plugins, which ended up breaking alluxio's release pipeline. to that end, i've downgraded the artifactory plugin and need to do an emergency restart. this will be quick, and will happen in about 10 minutes. sorry for all of the flakiness recently, s

Re: [Spark SQL] ceil and floor functions on doubles

2017-05-22 Thread Vadim Semenov
Yes, it was done on purpose to match the behavior of Hive ( https://issues.apache.org/jira/browse/SPARK-10865). And I believe Hive returns `Long`s because they adopted the definition used in MySQL (https://issues.apache.org/jira/browse/HIVE-615). On Fri, May 19, 2017 at 10:51 AM, Anton Okolnychyi

Re: [VOTE] Apache Spark 2.2.0 (RC2)

2017-05-22 Thread Michael Armbrust
I'm waiting for SPARK-20814 at Marcelo's request and I'd also like to include SPARK-20844 . I think we should be able to cut another RC midweek. On Fri, May 19, 2017 at 11:53 AM, Nick Pentreath

Re: Parquet vectorized reader DELTA_BYTE_ARRAY

2017-05-22 Thread Ryan Blue
Michael is right, the delta byte array encoding is a Parquet v2 feature. Parquet v2 isn't finished yet, though some features are in releases and those features will be supported in future releases. In other words, Parquet will maintain backward-compatibility for any released v2 features. I don't r

Re: Parquet vectorized reader DELTA_BYTE_ARRAY

2017-05-22 Thread Michael Allman
Hi AndreiL, Were these files written with the Parquet V2 writer? The Spark 2.1 vectorized reader does not appear to support that format. Michael > On May 9, 2017, at 11:04 AM, andreiL wrote: > > Hi, I am getting an exception in Spark 2.1 reading parquet files where some > columns are DELTA_B

Re: Method for gracefully terminating a driver on a standalone master in Spark 2.1+

2017-05-22 Thread Michael Allman
As I cannot find a way to gracefully kill an app which takes longer than 10 seconds to shut down, I have reported this issue as a bug: https://issues.apache.org/jira/browse/SPARK-20843 Michael > On May 4, 2017, at 4:15 PM, Michael Allman wro

Re: [VOTE] Apache Spark 2.2.0 (RC2)

2017-05-22 Thread MKesselaers
Hi, When would the next vote for RC take place? Thanks, Maarten > On 19 May 2017, at 20:53, Nick Pentreath wrote: > > All the outstanding ML QA doc and user guide items are done for 2.2 so from > that side we should be good to cut another RC :) > > On Thu, 18 May 2017 at 00:18 Russell Spitzer

Re: Starter tasks to start contributing

2017-05-22 Thread Yan Facai
Hi, I think that starter label is useful for you. How about this link: https://issues.apache.org/jira/browse/SPARK-5?jql= project%20=%20SPARK%20%20AND%20component%20in%20%20("Spark% 20Core",%20%20"Structured%20Streaming")%20AND%20status% 20=%20Open%20AND%20labels%20=%20starter%20ORDER%20BY%20

Re: Spark madness

2017-05-22 Thread Jonathan Winandy
Hi Saikat, You may use the wrong mailing list for your question (=> spark user). If you want to make a single string, it's : red.collect.mkString("\n") Be careful of driver explosion ! Cheers, Jonathan On Fri, 19 May 2017, 05:21 Saikat Kanjilal, wrote: > One additional point, the following l

#confused:Whether we should use more specific exception #

2017-05-22 Thread Allianzcortex
Hello,well...What I ask may be a very simple question,but still trouble me.. Before I ask the question,I have read the http://spark.apache.org/contributing.html , And search for the mailing list,But still not sure^_^. The question is like that: In org.apache.spark.streaming.dstream of S