***UNCHECKED*** Re: [VOTE] SPARK 2.3.2 (RC6)

2018-09-18 Thread Marco Gaido
Sorry, I am -1 because of SPARK-25454 which is a regression from 2.2. Il giorno mer 19 set 2018 alle ore 03:45 Dongjoon Hyun < dongjoon.h...@gmail.com> ha scritto: > +1. > > I tested with `-Pyarn -Phadoop-2.7 -Pkinesis-asl -Phive > -Phive-thriftserve` on OpenJDK(1.8.0_181)/CentOS 7.5. > > I hit t

Re: [DISCUSS] upper/lower of special characters

2018-09-18 Thread Reynold Xin
I'd just document it as a known limitation and move on for now, until there are enough end users that need this. Spark is also very powerful with UDFs and end users can easily work around this using UDFs. -- excuse the brevity and lower case due to wrist injury On Tue, Sep 18, 2018 at 11:14 PM s

Re: [VOTE] SPARK 2.4.0 (RC1)

2018-09-18 Thread Yinan Li
FYI: SPARK-23200 has been resolved. On Tue, Sep 18, 2018 at 8:49 AM Felix Cheung wrote: > If we could work on this quickly - it might get on to future RCs. > > > > -- > *From:* Stavros Kontopoulos > *Sent:* Monday, September 17, 2018 2:35 PM > *To:* Yinan Li > *Cc:*

Re: [VOTE] SPARK 2.3.2 (RC6)

2018-09-18 Thread Dongjoon Hyun
+1. I tested with `-Pyarn -Phadoop-2.7 -Pkinesis-asl -Phive -Phive-thriftserve` on OpenJDK(1.8.0_181)/CentOS 7.5. I hit the following test case failure once during testing, but it's not persistent. KafkaContinuousSourceSuite ... subscribing topic by name from earliest offsets (failOn

Re: [VOTE] SPARK 2.4.0 (RC1)

2018-09-18 Thread Felix Cheung
If we could work on this quickly - it might get on to future RCs. From: Stavros Kontopoulos Sent: Monday, September 17, 2018 2:35 PM To: Yinan Li Cc: Xiao Li; eerla...@redhat.com; van...@cloudera.com.invalid; Sean Owen; Wenchen Fan; dev Subject: Re: [VOTE] SPAR

Re: Python friendly API for Spark 3.0

2018-09-18 Thread Erik Erlandson
I like the notion of empowering cross platform bindings. The trend of computing frameworks seems to be that all APIs gradually converge on a stable attractor which could be described as "data frames and SQL" Spark's early API design was RDD focused, but these days the center of gravity is all abo

Re: [VOTE] SPARK 2.4.0 (RC1)

2018-09-18 Thread Wenchen Fan
Thanks Marcelo to point out my gpg key issue! I've re-generated it and uploaded to ASF spark repo. Let's see if it works in the next RC. Thanks Saisai to point out the Python doc issue, I'll fix it in the next RC. This RC fails because: 1. it doesn't include a Scala 2.12 build 2. the gpg key issu

Re: [Discuss] Datasource v2 support for manipulating partitions

2018-09-18 Thread Thakrar, Jayesh
Totally agree with you Dale, that there are situations for efficiency, performance and better control/visibility/manageability that we need to expose partition management. So as described, I suggested two things - the ability to do it in the current V2 API form via options and appropriate imple

Re: [VOTE] SPARK 2.4.0 (RC1)

2018-09-18 Thread Marco Gaido
Sorry but I am -1 because of what was reported here: https://issues.apache.org/jira/browse/SPARK-22036?focusedCommentId=16618104&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-16618104 . It is a regression unfortunately. Despite the impact is not huge and there are