Sorry, I am -1 because of SPARK-25454 which is a regression from 2.2.
Il giorno mer 19 set 2018 alle ore 03:45 Dongjoon Hyun <
dongjoon.h...@gmail.com> ha scritto:
> +1.
>
> I tested with `-Pyarn -Phadoop-2.7 -Pkinesis-asl -Phive
> -Phive-thriftserve` on OpenJDK(1.8.0_181)/CentOS 7.5.
>
> I hit t
I'd just document it as a known limitation and move on for now, until there
are enough end users that need this. Spark is also very powerful with UDFs
and end users can easily work around this using UDFs.
--
excuse the brevity and lower case due to wrist injury
On Tue, Sep 18, 2018 at 11:14 PM s
FYI: SPARK-23200 has been resolved.
On Tue, Sep 18, 2018 at 8:49 AM Felix Cheung
wrote:
> If we could work on this quickly - it might get on to future RCs.
>
>
>
> --
> *From:* Stavros Kontopoulos
> *Sent:* Monday, September 17, 2018 2:35 PM
> *To:* Yinan Li
> *Cc:*
+1.
I tested with `-Pyarn -Phadoop-2.7 -Pkinesis-asl -Phive -Phive-thriftserve`
on OpenJDK(1.8.0_181)/CentOS 7.5.
I hit the following test case failure once during testing, but it's not
persistent.
KafkaContinuousSourceSuite
...
subscribing topic by name from earliest offsets (failOn
If we could work on this quickly - it might get on to future RCs.
From: Stavros Kontopoulos
Sent: Monday, September 17, 2018 2:35 PM
To: Yinan Li
Cc: Xiao Li; eerla...@redhat.com; van...@cloudera.com.invalid; Sean Owen;
Wenchen Fan; dev
Subject: Re: [VOTE] SPAR
I like the notion of empowering cross platform bindings.
The trend of computing frameworks seems to be that all APIs gradually
converge on a stable attractor which could be described as "data frames and
SQL" Spark's early API design was RDD focused, but these days the center
of gravity is all abo
Thanks Marcelo to point out my gpg key issue! I've re-generated it and
uploaded to ASF spark repo. Let's see if it works in the next RC.
Thanks Saisai to point out the Python doc issue, I'll fix it in the next RC.
This RC fails because:
1. it doesn't include a Scala 2.12 build
2. the gpg key issu
Totally agree with you Dale, that there are situations for efficiency,
performance and better control/visibility/manageability that we need to expose
partition management.
So as described, I suggested two things - the ability to do it in the current
V2 API form via options and appropriate imple
Sorry but I am -1 because of what was reported here:
https://issues.apache.org/jira/browse/SPARK-22036?focusedCommentId=16618104&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-16618104
.
It is a regression unfortunately. Despite the impact is not huge and there
are