Re: Spark 3.0 preview release on-going features discussion

2019-09-23 Thread Xingbo Jiang
Thanks everyone, let me first work on the feature list and major changes that have already been finished in the master branch. Cheers! Xingbo Ryan Blue 于2019年9月20日周五 上午10:56写道: > I’m not sure that DSv2 list is accurate. We discussed this in the DSv2 > sync this week (just sent out the notes) a

Re: Spark 3.0 preview release on-going features discussion

2019-09-20 Thread Ryan Blue
I’m not sure that DSv2 list is accurate. We discussed this in the DSv2 sync this week (just sent out the notes) and came up with these items: - Finish TableProvider update to avoid another API change: pass all table config from metastore - Catalog behavior fix: https://issues.apache.org/j

Re: Spark 3.0 preview release on-going features discussion

2019-09-20 Thread Dongjoon Hyun
Thank you for the summarization, Xingbo. I also agree with Sean because I don't think those block 3.0.0 preview release. Especially, correctness issues should not be there. Instead, could you summarize what we have as of now for 3.0.0 preview? I believe JDK11 (SPARK-28684) and Hive 2.3.5 (SPARK-

Re: Spark 3.0 preview release on-going features discussion

2019-09-20 Thread Sean Owen
Is this a list of items that might be focused on for the final 3.0 release? At least, Scala 2.13 support shouldn't be on that list. The others look plausible, or are already done, but there are probably more. As for the 3.0 preview, I wouldn't necessarily block on any particular feature, though, y

Re: Spark 3.0 preview release on-going features discussion

2019-09-20 Thread Wenchen Fan
> New pushdown API for DataSourceV2 One correction: I want to revisit the pushdown API to make sure it works for dynamic partition pruning and can be extended to support limit/aggregate/... pushdown in the future. It should be a small API update instead of a new API. On Fri, Sep 20, 2019 at 3:46