Re: My curation of pending structured streaming PRs to review

2019-08-18 Thread Jungtaek Lim
As a reminder, the list contains two correctness bugs: stream-stream outer join, and multiple stateful operations with watermark. Regarding common theme, yes that's somewhat I'd rather avoid to say, but honestly I feel there's shortage on active committers on 'structured streaming'. Many of them

Re: Release Spark 2.3.4

2019-08-18 Thread Saisai Shao
+1 Wenchen Fan 于2019年8月19日周一 上午10:28写道: > +1 > > On Sat, Aug 17, 2019 at 3:37 PM Hyukjin Kwon wrote: > >> +1 too >> >> 2019년 8월 17일 (토) 오후 3:06, Dilip Biswal 님이 작성: >> >>> +1 >>> >>> Regards, >>> Dilip Biswal >>> Tel: 408-463-4980 >>> dbis...@us.ibm.com >>> >>> >>> >>> - Original message --

Re: Release Spark 2.3.4

2019-08-18 Thread Wenchen Fan
+1 On Sat, Aug 17, 2019 at 3:37 PM Hyukjin Kwon wrote: > +1 too > > 2019년 8월 17일 (토) 오후 3:06, Dilip Biswal 님이 작성: > >> +1 >> >> Regards, >> Dilip Biswal >> Tel: 408-463-4980 >> dbis...@us.ibm.com >> >> >> >> - Original message - >> From: John Zhuge >> To: Xiao Li >> Cc: Takeshi Yamamur

RE: Release Spark 2.3.4

2019-08-18 Thread Kazuaki Ishizaki
Hi all, Thank you. I will prepare RC for 2.3.4 this week in parallel. It will be in parallel with RC for 2.4.4 managed by Dongjoon. Regards, Kazuaki Ishizaki From: Hyukjin Kwon To: Dilip Biswal Cc: jzh...@apache.org, dev , Kazuaki Ishizaki , Takeshi Yamamuro , Xiao Li Date: 20

Aggregate pushdown for data source

2019-08-18 Thread Arun Khetarpal
Hi Folks: I have implemented a data source v2 API for an internal source. As a consequence of generating the data source, we have bunch of statistical information about the source which i can potentially use, only if spark pushes down the aggregates down to the data source itself. I see that ther