Re: Thoughts on dataframe cogroup?

2019-04-09 Thread Chris Martin
Thanks Bryan and Li, that is much appreciated. Hopefully should have the SPIP ready in the next couple of days. thanks, Chris On Mon, Apr 8, 2019 at 7:18 PM Bryan Cutler wrote: > Chirs, an SPIP sounds good to me. I agree with Li that it wouldn't be too > difficult to extend the currently f

Re: Thoughts on dataframe cogroup?

2019-04-15 Thread Chris Martin
). >> >> Li >> >> On Mon, Apr 15, 2019 at 8:20 AM wrote: >> >>> Hi, >>> >>> As promised I’ve raised SPARK-27463 for this. >>> >>> All feedback welcome! >>> >>> Chris >>> >>

Re: Thoughts on dataframe cogroup?

2019-04-15 Thread Chris Martin
> Li > > On Mon, Apr 15, 2019 at 3:58 PM Chris Martin > wrote: > >> I've updated the jira so that the main body is now inside a google doc. >> Anyone should be able to comment- if you want/need write access please drop >> me a mail and I can add you. >&

Re: Thoughts on dataframe cogroup?

2019-04-18 Thread Chris Martin
Also, this isn't really something new, RDD has cogroup function from very > early on. > > With that being said, I'd like to call out again for community's feedback > on the proposal. > > On Mon, Apr 15, 2019 at 4:57 PM Chris Martin > wrote: > >> Ah sorry- I

Hive Bucketing Support

2018-06-06 Thread Chris Martin
Hi All, first off apologies if this is not the correct place to ask this! I've been following SPARK-19256 (Hive Bucketing Support) with interest for some time now as we do a relatively large amount of our data processing in Spark but use Hive f