Re: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-18 Thread QP Hou
Thanks Wes for the confirmation. Yes, we only intend to keep extensions that won't get merged back to datafusion core in the contrib repo. Any code that we intend to go into the core will definitely still be developed within the ASF GH org. On Wed, Nov 17, 2021 at 3:29 PM Wes McKinney wrote: > >

Re: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-17 Thread Wes McKinney
Having a "community" contrib GitHub org outside of Apache sounds fine. If we want to move any packages into the Apache governance structure then we can conduct an IP clearance at that point. Since the term "DataFusion" doesn't have ASF trademark issues like "Arrow" does, we don't need to be as care

Re: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-15 Thread Andrew Lamb
Thank you QP Andrew On Sun, Nov 14, 2021 at 5:02 PM QP Hou wrote: > Thanks Jiayu, Benson, Micah and Andrew for your input on this. I have > created an unofficial Github org [1] as a quick and dirty experiment > for something like spark-packages.org. We should make it clear that > code developed

Re: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-14 Thread QP Hou
Thanks Jiayu, Benson, Micah and Andrew for your input on this. I have created an unofficial Github org [1] as a quick and dirty experiment for something like spark-packages.org. We should make it clear that code developed in this org will still need to go through the donation process in order to ge

Re: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-08 Thread Andrew Lamb
I think a separate non-ASF organization, with a central list of extensions like spark-packages.org sounds like a good idea to me. On Sun, Nov 7, 2021 at 1:34 PM Micah Kornfield wrote: > I'll preface this with not being an expert on these matters but this is my > impression. > > > > Therefore, I

Re: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-07 Thread Micah Kornfield
I'll preface this with not being an expert on these matters but this is my impression. > Therefore, I am proposing that we create an unofficial shared Github > organization to host these Datafusion contrib type projects that are > only maintained by non-PMC community members. I think as long as

Re: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-07 Thread Benson Muite
A community owned GitHub organization would be helpful. Maybe for all other Arrow related projects not just Datafusion. This would make them easier to find, and for community members to contribute. It could also include a listing of relevant projects elsewhere. On 11/7/21 9:40 AM, Jiayu Liu wr

RE: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-06 Thread Jiayu Liu
FWIW if there's a way to contribute code pertaining to datafusion I can contribute my version of Java bindings to it. IMO having a central place (instead of linking) for all bindings, 3rd libraries, etc. for datafusion would mean more synergy across different languages but I won't go as far as a m

[DISCUSS] Community maintained extension repos for Datafusion

2021-11-06 Thread QP Hou
Hi all, I would like to propose a new and more community friendly governance model for community contributed and maintained extensions for the datafusion project. Over the last year, many datafusion extensions have been proposed and created by the community including the java binding, s3 and hdfs