hi Andy -- yes. I'll send out the vote thread shortly
On Tue, Jan 22, 2019 at 8:27 AM Andy Grove <andygrov...@gmail.com> wrote: > > Wes, > > With the 0.12 release out, could we now start the vote for the DataFusion > donation? > > Thanks, > > Andy. > > On Tue, Jan 15, 2019 at 8:16 AM Andy Grove <andygrov...@gmail.com> wrote: > > > Wes, > > > > I went ahead and created a JIRA ( > > https://issues.apache.org/jira/browse/ARROW-4263) and PR ( > > https://github.com/apache/arrow/pull/3399) for the donation so it is > > ready to go if the vote passes. > > > > Thanks, > > > > Andy. > > > > On Mon, Jan 14, 2019 at 1:57 PM Andy Grove <andygrov...@gmail.com> wrote: > > > >> Wes, > >> > >> Thanks. Yes, I'd like to proceed with the vote as soon as you are ready. > >> > >> I don't think I need much time at all at this point to prepare the merge. > >> I already have a branch of DataFusion that is building against the latest > >> Arrow code, so it's really just a case of updating source files with the > >> correct license headers and updating the README. I will start on this > >> tonight. > >> > >> Thanks, > >> > >> Andy. > >> > >> > >> > >> On Mon, Jan 14, 2019 at 1:16 PM Wes McKinney <wesmck...@gmail.com> wrote: > >> > >>> Getting the 0.12 release out is my priority right now, but it seems > >>> that there are no major objections to this code donation. > >>> > >>> @Andy -- I can kick off the vote to accept the code donation in the > >>> next few days if you'd like to proceed with that. How much time do you > >>> think it would take for you to ready the merge? > >>> > >>> Thanks, > >>> Wes > >>> > >>> On Wed, Jan 9, 2019 at 8:28 AM Andy Grove <andygrov...@gmail.com> wrote: > >>> > > >>> > Wes, > >>> > > >>> > Thanks. This sounds great. > >>> > > >>> > Andy. > >>> > > >>> > On Tue, Jan 8, 2019 at 8:28 AM Wes McKinney <wesmck...@gmail.com> > >>> wrote: > >>> > > >>> > > hi Andy -- I'm supportive of the code donation. I see building > >>> > > in-memory, embeddable analytics and query processing as the natural > >>> > > next stage of this project. As I have described on this mailing list, > >>> > > I intend to work on this with my colleagues in C++ with the goal of > >>> > > making such functionality available at least in C, Python, R, and > >>> > > Ruby. I see no reason why such work should be exclusive to C++. > >>> > > > >>> > > Rust seems like a reasonable implementation language for this, and > >>> > > given growing interest in the language, I think it will help grow the > >>> > > Arrow community. > >>> > > > >>> > > I'd like to wait a few more days to allow others to weigh in, but we > >>> > > could conduct a vote about accepting the code donation as early as > >>> > > next week. We would need to go through the ASF IP Clearance process > >>> > > after that. So the entire procedural process would take about 6 days, > >>> > > assuming that there are no licensing issues and the code will be > >>> ready > >>> > > to merge into the Arrow codebase. > >>> > > > >>> > > Thanks > >>> > > Wes > >>> > > > >>> > > On Tue, Jan 8, 2019 at 9:07 AM Neville Dipale <nevilled...@gmail.com > >>> > > >>> > > wrote: > >>> > > > > >>> > > > Hi Andy, > >>> > > > > >>> > > > I can't comment on the voting process, but regarding the addition > >>> of > >>> > > > DataFusion: > >>> > > > > >>> > > > I support the idea to donate the code, mainly as I think that will > >>> help > >>> > > us > >>> > > > accelerate some work on Rust. Out of curiousity, I've been > >>> prototying a > >>> > > > 'Rust dataframe' abstraction which (can/will) have various scalar, > >>> > > > aggregation, array and window functions. > >>> > > > > >>> > > > I'm doing this trying to put on the hat of someone wanting to use > >>> Rust in > >>> > > > their binary or library. I'm already finding some things that > >>> might be > >>> > > > *core* but are still not yet implemented. The presence of > >>> array_ops is > >>> > > also > >>> > > > helpful because in addition to an efficient in-memory rep of data, > >>> they > >>> > > > enable one to do some basic data manipulation on such data. > >>> > > > > >>> > > > Having DataFusion added to Arrow could help fill some gaps in our > >>> > > codebase; > >>> > > > and I'm willing to work there. > >>> > > > > >>> > > > Regards > >>> > > > Neville > >>> > > > > >>> > > > On Tue, 8 Jan 2019 at 16:14, Andy Grove <andygrov...@gmail.com> > >>> wrote: > >>> > > > > >>> > > > > Bumping this thread ... I know everyone is busy with getting the > >>> 0.12 > >>> > > > > release out, but would be good to know the process for raising > >>> this > >>> > > for a > >>> > > > > vote. However, given the lack of comments on this thread I'm > >>> starting > >>> > > to > >>> > > > > suspect that maybe there isn't much of an appetite for this, > >>> which is > >>> > > fine, > >>> > > > > but would be good to find out for sure. > >>> > > > > > >>> > > > > Thanks, > >>> > > > > > >>> > > > > Andy. > >>> > > > > > >>> > > > > On Mon, Jan 7, 2019 at 1:03 PM Andy Grove <andygrov...@gmail.com > >>> > > >>> > > wrote: > >>> > > > > > >>> > > > > > Thanks, Ted! > >>> > > > > > > >>> > > > > > I wish I'd been a bit more specific about my ask in the > >>> original > >>> > > email... > >>> > > > > > I guess my question (for Wes?) is what is the process to raise > >>> this > >>> > > for a > >>> > > > > > vote? > >>> > > > > > > >>> > > > > > Andy. > >>> > > > > > > >>> > > > > > > >>> > > > > > > >>> > > > > > On Sun, Jan 6, 2019 at 2:59 PM Ted Dunning < > >>> ted.dunn...@gmail.com> > >>> > > > > wrote: > >>> > > > > > > >>> > > > > >> Cool! > >>> > > > > >> > >>> > > > > >> > >>> > > > > >> > >>> > > > > >> On Sun, Jan 6, 2019 at 1:52 PM Andy Grove < > >>> andygrov...@gmail.com> > >>> > > > > wrote: > >>> > > > > >> > >>> > > > > >> > I'm starting a new thread for this discussion (this was > >>> previously > >>> > > > > >> > discussed in the Rust Roadmap thread). > >>> > > > > >> > > >>> > > > > >> > The reason I got involved with Arrow is that I have been > >>> working > >>> > > on > >>> > > > > >> > DataFusion[1] which is currently an in-process SQL query > >>> engine > >>> > > on top > >>> > > > > >> of > >>> > > > > >> > Arrow. It allows queries to be executed against the Arrow > >>> CSV > >>> > > reader > >>> > > > > >> (and > >>> > > > > >> > will shortly support the Arrow Parquet reader too) and > >>> presents > >>> > > > > results > >>> > > > > >> as > >>> > > > > >> > a sequence of RecordBatch instances. > >>> > > > > >> > > >>> > > > > >> > I would like to donate this code to the Arrow project so > >>> that > >>> > > Arrow > >>> > > > > has > >>> > > > > >> a > >>> > > > > >> > Rust-native query execution engine built in and to > >>> accelerate > >>> > > > > >> development > >>> > > > > >> > of this capability. > >>> > > > > >> > > >>> > > > > >> > I have a fairly detailed roadmap[2] in mind for the project > >>> and it > >>> > > > > could > >>> > > > > >> > eventually become a standalone project potentially (under > >>> ASF > >>> > > still). > >>> > > > > >> > > >>> > > > > >> > I don't know what the process is to vote on this, so wanted > >>> to > >>> > > discuss > >>> > > > > >> that > >>> > > > > >> > in this thread first. > >>> > > > > >> > > >>> > > > > >> > References: > >>> > > > > >> > > >>> > > > > >> > [1] DataFusion: https://github.com/andygrove/datafusion > >>> > > > > >> > [2] Roadmap: > >>> > > > > >> > > >>> https://github.com/andygrove/datafusion/blob/master/ROADMAP.md > >>> > > > > >> > > >>> > > > > >> > Thanks, > >>> > > > > >> > > >>> > > > > >> > Andy. > >>> > > > > >> > > >>> > > > > >> > >>> > > > > > > >>> > > > > > >>> > > > >>> > >>