[VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 5.0.0 RC3

2021-08-10 Thread QP Hou
Hi, I would like to propose a release of Apache Arrow Datafusion Implementation, version 5.0.0. RC3 fixed a cargo publish issue discovered in RC1. This release candidate is based on commit: deb929369c9aaba728ae0c2c49dcd05bfecc8bf8 [1]. The proposed release tarball and signatures are hosted at [2]

Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 5.0.0 RC2

2021-08-10 Thread QP Hou
Looks like the commit is off; please ignore this vote while I prepare a new one. On Tue, Aug 10, 2021 at 10:28 PM QP Hou wrote: > > Hi, > > I would like to propose a release of Apache Arrow Datafusion Implementation, > version 5.0.0. > > Compared to RC1, RC2 fixed a cargo publish issue for ba

[VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 5.0.0 RC2

2021-08-10 Thread QP Hou
Hi, I would like to propose a release of Apache Arrow Datafusion Implementation, version 5.0.0. Compared to RC1, RC2 fixed a cargo publish issue for ballista crates. This release candidate is based on commit: 96658eb100436c47601ed10095d74299d2229020 [1]. The proposed release tarball and signature

Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 5.0.0 RC1

2021-08-10 Thread QP Hou
> 4. Verified the signatures using the commands below (not sure if the WARNING is something we should fix) I believe the warning is caused by my key not being signed by others in the network; I will get that fixed. Thank you, Andy, for the quick fix. I will send a voting thread for rc2 later tonigh

Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 5.0.0 RC1

2021-08-10 Thread Andy Grove
Here is a PR to resolve this: https://github.com/apache/arrow-datafusion/pull/852 On Tue, Aug 10, 2021 at 4:24 PM Andy Grove wrote: > Hi QP, > > This looks great overall but I did run into one issue. The Ballista crates > have relative paths for dependencies rather than depending on versioned >

Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 5.0.0 RC1

2021-08-10 Thread Andy Grove
Hi QP, This looks great overall but I did run into one issue. The Ballista crates have relative paths for dependencies rather than depending on versioned crates, so we would not be able to publish them to crates.io. $ find ballista -name Cargo.toml -exec grep -H "path" {} \; ballista/rust/schedu
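The check described in this message can also be sketched in Python. This is a minimal illustration that mirrors the `find ... -exec grep -H "path"` command quoted above, not the release tooling used by the project. The function name `find_path_dependencies` and the default `ballista` root in the example are assumptions made for illustration.

```python
import sys
from pathlib import Path


def find_path_dependencies(root):
    """Return (file, line) pairs for every Cargo.toml line containing "path".

    Like the grep in the quoted command, this is a plain substring match,
    so it can also report lines where "path" is not a dependency key.
    """
    hits = []
    for cargo_toml in sorted(Path(root).rglob("Cargo.toml")):
        text = cargo_toml.read_text(encoding="utf-8")
        for line in text.splitlines():
            if "path" in line:
                hits.append((str(cargo_toml), line.strip()))
    return hits


if __name__ == "__main__":
    # Hypothetical usage: scan the directory given on the command line,
    # defaulting to "ballista" as in the quoted command.
    root = sys.argv[1] if len(sys.argv) > 1 else "ballista"
    for file_name, line in find_path_dependencies(root):
        print(f"{file_name}:{line}")
```

Any line it reports would need a closer look before publishing, since the message says crates that depend on relative paths rather than versioned crates could not be published to crates.io.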

Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 5.0.0 RC1

2021-08-10 Thread Andrew Lamb
+1 (binding) I did the following: 1. Downloaded the tarball and ran `cargo test --all` on macOS (x86_64) 2. Verified the versions in Cargo.toml files for datafusion, ballista and python crates (some of the unpublished crates like datafusion-cli still say 4.0.0-SNAPSHOT) 3. Verified the shasum usin

[VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 5.0.0 RC1

2021-08-10 Thread QP Hou
Hi, I would like to propose a release of Apache Arrow Datafusion Implementation, version 5.0.0. This release candidate is based on commit: 96658eb100436c47601ed10095d74299d2229020 [1] The proposed release tarball and signatures are hosted at [2]. The changelog is located at [3]. Please download,

[WEBSITE] Hosting cookbook on arrow.apache.org

2021-08-10 Thread Weston Pace
After some discussion on [1] we are planning on publishing the Arrow cookbook at https://arrow.apache.org/cookbook . I figured I'd run this by the ML real quick before I started publishing. If you have any thoughts or comments feel free to add them here or on [1]. I'll also note that the ASF Inf

Re: [DISCUSS] Developing an "Arrow Compute IR [Intermediate Representation]" to decouple language front ends from Arrow-native compute engines

2021-08-10 Thread Wes McKinney
Thank you for all the feedback and comments on the document. I'm on vacation this week, so I'm delayed responding to everything, but I will get to it as quickly as I can. I will be at VLDB in Copenhagen next week if anyone would like to chat in person about it, and we can relay the content of any d

Re: [DISCUSS] Dropping support for Visual Studio 2015

2021-08-10 Thread Wes McKinney
+1 for dropping it also. On Mon, Aug 9, 2021 at 7:03 PM Keith Kraus wrote: > > +1 as well. Are there any build platforms that we're currently supporting > that still use vs2015? > > Conda-forge did its migration ~1.5 years ago: > https://github.com/conda-forge/conda-forge-pinning-feedstock/pull/50

Re: [DISCUSS] Binary Values in Key value pairs WAS: Re: [INFO_REQUEST][FLIGHT] - Dynamic schema changes in ArrowFlight streams

2021-08-10 Thread Wes McKinney
Ah, that's definitely a no-go then (I believe we verify messages unconditionally in C++). That's unfortunate (and I feel responsible for missing this, but I suppose we had a lot of opportunities to fix it prior to the 1.0.0 format version) — so to have actual binary values (which was the intention

Re: [DISCUSS] Developing an "Arrow Compute IR [Intermediate Representation]" to decouple language front ends from Arrow-native compute engines

2021-08-10 Thread Dimitri Vorona
Hi Wes, cool initiative! Reminded me of "Building Advanced SQL Analytics From Low-Level Plan Operators" from SIGMOD 2021 (http://db.in.tum.de/~kohn/papers/lolepops-sigmod21.pdf), which proposes a set of building blocks for advanced aggregation. Cheers, Dimitri. On Thu, Aug 5, 2021 at 7:59 PM Juli

[RUST][DISCUSS] Timestamp Arithmetic in arrow-rs

2021-08-10 Thread Andrew Lamb
Hello, I wanted to point out a proposal [1] for how to handle timestamp arithmetic in the arrow-rs library. It involves the potential addition of new traits to the library, so I think additional feedback from the community on the approach, or any concerns, would be valuable. Thank you, Andrew [1]