Re: [VOTE] Release Apache Arrow 8.0.0 - RC3

2022-05-04 Thread Bryan Cutler
+1 (non-binding) I ran: TEST_DEFAULT=0 TEST_INTEGRATION_CPP=1 TEST_INTEGRATION_JAVA=1 ARROW_GANDIVA=OFF ARROW_PLASMA=OFF dev/release/verify-release-candidate.sh 8.0.0 3 On Wed, May 4, 2022 at 3:23 PM Sutou Kouhei wrote: > +1 > > I ran the following on Debian GNU/Linux sid: > > * TEST_DEFAULT

Re: [ANNOUNCE] New Arrow committer: Liang-Chi Hsieh

2022-05-04 Thread L. C. Hsieh
Thank you, Daniël! On Wed, May 4, 2022 at 10:02 PM Daniël Heres wrote: > > Congrats! > > On Thu, May 5, 2022 at 06:48, L. C. Hsieh wrote: > > > Thank you, Jorge! > > > > On Fri, Apr 29, 2022 at 10:42 PM Jorge Cardoso Leitão > > wrote: > > > > > > Congratulations, great work! > > > > > > On Sat, A

Re: [ANNOUNCE] New Arrow committer: Liang-Chi Hsieh

2022-05-04 Thread Daniël Heres
Congrats! On Thu, May 5, 2022 at 06:48, L. C. Hsieh wrote: > Thank you, Jorge! > > On Fri, Apr 29, 2022 at 10:42 PM Jorge Cardoso Leitão > wrote: > > > > Congratulations, great work! > > > > On Sat, Apr 30, 2022 at 3:30 AM L. C. Hsieh wrote: > > > > > Thanks all! > > > > > > On Fri, Apr 29, 2022

Re: [ANNOUNCE] New Arrow committer: Liang-Chi Hsieh

2022-05-04 Thread L. C. Hsieh
Thank you, Jorge! On Fri, Apr 29, 2022 at 10:42 PM Jorge Cardoso Leitão wrote: > > Congratulations, great work! > > On Sat, Apr 30, 2022 at 3:30 AM L. C. Hsieh wrote: > > > Thanks all! > > > > On Fri, Apr 29, 2022 at 7:19 PM Yijie Shen > > wrote: > > > > > > Congrats Liang-Chi! > > > > > > > >

Re: [Rust] Enable GitHub discussions for Rust projects?

2022-05-04 Thread Jacques Nadeau
No vote here, but a little feedback. We've generally found GitHub Discussions somewhat lacking in Substrait. If other people find it good, great. I might be more inclined to just drive people to something like StackOverflow or the mailing list. We were initially quite enthusiastic but the experience

Re: RFC: Out of Process Python UDFs in Arrow Compute

2022-05-04 Thread Weston Pace
> However, if I > understand correctly, the UDFs implemented in the PR above are still > "composition of existing C++ kernels" instead of > "arbitrary pandas/numpy" code, so it kind of resolves a > different problem IMO. That is not a correct understanding, though it is an understandable one as we

Re: [Rust] Enable GitHub discussions for Rust projects?

2022-05-04 Thread Andrew Lamb
I am also +1 on the idea of enabling discussions on GitHub. I personally don't think this needs a formal vote of PMC members, but rather that Apache Infra wants to make sure the topic was discussed in the community prior to enabling the feature. Thank you Andy for bringing this up, Andrew On Wed

Re: [VOTE] Release Apache Arrow 8.0.0 - RC3

2022-05-04 Thread Sutou Kouhei
+1 I ran the following on Debian GNU/Linux sid: * TEST_DEFAULT=0 \ TEST_SOURCE=1 \ LANG=C \ TZ=UTC \ ARROW_CMAKE_OPTIONS="-DBoost_NO_BOOST_CMAKE=ON -DCUDAToolkit_ROOT=/usr" \ dev/release/verify-release-candidate.sh 8.0.0 3 * TEST_DEFAULT=0 \ TEST_APT=1 \

Re: RFC: Out of Process Python UDFs in Arrow Compute

2022-05-04 Thread Li Jin
Weston - Yes, I have seen the pull request above (very cool). However, if I understand correctly, the UDFs implemented in the PR above are still "composition of existing C++ kernels" instead of "arbitrary pandas/numpy" code, so it kind of resolves a different problem IMO. For example, if I want to c

Re: RFC: Out of Process Python UDFs in Arrow Compute

2022-05-04 Thread Weston Pace
Hi Li, have you seen the Python UDF prototype that we just recently merged into the execution engine at [1]? It adds support for scalar UDFs. Comparing your proposal to what we've done so far I would ask: 1. Why do you want to run these UDFs in a separate process? Is this for robustness (if th

Re: [Rust] Enable GitHub discussions for Rust projects?

2022-05-04 Thread Gavin Ray
How does voting on ASF mailing lists work? I assume random people don't get votes. If so, consider this email an informal voice of support -- otherwise +1 from me =) On Wed, May 4, 2022 at 11:40 AM Matthew Turner wrote: > +1 on enabling GitHub discussions for both arrow-rs and datafusion. I > t

RFC: Out of Process Python UDFs in Arrow Compute

2022-05-04 Thread Li Jin
Hello, I have a somewhat controversial idea to introduce a "bridge" solution for Python UDFs in Arrow Compute and have written up my thoughts in this proposal: https://docs.google.com/document/d/1s7Gchq_LoNuiZO5bHq9PZx9RdoCWSavuS58KrTYXVMU/edit?usp=sharing I am curious to hear what the community t

Re: [VOTE] Release Apache Arrow 8.0.0 - RC3

2022-05-04 Thread Jacob Wujciak
+1 (non-binding) Verified on Windows 10. Verified source, integration and binaries on Ubuntu 20.04 with dev/release/verify-release-candidate.sh 8.0.0 3 On Tue, May 3, 2022 at 10:08 PM Krisztián Szűcs wrote: > Hi, > > I would like to propose the following release candidate (RC3) of Apache > Arro

Re: [DISC] (Java) Add Windows binaries to Maven packages

2022-05-04 Thread Larry White
I updated the link and opened Jira tickets for the work On Wed, May 4, 2022 at 11:21 AM Alessandro Molina < alessan...@ursacomputing.com> wrote: > The proposal seems reasonable to me, we should do our best at providing > users the same experience on the various systems whenever possible. > > As l

RE: [Rust] Enable GitHub discussions for Rust projects?

2022-05-04 Thread Matthew Turner
+1 on enabling GitHub discussions for both arrow-rs and datafusion. I think there is a lot of value in distinguishing actual "issues" from questions / conversations. I believe this would also complement the datafusion site which doesn't have any type of forum for conversations. -Original

Re: [DISC] (Java) Add Windows binaries to Maven packages

2022-05-04 Thread David Li
Netty (one of the Arrow dependencies) already has per-OS JARs (though those deps are optional). I would also be slightly in favor of doing this, so long as the way to use them is well documented. Netty also splits its native code among different dependencies since there is some common code, is t

[Rust] Enable GitHub discussions for Rust projects?

2022-05-04 Thread Andy Grove
We have a request [1] to enable GitHub discussions for DataFusion. Personally, I am in favor of doing this for DataFusion as well as arrow-rs. We need to file an infra ticket to get this enabled and have to provide a link to "consensus discussion thread" [2] so I would like to gather opinions here

Re: [DISC] (Java) Add Windows binaries to Maven packages

2022-05-04 Thread Antoine Pitrou
On 04/05/2022 at 17:21, Alessandro Molina wrote: The proposal seems reasonable to me, we should do our best at providing users the same experience on the various systems whenever possible. As long as we don't receive complaints about the package size, I think we can live with it. If it becom

Re: [DISC] (Java) Add Windows binaries to Maven packages

2022-05-04 Thread Alessandro Molina
The proposal seems reasonable to me, we should do our best at providing users the same experience on the various systems whenever possible. As long as we don't receive complaints about the package size, I think we can live with it. If it becomes a problem for our users, we can always make per-syst

Re: [Rust] DataFusion 8.0.0 release

2022-05-04 Thread Andy Grove
Two issues came up during prep work for this release that I would appreciate some feedback on: - There is a circular dependency between different versions of the datafusion crates [1] - It looks like we have some inconsistencies in the way crates are organized in the repo [2] Thanks, Andy. [1]

RE: [VOTE] Release Apache Arrow 8.0.0 - RC3

2022-05-04 Thread Yibo Cai
+1. Verified cpp/python/go source and apt binaries on Ubuntu 20.04, aarch64. TEST_DEFAULT=0 TEST_CPP=1 TEST_PYTHON=1 TEST_GO=1 dev/release/verify-release-candidate.sh 8.0.0 3 TEST_DEFAULT=0 TEST_APT=1 dev/release/verify-release-candidate.sh 8.0.0 3 -Original Message- From: Krisztián Szűc