Re: [VOTE] Release Apache Arrow 17.0.0 - RC2

2024-07-15 Thread Sutou Kouhei
+1 (binding) I ran the followings on Debian GNU/Linux sid: * TEST_DEFAULT=0 \ TEST_SOURCE=1 \ LANG=C \ TZ=UTC \ CUDAToolkit_ROOT=/usr \ ARROW_CMAKE_OPTIONS="-DBoost_NO_BOOST_CMAKE=ON -Dxsimd_SOURCE=BUNDLED" \ dev/release/verify-release-candidate.sh 17.0.0 2

Re: [DISCUSS][C++] Empty directory marker creation in S3FileSystem

2024-07-15 Thread Aldrin
And to clarify, by "other clients" I mean "other remote clients on other systems concurrently accessing the same data." I still think that many cients on a single system could use a local filesystem to gate directory-based operations more efficiently (since a local filesystem is optimized for t

Re: [DISCUSS][C++] Empty directory marker creation in S3FileSystem

2024-07-15 Thread Aldrin
Thanks Antoine! Preserving the property across multiple clients (and presumably across independent sessions of the same client) is the part that I was missing. >From the link you shared, I saw an aws page discussing the use of folders in >the s3 console [1]. Their approach is to create the mark

Re: [DISCUSS][C++] Empty directory marker creation in S3FileSystem

2024-07-15 Thread Antoine Pitrou
No, because these markers also communicate the information to other implementations of S3 abstractions. An example of this is: https://docs.cyberduck.io/protocols/s3/#folders Regards Antoine. Le 13/07/2024 à 07:15, Aldrin a écrit : ...then I still expect the directory /foo to exist Rig

Re: [VOTE] Release Apache Arrow 17.0.0 - RC2

2024-07-15 Thread Raúl Cumplido
+1 (binding) I have tested both: TEST_DEFAULT=0 TEST_SOURCE=1 TEST_BINARIES=0 dev/release/verify-release-candidate.sh 17.0.0 2 and TEST_DEFAULT=0 TEST_SOURCE=0 TEST_BINARIES=1 dev/release/verify-release-candidate.sh 17.0.0 2 On: * Ubuntu 22.04 LTS * Python 3.10.12 * gcc (Ubuntu 11.4.0-1ubu

Re: [VOTE] Release Apache Arrow 17.0.0 - RC2

2024-07-15 Thread Neal Richardson
Adam, Yes, I saw that when it was introduced here, on a PR I did: https://github.com/apache/arrow/runs/25894658283 It's on the smallest scale factor and on two of the queries ( https://github.com/voltrondata-labs/arrowbench/blob/main/R/tpch-queries.R#L510-L535 and https://github.com/voltrondata-la

Re: [VOTE] Release Apache Arrow 17.0.0 - RC2

2024-07-15 Thread Adam Lippai
Do I read it correctly that there is tpch regression in the R benchmark? Best regards, Adam Lippai On Mon, Jul 15, 2024 at 06:05 Fokko Driesprong wrote: > Thanks to everyone who contributed to the new release! > > +1 (non-binding) > > I've tested against PyIceberg >

Re: [VOTE] Release Apache Arrow 17.0.0 - RC2

2024-07-15 Thread Fokko Driesprong
Thanks to everyone who contributed to the new release! +1 (non-binding) I've tested against PyIceberg . Kind regards, Fokko Op ma 15 jul 2024 om 10:29 schreef David Li : > +1 (binding) > > Tested on Debian 12/x86_64 > > On Mon, Jul 15, 2024, a

Re: [VOTE] Release Apache Arrow 17.0.0 - RC2

2024-07-15 Thread David Li
+1 (binding) Tested on Debian 12/x86_64 On Mon, Jul 15, 2024, at 15:31, Gang Wu wrote: > +1 (non-binding) > > Verified C++ on my M1 Mac by running: > - TEST_DEFAULT=0 TEST_CPP=1 ./verify-release-candidate.sh 17.0.0 2 > > BTW, I ran into this issue as well: > https://github.com/apache/arrow/issues