Re: Help regarding setting up the r package in arrow apache

2023-10-16 Thread Benson Muite
Dpending on context, this message maybe for the users list: https://arrow.apache.org/community/ Consider examining the CI files: https://github.com/apache/arrow/blob/main/.github/workflows/r.yml An alternative to docker is Guile: https://packages.guix.gnu.org/packages/r-arrow/13.0.0.1/ On 10/16/

Re: [DISCUSS] Blog on improved DataFusion Grouping in 28.0.0

2023-08-05 Thread Benson Muite
On 8/5/23 21:51, Andrew Lamb wrote: > I propose posting a blog article by myself, Daniël Heres, and Raphael > Tustvold-Davies wrote about datafusion grouping performance [1] > > This content was originally published on the InfluxData blog[2] but we > would like to repost it on the Arrow site blog

Re: Apache Arrow | Graph Algorithms & Data Structures

2023-06-29 Thread Benson Muite
On 6/30/23 04:21, Bechir Ben Daadouch wrote: > Dear Apache Arrow Dev Community, > > My name is Bechir, I am currently working on a project that involves > implementing graph algorithms in Apache Arrow. > > The initial plan was to construct a node structure and a subsequent graph > that would enco

Re: [DISCUSS] Migrate s390x from Travis to ASF Jenkins

2023-04-21 Thread Benson Muite
al email. > > Thanks! I thought I added the Go one too :) > >>>> On Thu, Apr 20, 2023 at 2:42 PM Benson Muite >>>> wrote: >>>> >>>>> Might also consider testing farm for Centos Stream, Fedora and/or RHEL >>>>> builds[1][2].

Re: [DISCUSS] Migrate s390x from Travis to ASF Jenkins

2023-04-20 Thread Benson Muite
Might also consider testing farm for Centos Stream, Fedora and/or RHEL builds[1][2]. 1) https://docs.testing-farm.io/general/0.1/test-environment.html 2) https://fedoramagazine.org/test-github-projects-with-github-actions-and-testing-farm/ On 4/20/23 19:43, Antoine Pitrou wrote: > > Hi Raul, >

Re: New Pandas-Apache repo

2023-01-22 Thread Benson Muite
On 1/22/23 13:15, Adesola Adedewe wrote: > i'm working on a project where big financial data needs to be loaded stored > and manipulated. the data is stored as parquet. my initial version had > arrow just load the parquet data and i used the basic unorderedmap but this > limited me to only one data

Re: New Pandas-Apache repo

2023-01-22 Thread Benson Muite
On 1/22/23 11:41, Adesola Adedewe wrote: > The project was initially meant to provide a simpler interface over arrow > apache so pretty much what was done with the python api, but it has > evolved to be more than that ,with indexing and other panda operations > implemented like reindex, resample, c

Re: New Pandas-Apache repo

2023-01-22 Thread Benson Muite
On 1/22/23 06:23, Adesola Adedewe wrote: > okay thanks for your consideration. > > On Sat, Jan 21, 2023 at 4:49 PM Sutou Kouhei wrote: > >> Hi, >> >> I'm not sure pandas like API is suitable for our official >> data frame API. >> >> FYI: >> >> * GitHub issue of this: >> https://github.com/ap

Re: [VOTE] Release Apache Arrow ADBC 0.1.0 - RC6

2023-01-09 Thread Benson Muite
Non-binding +1 Fedora Rawhide, aarch64: TEST_APT=0 TEST_YUM=0 ./dev/release/verify-release-candidate.sh 0.1.0 6 Using go installed by the script. Script hung when using go in repositories. Ruby gems can be installed using gem install --user-local red-arrow which does not require sudo permissions,

Re: Arrow sync call January 4 at 12:00 US/Eastern, 17:00 UTC

2023-01-06 Thread Benson Muite
On 1/7/23 05:54, Ian Cook wrote: >> If a Google Doc is used, can it be configured to send out notifications of > the summary to the list? > > Not as far as I know, but I think we can continue to send a copy of the > notes to the mailing list after each biweekly meeting, copied and pasted > from th

Re: Arrow sync call January 4 at 12:00 US/Eastern, 17:00 UTC

2023-01-06 Thread Benson Muite
> Proposal to move sync call meeting notes into a Google Doc > > - Will proposed that we share notes from sync calls in a publicly > viewable Google Doc instead of in emails to the mailing list [2] > - There was a discussion about whether managing edit access to this > Google Doc would be diffic

Re: [ANNOUNCE] New Arrow PMC chair: Andrew Lamb

2022-12-26 Thread Benson Muite
Congratulations! On 12/27/22 05:44, Yibo Cai wrote: Congratulations! -Original Message- From: Rok Mihevc Sent: Tuesday, December 27, 2022 7:57 AM To: dev@arrow.apache.org Subject: Re: [ANNOUNCE] New Arrow PMC chair: Andrew Lamb Congratulations Andrew! Rok On Mon, Dec 26, 2022 at 11:2

Re: Current state of using GitHub issues for Arrow

2022-12-08 Thread Benson Muite
bsite issues to a different repo than documentation issues. Regards Antoine. Le 08/12/2022 à 11:50, Joris Van den Bossche a écrit : On Tue, 6 Dec 2022 at 08:41, Benson Muite wrote: For sure the exact workflows will still be further refined while starting to use this. And if there are t

Re: Current state of using GitHub issues for Arrow

2022-12-05 Thread Benson Muite
For sure the exact workflows will still be further refined while starting to use this. And if there are things missing or unclear in the current practices around how to handle GitHub issues or any other feedback or ideas, this thread is yours! Maybe helpful to also update website bot: https://

Re: [ANNOUNCE] New Arrow committer: Raúl Cumplido

2022-12-05 Thread Benson Muite
On 12/6/22 05:53, Sutou Kouhei wrote: On behalf of the Arrow PMC, I'm happy to announce that Raúl Cumplido has accepted an invitation to become a committer on Apache Arrow. Welcome, and thank you for your contributions! Congratulations Raúl

Re: [DISCUSS] Maintenance policy

2022-11-23 Thread Benson Muite
On 10/19/22 20:47, Will Jones wrote: One particular type of defect we might want to consider backporting to supported versions are ones that silently produce incorrect data. Unlike ones that cause a crash, it's not easy for a user to know they are affected. Here are a few examples: * ARROW-17

Re: [ANNOUNCE] New Arrow committer: Curtis Vogt

2022-11-03 Thread Benson Muite
Congratulations On 11/4/22 01:29, Vibhatha Abeykoon wrote: Congratulations On Thu, Nov 3, 2022 at 7:09 PM Rok Mihevc wrote: Congratulations! On Thu, Nov 3, 2022 at 12:31 AM David Li wrote: Welcome, Curtis! On Tue, Nov 1, 2022, at 17:14, Sutou Kouhei wrote: On behalf of the Arrow PMC, I'

Re: [ANNOUNCE] New Arrow committer: Yang Jiang

2022-11-03 Thread Benson Muite
Congratulations! On 11/3/22 21:09, Percy Camilo Triveño Aucahuasi wrote: Congratulations! On Thu, Nov 3, 2022 at 8:39 AM Rok Mihevc wrote: Congrats! On Thu, Nov 3, 2022 at 2:27 PM Weston Pace wrote: Congratulations On Thu, Nov 3, 2022, 6:25 AM Patrick Horan wrote: Congrats Jiang! On

Re: Using Arrow on RHEL/CentOS/Rocky and related linux distros

2022-11-02 Thread Benson Muite
On 11/2/22 10:32, Sutou Kouhei wrote: Hi, As an example Arrow is packaged in Fedora/EPEL. The spec file does not bundle Abseil, thrift, gRPC, https://src.fedoraproject.org/rpms/libarrow/blob/rawhide/f/libarrow.spec Because Fedora ships recent Abseil, Thrift and gRPC. It doesn't use software c

Re: Using Arrow on RHEL/CentOS/Rocky and related linux distros

2022-10-31 Thread Benson Muite
On 10/31/22 00:14, Sutou Kouhei wrote: Hi, Thanks for the suggestion. But what do we need to do for it? For example, our RPMs for AlmaLinux 9 bundle the following libraries: https://github.com/ursacomputing/crossbow/actions/runs/3354778483/jobs/5558561346#step:6:463 * Protocol Buffers * jemal

Using Arrow on RHEL/CentOS/Rocky and related linux distros

2022-10-30 Thread Benson Muite
Arrow releases are distributed as an RPM package for these distributions. However, many dependencies are bundled with the released RPMs, which may make using them in other software problematic. Software collections[1] are similar to Python virtual envs for RPM based distributions. They would

Re: [VOTE][Julia] Release Apache Arrow Julia 2.4.0 RC1

2022-10-26 Thread Benson Muite
+1 non binding, Rocky linux 9 $ dev/release/verify_rc.sh 2.4.0 1 and also Cent OS 7 (Signature verification fails due to older libraries) $ VERIFY_SIGN=0 dev/release/verify_rc.sh 2.4.0 1

Re: [ANNOUNCE] New Arrow PMC member: Nicola Crane

2022-10-25 Thread Benson Muite
Congratulations Nic! On 10/26/22 04:11, Vibhatha Abeykoon wrote: Congrats Nic! On Wed, Oct 26, 2022 at 5:30 AM Ashish wrote: Congrats ! On Wednesday, October 26, 2022, Anja wrote: Congrats!! On Tue, 25 Oct 2022 at 15:45, Rok Mihevc wrote: Congrats Nic! Rok On Tue, Oct 25, 2022 at 11

Re: [ANNOUNCE] New Arrow committer: Bogumił Kamiński

2022-10-25 Thread Benson Muite
Congratulations Bogumił! On 10/26/22 04:13, Vibhatha Abeykoon wrote: Congrats Bogumił! On Wed, Oct 26, 2022 at 4:24 AM Rok Mihevc wrote: Congrats Bogumił! Rok On Tue, Oct 25, 2022 at 11:15 PM David Li wrote: Welcome Bogumił! On Tue, Oct 25, 2022, at 17:05, Sutou Kouhei wrote: Hi, On b

Re: [ANNOUNCE] New Arrow PMC member: Jacob Quinn

2022-10-25 Thread Benson Muite
Congratulations Jacob! On 10/26/22 04:12, Vibhatha Abeykoon wrote: Congratulations Jacob! On Wed, Oct 26, 2022 at 4:23 AM Rok Mihevc wrote: Congratulations Jacob! Rok On Tue, Oct 25, 2022 at 11:15 PM David Li wrote: Congrats Jacob!! On Tue, Oct 25, 2022, at 17:06, Sutou Kouhei wrote: T

Re: [DISCUSS] Move issue tracking to

2022-10-23 Thread Benson Muite
It is unclear why the infrastructure team cannot allow a variety of authentication mechanisms - Gitee for example enables SMS authentication and validation by any validated Gitee user to obtain basic functionality. My expectation is that any committer or validated contributor (not just PMC) ca

Re: [VOTE] Release Apache Arrow 10.0.0 - RC0

2022-10-23 Thread Benson Muite
] TestFlightSqlServer.TestCommandGetExportedKeys (5 ms) /root/apache-arrow-10.0.0/cpp/src/arrow/flight/sql/server_test.cc:708: Failure Failed '_error_or_value113.status()' failed with Invalid: Can't prepare statement: near "(": syntax error. gRPC client deb

Re: [VOTE] Release Apache Arrow 10.0.0 - RC0

2022-10-23 Thread Benson Muite
WIP but source verification fails for me on CentOS 7 due to unsigned key from Neville Dipale: TEST_DEFAULT=0 TEST_SOURCE=1 dev/release/verify-release-candidate.sh 10.0.0 0 gpg: key 717D3FB2: no valid user IDs gpg: this may be caused by a missing self-signature ... gpg: Total number proces

Re: [VOTE][RUST] Release Apache Arrow Rust 22.0.0 RC1

2022-09-03 Thread Benson Muite
This probably now binding. Congratulations. On 9/3/22 03:08, L. C. Hsieh wrote: +1 (non-binding) Verified on Intel Mac. Thanks, Andrew. On Fri, Sep 2, 2022 at 3:45 PM Andy Grove wrote: +1 (binding) Verified on Ubuntu 20.04.4 LTS Thanks, Andrew. On Fri, Sep 2, 2022 at 12:25 PM Ian Joiner

Re: Proposal: Unassign idle issues

2022-07-08 Thread Benson Muite
On 7/8/22 18:49, Todd Farmer wrote: Hello, The backlog of ARROW issues currently stands at 2585 open issues [1]. The size of the backlog presents challenges to users and developers alike, and I believe the project would benefit from establishing guidance around issue handling. I'll be submitting

Re: [ANNOUNCE] New Arrow committers: Dewey Dunnington, Alenka Frim, and Rok Mihevc

2022-06-22 Thread Benson Muite
Well deserved! Congratulations! On 6/22/22 21:02, Andrew Lamb wrote: Congratulations! On Wed, Jun 22, 2022 at 1:27 PM Dragoș Moldovan-Grünfeld < dragos.m...@gmail.com> wrote: Congratulations! Sent from my iPhone On 22 Jun 2022, at 18:13, Neal Richardson wrote: On behalf of the Arrow PM

Re: Arrow sync call April 27 at 12:00 US/Eastern, 16:00 UTC

2022-04-27 Thread Benson Muite
Attendees: Ian Joiner Matthew Topol Benson Muite Discussion points: 1) New book on Arrow - covers C++, Python and Go, out in June 2) Building ORC bindings in R would be useful, extensions to parallel R? 3) Comparing ORC and Parquet for IO 4) IO optimization vs SIMD optimization - Parquet seems

Re: Arrow sync call April 13 at 12:00 US/Eastern, 16:00 UTC

2022-04-27 Thread Benson Muite
On 4/25/22 2:49 PM, David Li wrote: Following up here: N.B. The Voltron Data folks have a scheduling conflict on 4/27 and will not be able to host the fortnightly sync call. Is anyone available to run the meeting that day? Is anyone available to run the sync call this Wednesday? On Wed, Ap

Re: Arrow sync call April 27 at 12:00 US/Eastern, 16:00 UTC

2022-04-27 Thread Benson Muite
Hi, Can host if required, though the timing is not ideal for me. It may be helpful to vary the timing in future. Benson On 4/25/22 2:49 PM, David Li wrote: Following up here: N.B. The Voltron Data folks have a scheduling conflict on 4/27 and will not be able to host the fortnightly sync c

Re: Perf/Benchmark for temporal operations

2022-04-17 Thread Benson Muite
On 4/13/22 7:58 PM, Rok Mihevc wrote: Thanks for describing the use case Li! The examples we ran are on UTC timestamp without any timezone complications, perhaps there is room for short circuits when there are no timezone complications... I think using UTC zoned timestamp array might currentl

Re: [ANNOUNCE] New Arrow committers: Raphael Taylor-Davies, Wang Xudong, Yijie Shen, and Kun Liu

2022-03-09 Thread Benson Muite
Congratulations! On 3/9/22 9:56 PM, David Li wrote: Congrats everyone! On Wed, Mar 9, 2022, at 13:47, Rok Mihevc wrote: Congrats all! Rok On Wed, Mar 9, 2022 at 7:16 PM QP Hou wrote: Congratulations to all, well deserved! On Wed, Mar 9, 2022 at 9:37 AM Daniël Heres wrote: Congratulati

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2022-03-02 Thread Benson Muite
Interested in learning more about this. Can work through the code and discuss on 17 March either 4:00 or 16:00 UTC. Benson On 3/3/22 12:03 AM, Andrew Lamb wrote: I noticed that Matthew Turner added a note to the agenda[1] for a walk through of the JIT code. I would be interested in this as wel

Re: [ANNOUNCE] New Arrow PMC member: QP Hou

2022-02-17 Thread Benson Muite
Congratulations QP! On 2/18/22 8:35 AM, Jiayu Liu wrote: Congratulations QP! On Fri, Feb 18, 2022 at 1:32 PM Micah Kornfield wrote: Congrats! On Thu, Feb 17, 2022 at 7:27 PM Weston Pace wrote: Congratulations QP! On Thu, Feb 17, 2022 at 3:22 PM hao Yang <1371656737...@gmail.com> wrote:

Re: Building Arrow Cpp: Cannot find Boost on MacOS

2022-02-02 Thread Benson Muite
On 2/2/22 7:40 PM, Li Jin wrote: David - Will give it a try. I am using Apple clang 12 on MacOS. Related issue https://issues.apache.org/jira/browse/ARROW-15531

Re: Building Arrow Cpp: Cannot find Boost on MacOS

2022-02-02 Thread Benson Muite
ry, std::string class_desc, " On Wed, Feb 2, 2022 at 10:11 AM Benson Muite wrote: Can you try using one of the CMAKE options: -DARROW_DEPENDENCY_SOURCE=BREW -DARROW_DEPENDENCY_SOURCE=BUNDLED see https://arrow.apache.org/docs/developers/cpp/building.html

Re: Building Arrow Cpp: Cannot find Boost on MacOS

2022-02-02 Thread Benson Muite
Can you try using one of the CMAKE options: -DARROW_DEPENDENCY_SOURCE=BREW -DARROW_DEPENDENCY_SOURCE=BUNDLED see https://arrow.apache.org/docs/developers/cpp/building.html On 2/2/22 5:44 PM, Li Jin wrote: Also tried to test a basic CMake file with boost on my machine and it appears to find it

Re: [VOTE] Release Apache Arrow 7.0.0 - RC10

2022-01-30 Thread Benson Muite
+1 Non binding Checks on Rocky Linux 8, x86-64, GNU 8.5 Source C++, Python, GLib, Ruby, Java, Go, Javascript Wheels Binaries Some of the warnings below: /tmp/arrow-7.0.0.v8rVF/apache-arrow-7.0.0/cpp/src/arrow/compute/exec/ir_consumer.cc: In function ‘arrow::Result arrow::compute::Convert(co

Re: [ANNOUNCE] New Arrow PMC chair: Kouhei Sutou

2022-01-25 Thread Benson Muite
Congratulations Kou! On 1/25/22 8:44 PM, Vibhatha Abeykoon wrote: Congrats Kou! On Tue, Jan 25, 2022 at 11:13 PM Ian Joiner wrote: Congrats Kou! On Tuesday, January 25, 2022, Wes McKinney wrote: I am pleased to announce that we have a new PMC chair and VP as per our newly started traditi

Re: [DISCUSS] Annual rotation of Arrow PMC chair

2022-01-05 Thread Benson Muite
Congratulations! On 1/6/22 12:54 AM, Sutou Kouhei wrote: Hi, Thanks for nominating me. I'm happy to serve. Thanks,

Re: [ANNOUNCE] New Arrow committer: Alessandro Molina

2022-01-05 Thread Benson Muite
Congratulations On 1/5/22 8:39 PM, Vibhatha Abeykoon wrote: Congratulations On Wed, Jan 5, 2022 at 9:29 PM Supun Kamburugamuve wrote: Congratulations! On Wed, Jan 5, 2022 at 10:17 AM Niranda Perera wrote: Congrats Alessandro! :-) On Wed, Jan 5, 2022 at 9:54 AM David Li wrote: Congrat

Re: [ANNOUNCE] New Arrow PMC member: Yibo Cai

2022-01-04 Thread Benson Muite
Congratulations! On 1/4/22 6:00 PM, Wang Xudong wrote: Congratulations! xudong Andrew Lamb 于2022年1月4日周二 21:43写道: Congratulations, Yibo! Andrew On Tue, Jan 4, 2022 at 8:14 AM Neal Richardson < neal.p.richard...@gmail.com> wrote: Congratulations, Yibo! Neal On Tue, Jan 4, 2022 at 7:15 A

Re: Arrow in HPC

2021-12-28 Thread Benson Muite
This is very nice. Look forward to trying it out. One should get performance improvements on hardware with better interconnects, so performance just with TCP is not illustrative of all cases. On 12/28/21 11:41 PM, David Li wrote: Thanks for the feedback! Collective operations: unfortunately,

Re: Arrow vs Artus

2021-12-28 Thread Benson Muite
) and ORC then Arrow. There are interesting ideas from the Procella paper which covers Artus that might be worth thinking about in the context of these formats (or a new one). Arrow has not spent much focus on optimizing storage size. Cheers, Micah On Wednesday, December 22, 2021, Benson Muite

Re: Jira Access

2021-12-22 Thread Benson Muite
On 12/23/21 8:01 AM, Dulvin Witharane wrote: Hi, I would love to have access to JIRA. Please enroll me or let me know the due process. Thanks and regards, You should be able to make a JIRA account but do need to request contributor permissions as described in: https://github.com/apache/arro

Re: Arrow sync call December 22 at 12:00 US/Eastern, 17:00 UTC

2021-12-22 Thread Benson Muite
On 12/22/21 11:04 PM, Ian Cook wrote: Discussion of how best to use this meeting in 2022 - Consider changing the day/time? How can we best accommodate time zones, people doing Arrow dev work in day jobs vs. on evenings and weekends, etc? Further discussion about this is welcome Mining the Arrow

Re: Arrow vs Artus

2021-12-22 Thread Benson Muite
On 12/23/21 7:14 AM, Hayden Livingston wrote: Has anyone been able to benchmark the Artus file format vs Arrow? It seems that the Artus file format is gaining traction inside Google, replacing their current columnar format Capacitor. Hayden, Do you have a link to a specification or implementat

Re: [ANNOUNCE] New Arrow PMC member: Daniël Heres

2021-12-21 Thread Benson Muite
Congratulations! On 12/22/21 9:23 AM, QP Hou wrote: Congrats Daniël! Thank you for all your awesome work on the rust implementation and datafusion! On Tue, Dec 21, 2021 at 9:49 PM Eduardo Ponce wrote: Congrats! On Dec 21, 2021, at 12:18 PM, Wes McKinney wrote: The Project Management Comm

Re: [VOTE][RUST] Release Apache Arrow Rust 6.4.0 RC1

2021-12-12 Thread Benson Muite
+1 non binding. Ran script on Rocky Linux 8. Steps: dnf -y update dnf -y install gcc tar git git clone https://github.com/apache/arrow-rs cd arrow-rs/dev/release bash verify-release-candidate.sh 6.4.0 1 On 12/10/21 10:30 PM, Andrew Lamb wrote: Hi, I would like to propose a release of Apache Ar

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2021-12-08 Thread Benson Muite
The slides are very helpful. Is the timing correct, 4 UTC? On 12/9/21 12:16 AM, Andrew Lamb wrote: I plan to do the biweekly call tomorrow We are experimenting a bit with the agenda[1] for this one. After discussing any other agenda items people want, I plan to do a code walkthrough with Matth

Re: [ANNOUNCE] New Arrow committer: Rémi Dattai

2021-12-08 Thread Benson Muite
Congratulations Rémi On 12/9/21 3:39 AM, QP Hou wrote: Congrats Rémi, thank you for your epic work on datafusion :) On Wed, Dec 8, 2021 at 9:00 AM Andrew Lamb wrote: Congratulations Rémi! On Wed, Dec 8, 2021 at 10:56 AM Nic wrote: Congratulations! :) On Wed, 8 Dec 2021 at 07:20, Jorge C

Re: [VOTE] Release Apache Arrow JS 6.0.2

2021-12-07 Thread Benson Muite
to Arrow especially when combined with WebGPU. Either way, I think we should release the 6.0.2 version soon. @PMC, could you vote on the patch release? On Nov 28, 2021 at 04:33:41, Benson Muite wrote: Rust implementation can be compiled to WebAssembly and is released biweekly. The Javascript

Re: [VOTE][RUST] Release Apache Arrow Rust 6.3.0 RC1

2021-11-28 Thread Benson Muite
+1 non-binding tests pass, steps on Rocky Linux 8 dnf -y update dnf -y install git tar gcc git clone https://github.com/apache/arrow-rs cd arrow-rs/dev/release ./verify-release-candidate.sh 6.3.0 1 On 11/26/21 6:15 PM, Jörn Horstmann wrote: +1 non-binding Updated our query engine (which

Re: [VOTE] Release Apache Arrow JS 6.0.2

2021-11-28 Thread Benson Muite
for the clarification. There are no breaking changes in this point release, just fixes. @PMC, could you please vote on this point release. Would anyone volunteer as the release manager with me to give me a better understanding of the process? On Nov 23, 2021 at 13:09:47, Benson Muite wrote

Re: [VOTE] Release Apache Arrow JS 6.0.2

2021-11-23 Thread Benson Muite
. On Nov 20, 2021 at 09:57:53, Dominik Moritz wrote: Thanks for catching that. Jest is used for running the tests and jest supports node 14.15. Could we switch to node 14.15 instead of 14.0 for this test? On Nov 20, 2021 at 05:37:00, Benson Muite wrote: Hi, Tested this on AlmaLinux 8

Re: [VOTE] Release Apache Arrow JS 6.0.2

2021-11-20 Thread Benson Muite
Hi, Tested this on AlmaLinux 8. Following steps: export NVM_DIR="`pwd`/.nvm" mkdir -p $NVM_DIR curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.35.3/install.sh | \ PROFILE=/dev/null bash [ -s "$NVM_DIR/nvm.sh" ] && \. "$NVM_DIR/nvm.sh" nvm install --lts n

Re: [DISCUSS] A repository for collaborative prototyping + algorithms / performance research?

2021-11-18 Thread Benson Muite
On 11/18/21 6:29 PM, Wes McKinney wrote: On Thu, Nov 18, 2021 at 2:25 AM Antoine Pitrou wrote: Le 18/11/2021 à 02:54, Wes McKinney a écrit : In short I wanted to propose creating a separate git repository under apache/arrow-* for this purpose, to invite these kinds of contributions to our p

Re: [ANNOUNCE] New Arrow PMC member: Joris Van den Bossche

2021-11-18 Thread Benson Muite
Congratulations! On 11/18/21 2:17 PM, Rok Mihevc wrote: Congrats Joris! On Thu, Nov 18, 2021 at 11:40 AM Krisztián Szűcs wrote: Congrats Joris! On Thu, Nov 18, 2021 at 10:03 AM Maarten Breddels wrote: Nice Joris, congratulations! On Thu, Nov 18, 2021 at 9:34 AM Nic wrote: Congrat

Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 6.0.0 RC0

2021-11-14 Thread Benson Muite
+1 (non-binding) Tested on Rocky Linux 8 using the verification script On 11/14/21 2:43 PM, Wang Xudong wrote: +1 (non-binding) I checked on macOS Monterey. Thanks, — xudong963 QP Hou 于2021年11月14日周日 下午5:03写道: Hi, I would like to propose a release of Apache Arrow Datafusion Implementation

Re: [VOTE][RUST] Release Apache Arrow Rust 6.2.0 RC1

2021-11-13 Thread Benson Muite
+1 (non-binding) Checked signature and ran release verification script on Rocky Linux 8 Benson On 11/13/21 9:00 AM, dong a wrote: +1 (non-binding) I checked signatures and ran the release verification script on macOS Monterey. Thanks, — xudong963 On 2021/11/12 12:24:16 Andrew Lamb wrote: Hi,

Re: [VOTE] Release Apache Arrow 6.0.1 - RC1

2021-11-12 Thread Benson Muite
+1 non binding Verified C++/Go/Java/Javascript/Python/Ruby sources (Rocky Linux 8) bash verify-release-candidate.sh source 6.0.1 1 bash verify-release-candidate.sh wheels 6.0.1 1 bash verify-release-candidate.sh binaries 6.0.1 1 g++ (GCC) 8.4.1 20200928 (Red Hat 8.4.1-1) ruby 2.7.4p191 (2021-07

Re: [VOTE] Release Apache Arrow 6.0.1 - RC0

2021-11-08 Thread Benson Muite
On AlmaLinux 8 Python 3.6.8 openjdk version "1.8.0_312" gcc (GCC) 8.4.1 20200928 ruby 2.7.4p191 Docker version 20.10.10, build b485636 dev/release/verify-release-candidate.sh source 6.0.1 0 passes dev/release/verify-release-candidate.sh wheels 6.0.1 0 Cannot run the verification script directly

Re: [DISCUSS] Community maintained extension repos for Datafusion

2021-11-07 Thread Benson Muite
A community owned GitHub organization would be helpful. Maybe for all other Arrow related projects not just Datafusion. This would make them easier to find, and for community members to contribute. It could also include a listing of relevant projects elsewhere. On 11/7/21 9:40 AM, Jiayu Liu wr

Re: [RUST] 6.0.0 Release Communication

2021-10-28 Thread Benson Muite
Andrew, Can write something over the weekend. Benson On 10/28/21 2:29 PM, Andrew Lamb wrote: Does anyone want to write up a blog post or more details on the 6.0.0 Rust Release? The 6.0.0 arrow blog post[1] is about to ship — I added a brief summary of the Rust content, but additional conten

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2021-10-27 Thread Benson Muite
Cannot host at 4:00 UTC on 28 October, but can host at 4:00 UTC on 11 November. Benson On 10/27/21 1:02 PM, Andrew Lamb wrote: We have some proposed agenda items[1] for the Rust sync[1] this week so I will plan to see anyone who is interested tomorrow. Andrew [1] https://docs.google.com/docu

Re: Arrow in HPC

2021-10-27 Thread Benson Muite
UCX is interesting, relatively new and seems like it may be easier to integrate. MPI is the most commonly used backend for HPC. Influencing the development of UCX is more difficult than influencing the development of MPI, but both have a slower pace of development than Arrow. One may want to co

Re: [VOTE] Release Apache Arrow 6.0.0 - RC3

2021-10-26 Thread Benson Muite
3b8445c3ef69fc4131764 On Sat, Oct 23, 2021 at 1:59 AM Benson Muite wrote: on Ubuntu 20.04 x86 Checked sources (C++, Python, Java, Ruby, Glib, C#, Javascript) bash dev/release/verify-release-candidate.sh source 6.0.0 3 gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0 Ubuntu clang version 10.0.1-++2021

Re: [VOTE] Release Apache Arrow 6.0.0 - RC3

2021-10-22 Thread Benson Muite
on Ubuntu 20.04 x86 Checked sources (C++, Python, Java, Ruby, Glib, C#, Javascript) bash dev/release/verify-release-candidate.sh source 6.0.0 3 gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0 Ubuntu clang version 10.0.1-++20211003085942+ef32c611aa21-1~exp1~20211003090334.2 ruby 3.0.2p107 (2021-07-07

[DISCUSS][Rust] Biweekly sync call for arrow/datafusion

2021-10-13 Thread Benson Muite
In case there is a need for a UTC 4:00 sync: https://trybbb.ml/#/rust-arrow-call

[C++] Comparison functions for strings in Between Ternary Kernel

2021-10-11 Thread Benson Muite
When comparing strings using C++, the default behavior is to order by UTF8 codepoints which impacts comparing strings such as a < b < c [1][2]. This may not be appropriate in all cases and like in the sort function [3], it may be helpful to have an optional field for comparison keys. An examp

Re: [ANNOUNCE] New Arrow committer: Jiayu Liu

2021-10-07 Thread Benson Muite
Congratulations Jiayu Liu! On 10/7/21 1:56 PM, Andrew Lamb wrote: Hi, On behalf of the Arrow PMC, I'm happy to announce that Jiayu Liu has accepted an invitation to become a committer on Apache Arrow. Welcome, and thank you for your contributions! Andrew

Re: [DISCUSS] Deprecate user@ in favor for github issues/discussions

2021-10-01 Thread Benson Muite
Mail is archived at [1] and [2], which uses Pony mail [3][4]. Contributed to an issue to make this more search engine friendly[5]. Search is really helpful to find answers as a user before posting a question. Arrow is developing rapidly, at present with greater engagement between developers b

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2021-09-29 Thread Benson Muite
Attendees: Ruihang Benson Discussion items: Self-introduction OpenVidu seemed to work Data Fusion introduction Speed of Arrow development process and intended use cases Maybe get time zones of attendees? On 9/30/21 6:58 AM, Benson Muite wrote: Join link: https://mkutano.nairuby.org/#/soft

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2021-09-29 Thread Benson Muite
Join link: https://mkutano.nairuby.org/#/soft-amaranth-alpaca Sorry it is late. Meeting should be short, as it seems there is a preference for one meeting. On 9/29/21 10:59 AM, Benson Muite wrote: Hi, Will send a link to a BigBlueButton/OpenVidu instance at 3:45 UTC tomorrow. Update the

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2021-09-29 Thread Benson Muite
/1atCVnoff5SR4eM4Lwf2M1BBJTY6g3_HUNR6qswYJW_U/edit On 9/24/21 4:26 PM, Andrew Lamb wrote: Thank you! On Thu, Sep 23, 2021 at 4:17 PM Benson Muite wrote: Can host 4:00 UTC, will likely use a self-hosted video conferencing solution that should just work in the browser. Benson On 9/22/21 11:15

Re: C++ Boost GitHub URL in ThirdpartyToolchain.cmake

2021-09-28 Thread Benson Muite
Hmm, it should. Can you open a JIRA with the full build logs? In the meantime though, you can also install the Developer Toolset to get a much newer gcc version: https://www.softwarecollections.org/en/scls/rhscl/devtoolset-8/ Regards Antoine. Ticket created: https://issues.apache.org/jira/br

Re: C++ Boost GitHub URL in ThirdpartyToolchain.cmake

2021-09-28 Thread Benson Muite
On 9/28/21 10:47 AM, Antoine Pitrou wrote: Le 28/09/2021 à 09:41, Benson Muite a écrit : Sorry, second one should have -DARROW_BUILD_TESTS=ON instead of -DARROW_BUILD_TESTS=OFF I see. What is the gcc version? 4.8 (will need to rebuild to get minor version) is default on Cent OS 7 - expect

Re: C++ Boost GitHub URL in ThirdpartyToolchain.cmake

2021-09-28 Thread Benson Muite
On 9/28/21 10:36 AM, Antoine Pitrou wrote: Hi, Le 28/09/2021 à 09:25, Benson Muite a écrit : Maybe helpful to create a ticket at: https://issues.apache.org/jira/projects/ARROW for more documentation on setup with Cent OS 7 Currently trying this on commit 1f481d9 (tagged as apache-arrow-6.0.0

Re: C++ Boost GitHub URL in ThirdpartyToolchain.cmake

2021-09-28 Thread Benson Muite
The lines > mkdir build > cd build should be mkdir cpp/build cd cpp/build Let me know if other configurations/bindings are needed. On 9/28/21 10:25 AM, Benson Muite wrote: Maybe helpful to create a ticket at: https://issues.apache.org/jira/projects/ARROW for more documentation on setu

Re: C++ Boost GitHub URL in ThirdpartyToolchain.cmake

2021-09-28 Thread Benson Muite
string_view.hpp:483:20: note: constexpr nonstd::sv_lite::basic_string_viewTraits>::basic_string_view() [with CharT = char; Traits = std::char_traits] nssv_constexpr basic_string_view() nssv_noexcept ^ /root/arrow/cpp/src/arrow/vendored/string_view.hpp:483:20: note:

Re: C++ Boost GitHub URL in ThirdpartyToolchain.cmake

2021-09-27 Thread Benson Muite
Hi Rares, What operating system are you using? Benson On 9/28/21 7:38 AM, Rares Vernica wrote: Hello, I'm still struggling to build Arrow with Parquet. I compiled Thrift myself but I'm running into dependency issues with Boost. It looks like the Boost download URL provided in ThirdpartyToolchai

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2021-09-23 Thread Benson Muite
Can host 4:00 UTC, will likely use a self-hosted video conferencing solution that should just work in the browser. Benson On 9/22/21 11:15 PM, Andrew Lamb wrote: The idea of time variation sounds great. As I am not typically available at 4:00 UTC I would appreciate it if someone else could pl

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2021-09-19 Thread Benson Muite
New to this. A suggestion may be to consider two of the times, eg. 4:00 UTC and 16:00 UTC perhaps alternating allowing geographic diversity in joining convenience. On 9/20/21 6:45 AM, QP Hou wrote: 16 UTC works for me too. On Sun, Sep 19, 2021 at 10:00 AM zied bf wrote: HI everyone, Still