Re: [ANNOUNCE] New Arrow PMC member: Jacob Wujciak

2025-03-17 Thread Jonathan Keane
Congratulations! > On Mar 17, 2025, at 19:22, Andrew Lamb wrote: > > Congratulations! > > Andrew > > On Mon, Mar 17, 2025 at 5:17 PM Antoine Pitrou wrote: > >> >> Congratulations Jacob :) >> >> Le 17/03/2025 à 18:28, Jacob Wujciak a écrit : >>> Thank you everyone! >>> >>> Bryce Mecum sch

Re: [DISCUSS] Split R release process

2025-03-03 Thread Jonathan Keane
I agree with others and don't see much upside to splitting right now. One small additional note: "The R bindings can also work with old C++ versions" is technically true, for some version (pairs), but it turns out enforcing this is awkward, and the consensus [1] (so far) is that we actually want t

Re: [C++] Bump required CMake version

2024-12-12 Thread Jonathan Keane
I’ve commented on that PR [1] one issue I saw (us possibly overloading the CMAKE env var). As for CRAN: we have/use whatever is on the machines doing the building. We do download newer cmake if it’s required (and our internet isn’t blocked) for all builds except for Windows. The windows failur

Re: [C++] Can we drop support for macOS 12?

2024-09-23 Thread Jonathan Keane
One place we will need to support some older macOSes is CRAN. Currently their oldrel macos arm64 runner runs 11.6.7: https://cran.r-project.org/web/checks/check_flavors.html#r-oldrel-macos-x86_64 we have generally not had problems with this in the past, even without dedicated CI runners for it, but

Re: Policy on access to ursacomputing/crossbow?

2024-05-29 Thread Jonathan Keane
d. > > Regards, > Raúl > > El sáb, 25 may 2024 a las 2:02, Jonathan Keane () > escribió: > > > > Over my time with the project I've had access to the github repository > > ursacomputing/crossbow to be able to manually trigger crossbow jobs. I > find

Policy on access to ursacomputing/crossbow?

2024-05-24 Thread Jonathan Keane
Over my time with the project I've had access to the github repository ursacomputing/crossbow to be able to manually trigger crossbow jobs. I find it incredibly helpful when working on the extended R CI to be able to iterate more quickly than waiting for the comment bot. But also over the time I'v

Re: [ANNOUNCE] New Arrow committer: Bryce Mecum

2024-03-18 Thread Jonathan Keane
Congrats and welcome, Bryce. -Jon On Mon, Mar 18, 2024 at 6:47 AM Antoine Pitrou wrote: > > Congratulations Bryce, and keep up the good work! > > Regards > > Antoine. > > Le 18/03/2024 à 03:21, Nic Crane a écrit : > > On behalf of the Arrow PMC, I'm happy to announce that Bryce Mecum has > > a

Re: New tag for releases for R-universe

2024-02-10 Thread Jonathan Keane
Thanks for this Nic. And just to clarify: the latest here is the latest _release_ of Apache Arrow with this new set up. Prior to this the build available on R-universe were effectively dev builds (commits to main), but with this new tag, R-universe will only have (or at least default to having) th

Re: [ANNOUNCE] New Arrow PMC member: Raúl Cumplido

2023-11-13 Thread Jonathan Keane
Congratulations and welcome! -Jon

Re: [ANNOUNCE] New Arrow PMC member: Jonathan Keane

2023-10-23 Thread Jonathan Keane
; > > Congratulations, Jonathan! > > > > > > From: Dane Pitkin > > Sent: Monday, October 16, 2023 11:52 AM > > To: dev@arrow.apache.org > > Subject: Re: [ANNOUNCE] New Arrow PMC member: Jonathan Keane > > >

Re: Help regarding setting up the r package in arrow apache

2023-10-20 Thread Jonathan Keane
tially be made more clear, and it's > always > > useful to get feedback on docs like these. > > > > Nic > > > > [1] https://arrow.apache.org/docs/r/articles/developers/docker.html > > > > On Fri, 20 Oct 2023 at 08:13, Divyansh Khatri < > divyans

Re: [VOTE][Format] C data interface format strings for Utf8View and BinaryView

2023-10-18 Thread Jonathan Keane
+1 -Jon On Wed, Oct 18, 2023 at 2:26 PM Felipe Oliveira Carvalho < felipe...@gmail.com> wrote: > +1 > > On Wed, Oct 18, 2023 at 2:49 PM Dewey Dunnington > wrote: > > > +1! > > > > On Wed, Oct 18, 2023 at 2:14 PM Matt Topol > wrote: > > > > > > +1 > > > > > > On Wed, Oct 18, 2023 at 1:05 PM An

Re: Help regarding setting up the r package in arrow apache

2023-10-18 Thread Jonathan Keane
For development of the R package with docker containers, the link [1] that Nic sent in this same thread is the place to go. In addition to that docker-focused one, there are a handful of others that might prove useful to you in getting your development environment setup [2]. If you run into any is

Re: [Vote][Format] (new proposal) C data interface format string for ListView and LargeListView arrays

2023-10-07 Thread Jonathan Keane
+1 -Jon On Sat, Oct 7, 2023 at 3:54 AM Joris Van den Bossche < jorisvandenboss...@gmail.com> wrote: > +1 > > On Sat, 7 Oct 2023 at 10:44, Antoine Pitrou wrote: > > > > > > +1 from me. > > > > But I also reiterate my plea that these existing parsers get fixed so as > > to entirely validate the

Re: [Python][Discuss] PyArrow Dataset as a Python protocol

2023-06-28 Thread Jonathan Keane
> I would understand this objection more if DuckDB hasn't been relying on > being able to pass PyArrow expressions for 18 months now [1]. Unless, do we > just think this isn't widely used enough that we don't care? This isn't a pro or a con of specifically adopting the PyArrow expression semantics

Re: [VOTE] Move issue tracking to GitHub Issues

2022-10-26 Thread Jonathan Keane
+1, I'm very glad to see what will hopefully be a _slightly smoother_ experience for new contributors + issue reporters -Jon On Wed, Oct 26, 2022 at 7:05 PM David Li wrote: > +1 > > On Wed, Oct 26, 2022, at 20:01, Andy Grove wrote: > > +1 > > > > On Wed, Oct 26, 2022 at 5:50 PM L. C. Hsieh wr

Re: [ANNOUNCE] New Arrow PMC member: Nicola Crane

2022-10-25 Thread Jonathan Keane
Congratulations! Your contributions to the project have been immeasurable. -Jon On Tue, Oct 25, 2022 at 8:12 PM Vibhatha Abeykoon wrote: > Congrats Nic! > > On Wed, Oct 26, 2022 at 5:30 AM Ashish wrote: > > > Congrats ! > > > > On Wednesday, October 26, 2022, Anja wrote: > > > > > Congrats!!

Re: [VOTE] Mark C Stream Interface as Stable

2022-06-08 Thread Jonathan Keane
+1 (non binding) -Jon On Wed, Jun 8, 2022 at 4:52 PM Jorge Cardoso Leitão wrote: > > Sorry, I got a bit confused on what we were voting on. Thank you for the > clarification. > > +1 > > Best, > Jorge > > > On Wed, Jun 8, 2022 at 9:53 PM Antoine Pitrou wrote: > > > > > Le 08/06/2022 à 20:55, Jor

Re: Existence/name/scope for minimal C/C++ Arrow C Data interface helpers

2022-06-03 Thread Jonathan Keane
time ago [1]. If we want to encourage > database driver libraries to add new APIs that emit the Arrow C > interface, we need to make it easier to generate the C interface > without requiring a new library dependency. > > [1]: https://lists.apache.org/thread/gnz1kz2rj3rb8rh8qz7l0m

Re: [Discuss][Java] macOS minimum requirements

2022-06-01 Thread Jonathan Keane
This isn't Java related directly, but for the R bindings we have to support at least 10.13.6 to be on CRAN, so bumping up to 10.13 would be fine for that too. -Jon On Wed, Jun 1, 2022 at 9:24 AM Antoine Pitrou wrote: > > > Sorry, I put "C++" in the title but this really affects Java via JNI. > >

Re: Existence/name/scope for minimal C/C++ Arrow C Data interface helpers

2022-05-30 Thread Jonathan Keane
Thanks for working on this. I've heard people asking about something like this from a number of different fronts on top of the obvious use case in geoarrow | other geospatial libraries. I think a minimal piece of Arrow that other packages could depend on without needing to bring in all of arrow wou

Re: DISCUSS: Stabilize Arrow C Stream Interface?

2022-05-26 Thread Jonathan Keane
I too am +1 (nonbinding) to marking it as stable -Jon On Thu, May 26, 2022 at 1:05 PM Neal Richardson wrote: > +1 from me too to mark it as stable. De facto it is stable: there have been > no modifications to > https://github.com/apache/arrow/blob/master/cpp/src/arrow/c/abi.h since > the > str

Re: [VOTE] Release Apache Arrow 7.0.0 - RC8

2022-01-27 Thread Jonathan Keane
+0 most things validate, though I haven't been able to run the C++ tests successfully Thank you for the huge effort Krisztián. I verified the signature + checksums on [3]. I've run the following (on macOS 12.1): The binary verification — successful. I've also run the source verification on: *

Re: [Parquet][C++][Python] Maximum Row Group Length Default

2021-11-17 Thread Jonathan Keane
This doesn't address the large number of row groups ticket that was raised, but for some visibility: there is some work to change the row group sizing based on the size of data instead of a static number of rows [1] as well as exposing a few more knobs to tune [2] There is a bit of prior art in th

Re: Arrow sync call November 10 at 12:00 US/Eastern, 17:00 UTC

2021-11-10 Thread Jonathan Keane
Meeting notes: # Participants Nic Weston David Eduardo Benson Rok Antoine Alenka James Matt Micah # 6.0.1 patch release The RC1 for 6.0.1 is on its way and will have a vote shortly # Flight SQL David wanted to talk about Flight SQL from Dremio. We are close, would like someone to revi

Re: [DISCUSS] Deprecate user@ in favor for github issues/discussions

2021-09-29 Thread Jonathan Keane
I am also +1 for all of the same reasons both Neal and Philip mention. Lowering that barrier to participation for getting help + having that information more easily findable will make it easiest for folks to use and adopt Arrow. I will add personally I didn't realize I already do this when working

Re: [VOTE] Restart the Julia implementation with new repository and process

2021-09-27 Thread Jonathan Keane
+1 -Jon On Mon, Sep 27, 2021 at 2:26 PM Mauricio Vargas wrote: > > +1 > > On Mon, Sep 27, 2021 at 3:18 PM Neal Richardson > wrote: > > > +1 (binding) > > > > Neal > > > > On Mon, Sep 27, 2021 at 6:54 AM Andrew Lamb wrote: > > > > > +1 (binding) > > > > > > On Mon, Sep 27, 2021 at 12:17 AM Andy

Re: Arrow sync call August 3 at 12:00 US/Eastern, 16:00 UTC

2021-08-04 Thread Jonathan Keane
Notes for the meeting, it was relatively short and sparsely attended this fortnight: Attendees: * David Li * Jonathan Keane * Nic Crane * Neal Richardson Topics discussed * Compute IR proposal: There's been some discussion, check it out * CRAN resubmission, we have the fixes we need, will

Arrow sync call August 3 at 12:00 US/Eastern, 16:00 UTC

2021-08-03 Thread Jonathan Keane
Hello everyone, Our biweekly sync call is tomorrow (3 August) at 12:00 noon Eastern time. For today's call, let's please us this Google Meet URL (different from the usual one): https://meet.google.com/vbq-yufg-zwr?authuser=0 All are welcome to join. Notes will be shared with the mailing list aft

[Discuss] If and how we should integrate geospatial data (specs) in Arrow

2021-06-25 Thread Jonathan Keane
Hello, There is an emerging spec[1] for how to store geospatial data in Arrow + pass through parquet files in the geopandas world. There is even a new R package that implements a wrapper to do the same in R[2]. These both define a serialization[3] for storing geospatial data as an Arrow table (and

Re: [VOTE] Clarify meaning of timestamp without time zone to equal the concept of "LocalDateTime"

2021-06-25 Thread Jonathan Keane
+1 -Jon On Fri, Jun 25, 2021 at 5:30 AM Rok Mihevc wrote: > > +1 (non-binding) > > On Fri, Jun 25, 2021 at 11:21 AM Eduardo Ponce wrote: > > > +1 (non-binding) > > > > On Fri, Jun 25, 2021 at 4:31 AM Joris Peeters > > wrote: > > > > > +1 > > > > > > On Fri, Jun 25, 2021 at 9:29 AM Joris Van de

Re: [C++][Discuss] Switch to C++17

2021-06-11 Thread Jonathan Keane
ting the "correct" metrics > > * designing the scale rating > > * How do we get the community to provide their opinion for the qualitative > > metrics? What is a "good enough" coverage? > > * How do we summarize the results into a binary decision: up

Re: [C++][Discuss] Switch to C++17

2021-06-08 Thread Jonathan Keane
I've been digging a bit to try and put numbers on those users the Neal mentions. Specifically, we know that requiring C++17 will mean that R users on windows using versions of R before 4.0.0 will not be able to compile/install arrow. Although R version 3.6 is no longer supported by CRAN [1], many p

Re: [NIGHTLY] Arrow Build Report for Job nightly-2021-06-06-0

2021-06-07 Thread Jonathan Keane
Yes, I absolutely agree that more triaging, visibility, and info into these would be massively helpful for tracking some of these down. The conda-osx-py* builds seem to all be related to this LLVM mismatch https://issues.apache.org/jira/browse/ARROW-12738 which I've clarified more on that ticket.

Re: Moving automated nightly build e-mails to a separate mailing list

2021-05-24 Thread Jonathan Keane
I also very much agree with all of the sentiments above. One of the things that I'm hoping this new site/dashboard/whatever we come up with will have is some more information / context around the failures that hopefully will help make them less overwhelming and have a higher signal to noise ratio.

Re: Nightly Builds Repors 2021-05-17

2021-05-18 Thread Jonathan Keane
Thanks for the comments + tickets Krisztián all of those sound like good enhancements to this process. On the point of: >> Error type: Internal > I find it really useful to categorize the errors, especially if we > have an error out of our direct reach. > I can't think of an easy way to automate

Re: String reverse kernel

2021-05-17 Thread Jonathan Keane
Yeah, piggybacking on what Weston said: is the line that we want to draw is code point, combining character sequences, or graphemes [1]. IME, most people would want/assume that combining characters would stay combined in reversals (using Weston's example: "tréma" becoming "aḿert" (though this spe

Re: [VOTE] Release Apache Arrow 4.0.0 - RC3

2021-04-22 Thread Jonathan Keane
+1 (non-binding) Verified wheels, sources, and binaries on macOS 11.2 using the verification script (except for Java Integration, Glib, and Ruby). Like Antoine I ran into the same issue with Ruby. I also installed Arrow and the R package locally + ran some adhoc tests using some of our benchmarks

Re: [VOTE] Release Apache Arrow 4.0.0 - RC1

2021-04-20 Thread Jonathan Keane
gt; > > > I'm not sure if it is blocking (and it might even be expected given > > > the current status of jfrog) but I attempted to install the CentOS 7 > > > RPM and got the following error when I ran `sudo yum update` after > > > installing the arrow repo rpm.

Re: [VOTE] Release Apache Arrow 4.0.0 - RC1

2021-04-20 Thread Jonathan Keane
I'm still working on my verification, but as part of that noticed that https://issues.apache.org/jira/browse/ARROW-12316 which we thought changed the default memory allocator didn't fully accomplish that. Nothing is broken per se, but jemalloc is still the default on macOS. I've made https://issues

Re: Setting Affects Version in Arrow Jira bug issues

2021-04-07 Thread Jonathan Keane
I think this proposal is great and will help a lot when scanning through Jira issues. I wonder if it's possible to automate this? I'm thinking something along the lines of: If it's a Type = Bug, could have a yes/no or checkbox where we ask "is this a bug reproducible in the most recent arrow relea

Re: Arrow sync call March 31 at 12:00 US/Eastern, 16:00 UTC

2021-03-31 Thread Jonathan Keane
Thank you everyone who attended, here are the notes. Attendees: Jonathan Keane Colin Alworth David Sanders Micah Kornfield Rok Mihevc Projjal Chanda Eduardo Ponce Krill Lykov Discussion: - 4.0 release - zstd compression for the java library (has PR that is approved but

Re: Arrow sync call March 31 at 12:00 US/Eastern, 16:00 UTC

2021-03-31 Thread Jonathan Keane
I'm experiencing the same here. On Wed, Mar 31, 2021 at 11:06 AM Kirill Lykov wrote: > Hi, > > I don't know about the others but I cannot join because someone needs to > let me in. > Might be it the problem also for other people? > > On Tue, Mar 30, 2021 at 5:53 PM Neal Richardson < > neal.p.ric

[jira] [Created] (ARROW-8734) [R] Compilation error on macOS

2020-05-07 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-8734: - Summary: [R] Compilation error on macOS Key: ARROW-8734 URL: https://issues.apache.org/jira/browse/ARROW-8734 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-8726) segfault with a mis-specified partition

2020-05-06 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-8726: - Summary: segfault with a mis-specified partition Key: ARROW-8726 URL: https://issues.apache.org/jira/browse/ARROW-8726 Project: Apache Arrow Issue Type