In the dev meeting today, folks suggested grabbing the code that's being
used in the dashboard and looking at creating an LLM summary of the dev
mailing list to use as a starting point.  I extracted threads from May to
now, and added in a previous report as a rough reference for tone/structure
etc.  I won't paste any of this into the document as I'm unsure how useful
it is - if other folks could give some input here, that would be helpful.

# Project Activity:

Selected subproject updates:
## Community & Ecosystem

The community is discussing a proposal to canonicalize the variant
extension type across Arrow implementations. This would improve
cross-language interoperability for mixed-type columns (e.g., from
Parquet). Contributors are weighing whether Arrow should define and
maintain a standard set of extension types going forward.

We’ve also seen strong uptake of the Kapa.ai documentation assistant (~850
queries/month), which is helping users discover relevant docs and resolve
common questions without needing to open issues or join mailing lists.

There is renewed interest in community health tooling—such as dashboards
tracking stale issues and unmaintained components—driven by a desire to
improve contributor visibility and sustainability.
## GPU Support

A new arrow-gpu backend has been proposed for donation. Built on the wgpu
abstraction layer, it aims to provide vendor-neutral GPU acceleration
across compute kernels. The community is evaluating how this could
integrate into Arrow’s compute model without introducing unnecessary
complexity.
## Feather Format

Feather V1 support has been deprecated in the C++ implementation, in line
with the project's goal of simplifying its scope. Contributors agreed to
remove both write and read support, though there is some discussion around
retaining read-only fallback for legacy data.
## Intervals and Standards Alignment

The Iceberg community has proposed a new 128-bit ANSI SQL-compatible
interval type. Arrow contributors are considering whether this should be
adopted natively or handled as an extension type. This work is part of
broader efforts to ensure Arrow remains compatible with evolving lakehouse
and SQL engine standards.
## Skyhook

Skyhook has been removed from the Arrow monorepo and moved to its own
repository. This aligns with our strategy of offloading niche components to
simplify the core project and reduce maintenance burden.
## CI & Infrastructure

Thanks to newly granted AWS credits, we are expanding our CI and
benchmarking infrastructure. This includes adding GPU runners, improving
performance tracking, and supporting larger-memory workloads. These
improvements are expected to improve release robustness and enable
performance monitoring across languages.
## Benchmarks

Several contributors are working to make benchmarking more visible and
comparable across C++, Python, and R. This includes discussions around
standardized benchmark suites and dashboards to detect regressions
earlier—motivated in part by recent performance issues observed in Arrow R
19.x and 20.x.

On Wed, 2 Jul 2025 at 16:50, Neal Richardson <neal.p.richard...@gmail.com>
wrote:

> Hi all,
> It's time for our quarterly ASF board update again. I haven't gotten the
> reminder email yet but know it is coming (whenever the July meeting is),
> and to give everyone enough time to provide input on the document, I wanted
> to get started. Please have a look at [1] and add any updates you'd like to
> include. I've filled in the basic facts to get us started.
>
> Thanks,
> Neal
>
> [1]:
>
> https://docs.google.com/document/d/1fSYEmfGtmzjt1prHkdRoyxwGMmcX3onAStWArfiSBis/edit?tab=t.0
>

Reply via email to