Re: [FlightSQL] [Go] Starting a FlightSQL client for Go

2022-06-06 Thread Matt Topol
assistance). Thanks for taking this on! --Matt On Mon, Jun 6, 2022 at 2:44 PM Jacob Marble wrote: > (cc Matt Topol aka zeroshade) > > Good morning, > > I plan to start working on the FlightSQL client for Golang. My first task > is to implement DenseUnion, as it is missing from the G

[Go][Release][Discussion] Patch release for Go libraries to address CVE-2022-28948

2022-06-10 Thread Matt Topol
I've merged the PR to master and want to propose cherry-picking it to create patch releases. Technically, for Go, all we need to do is create the appropriate tags named like "go/v6.0.2", and so on. Since this vulnerability only affects Go we don't necessarily need to release patches for the other l

Re: [Go][Release][Discussion] Patch release for Go libraries to address CVE-2022-28948

2022-06-13 Thread Matt Topol
on] Patch release for Go libraries to address > CVE-2022-28948" on Fri, 10 Jun 2022 11:43:26 -0400, > Neal Richardson wrote: > > > Personally, I don't have a problem with doing `git tag` just for Go. I > > don't think this needs a full patch release process

Re: [ANNOUNCE] New Arrow committers: Dewey Dunnington, Alenka Frim, and Rok Mihevc

2022-06-22 Thread Matt Topol
Congratulations!! Welcome! On Wed, Jun 22, 2022, 6:40 PM David Li wrote: > Congratulations Alenka, Dewey, and Rok! > > On Wed, Jun 22, 2022, at 14:55, L. C. Hsieh wrote: > > Congratulations! > > > > On Wed, Jun 22, 2022 at 11:28 AM Antoine Pitrou > wrote: > >> > >> > >> Welcome to our new commi

Re: [FlightSQL] [Go] Starting a FlightSQL client for Go

2022-07-06 Thread Matt Topol
for your support. I'll certainly look closely at DenseUnion and > SparseUnion before implementing. I mentioned DenseUnion only because it is > explicitly named in the FlightSQL proto definition, so it appears to be on > the critical path to FlightSQL. > > On Mon, Jun 6, 2022

Re: [VOTE] Release Apache Arrow 7.0.1 - RC0

2022-07-18 Thread Matt Topol
As long as running `go install` / `go build` works when run against go version 1.16, which as far as I'm aware is the version that is run for our current CI, then there shouldn't be any issues with the other extraneous package version bumps. In my opinion, it's fine to release as-is. --Matt On M

Re: [DISCUSS][Format] Starting to do some concrete work on the new "StringView" columnar data type

2022-07-31 Thread Matt Topol
Hey Wes, The StringView/ByteView structure presents some complications with Go in memory handling (particularly with the C ABI) so I added a comment on the RFC outlining this. TL;DR: Garbage Collection makes it difficult to handle because it's not maintaining a properly typed pointer. I'm gonna

Re: [ANNOUNCE] New Arrow PMC member: L. C. Hsieh

2022-09-04 Thread Matt Topol
Congrats!! On Sun, Sep 4, 2022, 8:30 AM Kazuaki Ishizaki wrote: > Congrats, Liang-Chi. > > Kazuaki Ishizaki > > 2022/09/04 20:54 に、"David Li" を書き込みました: > > Congrats & welcome Liang-Chi! > > On Sun, Sep 4, 2022, at 06:22, Andrew Lamb wrote: > > Congratulations L. C. -- very well des

Re: [ANNOUNCE] New Arrow PMC member: Weston Pace

2022-09-05 Thread Matt Topol
Congrats Weston!! On Mon, Sep 5, 2022, 11:01 AM QP Hou wrote: > Congrats Weston! > > On Mon, Sep 5, 2022 at 9:38 AM Yaron Gvili wrote: > > > Congratulations Weston! > > > > From: Raul Cumplido Dominguez > > Sent: Monday, September 5, 2022 10:04 AM > > To: dev@a

Re: [ANNOUNCE] New Arrow committer: Yanghong Zhong

2022-09-08 Thread Matt Topol
Congrats and welcome!! On Thu, Sep 8, 2022, 10:32 PM Vibhatha Abeykoon wrote: > Congratulations! > > On Fri, Sep 9, 2022 at 7:07 AM Weston Pace wrote: > > > Congratulations! > > > > On Thu, Sep 8, 2022 at 8:32 AM David Li wrote: > > > > > > Congrats & welcome, Yanghong! > > > > > > On Thu, Sep

Re: PRs for RLE support

2022-09-15 Thread Matt Topol
IMHO I think it's worth parameterizing for the 16/32-bit case. Despite it being nice to be able to just assume it's a 32bit signed int in terms of code simplicity, I think it would be a good benefit for memory usage of RLE arrays. That said I don't have anything to back that up as I don't regularl

Re: RLE array slicing

2022-09-15 Thread Matt Topol
> why would the run ends and values have the same offset? That's why I liked the idea of the children arrays and having the parent offset being a "logical offset" and children being "physical offsets" because it maintains the independence of the arrays. Slicing the RLE is simply just setting the l

Re: [DISCUSS] Apache Iceberg / Apache Hudi support in Arrow

2022-10-03 Thread Matt Topol
I wanted to chime in that a current long term goal I am working towards is a Golang iceberg implementation that will also integrate with the Golang Arrow modules. I'm not sure how much desire there is for it, but I do know at least two consumers that would greatly benefit from it. But, at least fo

Re: [DISCUSS] Maintenance policy

2022-10-19 Thread Matt Topol
I agree with Will's suggestion, and I also suggest that another type we should always backport would be any identified security vulnerability or issue. --Matt On Wed, Oct 19, 2022 at 1:48 PM Will Jones wrote: > One particular type of defect we might want to consider backporting to > supported v

Re: Go: pqarrow (github.com/apache/arrow/go/v9/parquet/pqarrow) cannot handle arrow's DICTIONARY field

2022-10-20 Thread Matt Topol
Hey Zhonghao! I'm glad you're getting good use out of the Arrow and Parquet Go implementations! You're absolutely right, pqarrow does not currently support Dictionary Arrow arrays. Support for Dictionary Arrays in the Go Arrow implementation was developed *after* the pqarrow package was created, s

Re: [VOTE] Release Apache Arrow 10.0.0 - RC0

2022-10-24 Thread Matt Topol
@kou What version of Go is your system Go? Also, can you go to the `go` subdirectory of the repo and run `go install ./...` then try again? Running that `go install ./...` command should build the binaries that were missing. Unless you have the environment variable `GOBIN` set, it will place the bu

Re: [DISCUSS] Migrating away from Travis-CI

2022-10-24 Thread Matt Topol
I'd prefer not to remove them as there are definitely known users of both architectures for the Golang libraries. Is CircleCI an option? On Mon, Oct 24, 2022 at 9:13 PM Sutou Kouhei wrote: > Hi, > > https://cwiki.apache.org/confluence/display/INFRA/Travis+Migrations > > > On November 2nd, 2020,

Re: [VOTE] Release Apache Arrow 10.0.0 - RC0

2022-10-25 Thread Matt Topol
verify-release-candidate.sh doesn't output any files outside > $ARROW_TMPDIR. > > > Thanks, > -- > kou > > In > "Re: [VOTE] Release Apache Arrow 10.0.0 - RC0" on Mon, 24 Oct 2022 > 19:58:40 -0400, > Matt Topol wrote: > > > @kou What ver

Re: [ANNOUNCE] New Arrow PMC member: Nicola Crane

2022-10-25 Thread Matt Topol
Congrats!! On Tue, Oct 25, 2022 at 5:06 PM Sutou Kouhei wrote: > The Project Management Committee (PMC) for Apache Arrow has invited > Nicola Crane to become a PMC member and we are pleased to announce > that Nicola Crane has accepted. > > Congratulations and welcome! >

Re: [VOTE] Move issue tracking to GitHub Issues

2022-10-27 Thread Matt Topol
+1 (Non-binding) On Thu, Oct 27, 2022, 4:19 PM Joris Van den Bossche < jorisvandenboss...@gmail.com> wrote: > +1 > > On Thu, 27 Oct 2022 at 07:27, Jacob Quinn wrote: > > > > +1 > > > > On Wed, Oct 26, 2022 at 5:04 PM Neal Richardson < > neal.p.richard...@gmail.com> > > wrote: > > > > > I propose

Re: [ANNOUNCE] New Arrow committer: Will Jones

2022-10-27 Thread Matt Topol
Congrats Will! On Thu, Oct 27, 2022 at 9:02 PM Ian Cook wrote: > Congratulations Will! > > On Thu, Oct 27, 2022 at 19:56 Sutou Kouhei wrote: > > > On behalf of the Arrow PMC, I'm happy to announce that Will Jones > > has accepted an invitation to become a committer on Apache > > Arrow. Welcome,

Proposal for C Data Interface Integration testing

2022-10-28 Thread Matt Topol
Hey all! Given various discussions about our testing and the C Data Interface, I've put together a small proposal for developing the integration testing for the C Data interface. For the most part it's an extension of Archery and the way Archery's integration testing works. I've put it into a Goog

Re: Proposal for C Data Interface Integration testing

2022-11-04 Thread Matt Topol
"lazy consensus" and stick this on my list of stuff to try implementing and eventually put a PR up to try to hook up something small based on the proposal. Thanks everyone! --Matt On Fri, Oct 28, 2022 at 5:02 PM Matt Topol wrote: > Hey all! Given various discussions about our t

Request for Patch release of 10.0.1

2022-11-08 Thread Matt Topol
Hey all, On JIRA[1] there was a request by Laurent who is working on the Open Telemetry Beta using Arrow as their transport to release a fix in a patch release as v10.0.1. I've opened up a draft PR[2] which cherry-picks the change onto the maint-10.0.0 branch as a preliminary step to getting feedb

Re: Request for Patch release of 10.0.1

2022-11-08 Thread Matt Topol
to ensure the fix for it gets > cherry-picked on to the 10.0.x maintenance/release branch (no need to do it > manually in a PR for a specific JIRA) > > Joris > > On Tue, 8 Nov 2022 at 16:25, Matt Topol wrote: > > > Hey all, > > > > On JIRA[1] there was a r

Re: [ANNOUNCE] New Arrow committer: Raúl Cumplido

2022-12-06 Thread Matt Topol
Congrats Raúl!! On Tue, Dec 6, 2022 at 9:53 AM Dewey Dunnington wrote: > Congrats! Welcome! > > On Tue, Dec 6, 2022 at 10:35 AM Larry White wrote: > > > Congrats, Raúl! > > > > On Tue, Dec 6, 2022 at 9:20 AM David Li wrote: > > > > > Welcome Raúl! > > > > > > On Tue, Dec 6, 2022, at 08:41, Nea

[VOTE] Add RLE Arrays to Arrow Format

2022-12-14 Thread Matt Topol
Hello, I'd like to propose adding the RLE type based on earlier discussions[1][2] to the Arrow format: - Columnar Format description: https://github.com/apache/arrow/pull/1/files#diff-8b68cf6859e881f2357f5df64bb073135d7ff6eeb51f116418660b3856564c60 - Flatbuffers changes: https://github.com/apa

Re: [VOTE] Add RLE Arrays to Arrow Format

2022-12-14 Thread Matt Topol
ng (which I think is a more serious problem in > practice). > > I believe that we should either: > (a) rename this to "run-end encoding" > (b) change this to a parameterized type called "run encoding" that > takes a Boolean parameter specifying whether run le

Re: Request for Patch release of 10.0.2

2022-12-15 Thread Matt Topol
following > two > >> PRs: > >> - https://github.com/apache/arrow/pull/14892 > >> - https://github.com/apache/arrow/pull/14904 > >> > >> The first one fixes a memory leak issue when the built-in compression is > >> enabled. > >> The o

Re: Request for Patch release of 10.0.2

2022-12-15 Thread Matt Topol
n Thu, Dec 15, 2022 at 11:47 AM Neal Richardson < neal.p.richard...@gmail.com> wrote: > I don't see a problem cherry-picking commits to the maintenance > branch--seems like that's what it should be for, right? > > Neal > > On Thu, Dec 15, 2022 at 11:17 AM Matt T

Re: [ANNOUNCE] New Arrow committer: Jacob Wujciak

2022-12-15 Thread Matt Topol
Congrats Jacob!! On Thu, Dec 15, 2022, 7:53 PM Neal Richardson wrote: > Congrats! > > On Thu, Dec 15, 2022 at 7:00 PM Ian Cook wrote: > > > Herzlichen Glückwunsch, Jacob! > > > > On Thu, Dec 15, 2022 at 6:56 PM Rok Mihevc wrote: > > > > > > Congrats Jacob!! > > > > > > Rok > > > > > > On Fri,

Re: [DISC] Self-Hosted Runners for Arrow

2022-12-16 Thread Matt Topol
These are awesome! Has there been any luck in reaching out to IBM to see if they could donate one or more s390x VMs to use as runners for testing the s390x builds? That is probably my only concern with Travis going away at EOY, since we don't have a way currently to test those builds on GH Actions.

Re: Request for Patch release of 10.0.2

2022-12-16 Thread Matt Topol
> On Thu, Dec 15, 2022 at 9:51 AM Matt Topol wrote: > > > I've created a PR for the cherry-picked changes here: > > https://github.com/apache/arrow/pull/14980 > > > > @Kou or @Neal could one of you take a look and approve the PR before I > > merge it? It fe

Re: [VOTE] Add RLE Arrays to Arrow Format

2022-12-19 Thread Matt Topol
irectly into such an array and avoid unpacking. RLE encoded dictionary > are > > quite compelling) > > > > I would prefer to use the term Run-End-Encoding (which would also follow > > the naming of the internal fields) but I don't view that as a deal > blocker. >

Re: [ANNOUNCE] New Arrow PMC chair: Andrew Lamb

2022-12-26 Thread Matt Topol
Congrats!!! On Mon, Dec 26, 2022, 12:47 PM Jacob Wujciak wrote: > Congratulations Andrew! > > Matthew Turner schrieb am Mo., 26. Dez. > 2022, 16:44: > > > Congratulations, Andrew! > > > > From: Yijie Shen > > Date: Monday, December 26, 2022 at 8:14 AM > > To: dev@arrow.apache.org > > Subject:

Re: [VOTE] Release Apache Arrow ADBC 0.1.0 - RC6

2022-12-27 Thread Matt Topol
Will the updated scripts also create the proper GitHub tag for a Go release? On Tue, Dec 27, 2022, 5:11 PM Will Jones wrote: > Thanks for the info Kou. Then I am +1 (non-binding). > > On Tue, Dec 27, 2022 at 2:09 PM Sutou Kouhei wrote: > > > Hi, > > > > > On MacOS M1, the debian package tests f

Re: [VOTE] Release Apache Arrow ADBC 0.1.0 - RC6

2022-12-27 Thread Matt Topol
Awesome, then +1 (Non-binding) :) On Tue, Dec 27, 2022, 7:03 PM David Li wrote: > The tag is created as part of the post release process (IIRC this is the > same as the main Arrow process) > > On Tue, Dec 27, 2022, at 17:29, Matt Topol wrote: > > Will the updated scripts also

Re: [Monorepo] Add labels breaking-change and critical-fix

2023-01-06 Thread Matt Topol
I'm extremely in favor of both of these labels for the reasons you state Will. It would be great to see us shift towards being able to do minor releases and not *always* having to do a major version release. --Matt On Fri, Jan 6, 2023 at 12:14 PM Micah Kornfield wrote: > These sounds good to me

Re: [VOTE] Add RLE Arrays to Arrow Format

2023-01-09 Thread Matt Topol
Thanks Antoine! I'll go respond to your comments now! On Mon, Jan 9, 2023 at 11:01 AM Antoine Pitrou wrote: > > I've commented on the PR. I'm +1 on the principle and on the proposed > format / layout additions. > > Regards > > Antoine. > > > Le 14/12

DISCUSS: ADBC More Canonical Options

2023-01-11 Thread Matt Topol
Hey all, I've filed a PR with ADBC (https://github.com/apache/arrow-adbc/pull/316) to add some more explicitly defined canonical options. This then leads the an interesting question that should be posed: For changes like this in general along with other potential updates, should we do a series of

Re: [FlightSQL] servers / client reference implementations supporting parameterized statements

2023-02-01 Thread Matt Topol
To this point, the Go flightsql sqlite server example is used to test the Parameter Support for the ADBC flightsql driver: - CI: https://github.com/apache/arrow-adbc/blob/main/.github/workflows/native-unix.yml#L293 - Dockerfile to run SQLite flightsql server: https://github.com/apache/arrow-adbc/

Re: [VOTE] Release Apache Arrow ADBC 0.2.0 - RC1

2023-02-10 Thread Matt Topol
Ran into a problem running on Pop!_OS 22.04 LTS/amd64: TEST_APT=0 USE_CONDA=1 ./dev/release/verify-release-candidate.sh 0.2.0 1 > Downloading and Extracting Packages > > Preparing transaction: done > Verifying transaction: done > Executing transaction: done > Bundler version 2.3.7 > The Meson bui

Re: [ANNOUNCE] New Arrow PMC member: Will Jones

2023-03-13 Thread Matt Topol
Congrats Will! On Mon, Mar 13, 2023, 2:02 PM Jacob Wujciak wrote: > Congratulations Will, well deserved! > > On Mon, Mar 13, 2023 at 6:58 PM Andrew Lamb wrote: > > > The Project Management Committee (PMC) for Apache Arrow has invited > > Will Jones to become a PMC member and we are pleased to a

Re: [VOTE] Release Apache Arrow ADBC 0.3.0 - RC1

2023-03-17 Thread Matt Topol
+1 (non-binding) I successfully ran the following on Pop!_OS 22.04 LTS USE_CONDA=1 ./dev/release-verify-release-candidate.sh 0.3.0 1 On Fri, Mar 17, 2023 at 12:01 PM Raúl Cumplido wrote: > +1 (non-binding) > > I have run successfully the following on Ubuntu 22.04: > USE_CONDA=1 ./dev/release/

Re: OpenTelemetry + Arrow

2023-03-30 Thread Matt Topol
+1 (non -binding) I'm glad others on here are finding this as useful and interesting as I did. Great job Laurent! --Matt On Thu, Mar 30, 2023, 3:26 PM Raphael Taylor-Davies wrote: > Hi Laurent, > > I gave the first blog post a read and I also really like it and would be > +1 on publishing it,

[DISCUSSION] C-Data API for Non-CPU Use Cases

2023-04-07 Thread Matt Topol
Hey all, In order to facilitate the propagation of use cases that want to pass data allocated on non-cpu devices around between environments (like between Python and C++) we should enhance the C-Data API to account for passing memory and device information alongside the arrays themselves. In this

Re: [DISCUSSION] C-Data API for Non-CPU Use Cases

2023-04-10 Thread Matt Topol
> The ArrowArray struct is not allowed to change, as it would break the ABI: https://arrow.apache.org/docs/format/CDataInterface.html#updating-this-specification I was referring more to the future case where we might need to introduce an `ArrowArrayV2` or something similar precisely because the Ar

Re: [DISCUSSION] C-Data API for Non-CPU Use Cases

2023-04-10 Thread Matt Topol
> * how will updates to dlpack be mirrored into arrow? > * instead of just repeating/vendoring the enum can we simply refer to it > and treat this as an opaque integer?) > 2. Providing an example of how you can tag arrays with metadata > > > > On Mon, Apr 10, 2023 at

Re: [CROWDSOURCING] Apache Arrow Board Report - April 12, 2023

2023-04-11 Thread Matt Topol
My apologies, I forgot to add updates for the Go section previously, I've added to the Google doc now for the Go updates. On Tue, Apr 11, 2023 at 9:29 AM Andrew Lamb wrote: > As a reminder, I will submit the ASF board report [1] tomorrow summarizing > the state of the project. Thank you to ever

Re: [ANNOUNCE] New Arrow committer: Ruihang Xia

2023-04-11 Thread Matt Topol
Congrats!! Welcome! On Tue, Apr 11, 2023, 11:29 PM Jacob Wujciak wrote: > Congratulations and welcome! > > On Mon, Apr 10, 2023 at 8:13 AM Wang Xudong > wrote: > > > Congratulations! > > > > Yang Jiang 于2023年4月10日周一 13:37写道: > > > > > > > > Congratulations !!! > > > > > > On 2023/04/09 11:25:1

Re: [DISCUSS] Migrate s390x from Travis to ASF Jenkins

2023-04-20 Thread Matt Topol
I just wanted to add on that there was a Go on s390x job too that needs to get migrated and wasn't on the list in Raul's original email. On Thu, Apr 20, 2023 at 2:42 PM Benson Muite wrote: > Might also consider testing farm for Centos Stream, Fedora and/or RHEL > builds[1][2]. > > 1) https://doc

Re: [DISCUSS] Migrate s390x from Travis to ASF Jenkins

2023-04-20 Thread Matt Topol
big endian is, of course, a choice, > although I imagine it would be more work/require more input to do so > than to migrate a CI job. > > I use Arrow on s380x, although it's a bit of circular logic because > I'm using it to make sure nanoarrow works on big endian. > >

Re: [VOTE] Release Apache Arrow 12.0.0 - RC0

2023-04-24 Thread Matt Topol
@Jacob I'm currently seeing if i can replicate the Majaro failure you found via a docker image for manjaro. I'll report back If I am and what I figure out. On Mon, Apr 24, 2023 at 3:12 PM Raúl Cumplido wrote: > El lun, 24 abr 2023 a las 18:53, Will Jones > () escribió: > > > > I'm seeing failing

Re: [VOTE] Release Apache Arrow 12.0.0 - RC0

2023-04-24 Thread Matt Topol
some reason that would be the cause of this failure. Though looking through the cmake modules, I don't see why it would be requiring LLVM-16 and discounting 15.0.7, so I'm not sure what's going on yet. I'll try to dig a bit and see if i can come up with something. On Mon, Apr

Re: [VOTE] Release Apache Arrow 12.0.0 - RC0

2023-04-24 Thread Matt Topol
CMake error log.) > > > Thanks, > -- > kou > > In > "Re: [VOTE] Release Apache Arrow 12.0.0 - RC0" on Mon, 24 Apr 2023 > 20:08:59 -0400, > Matt Topol wrote: > > > I was able to replicate the same llvm issue that Jacob saw, does v12 make > > l

Re: [VOTE] Formalize how to change format

2023-04-26 Thread Matt Topol
+1 (Non-binding) On Wed, Apr 26, 2023 at 5:16 AM Joris Van den Bossche < jorisvandenboss...@gmail.com> wrote: > +1 > > On Wed, 26 Apr 2023 at 04:18, Sutou Kouhei wrote: > > > > Hi, > > > > I've added one more note about documentation: > > > > We must update the corresponding documentation (fil

[WEBSITE] [DISCUSS] Arrow-Site blog post

2023-04-28 Thread Matt Topol
Hey All, Yevgeny Pats has contributed a blog post to the Arrow Site via PR[1]. detailing his company's usage of Arrow for their type system. I've reviewed it and it looks good to me, but as I'm not a PMC member I didn't want to go merging it and having it get published without input from others fi

Re: [VOTE] Release Apache Arrow ADBC 0.4.0 - RC0

2023-05-10 Thread Matt Topol
Using a manjaro linux image (in honor of the issues we found for Arrow v12 rc) I ran: USE_CONDA=1 ./dev/release/verify-release-candidate.sh 0.4.0 0 My first attempt failed because the default base image doesn't have make and such installed. should we install that via conda too since we install the

Re: [ANNOUNCE] New Arrow committer: Marco Neumann

2023-05-11 Thread Matt Topol
Congrats Marco! On Thu, May 11, 2023 at 9:18 AM Joris Van den Bossche < jorisvandenboss...@gmail.com> wrote: > Congrats Marco! > > On Thu, 11 May 2023 at 15:05, Weston Pace wrote: > > > > Congratulations! > > > > On Thu, May 11, 2023 at 4:28 AM vin jake wrote: > > > > > Congratulations Marco! >

Re: [Go] Scalar Question

2023-05-11 Thread Matt Topol
I don't know how many people are using the scalar package directly, but I'm definitely open to chatting about refactoring it. On Thu, May 11, 2023, 10:35 AM Yevgeny Pats wrote: > Hi Folks, > > I'm curious if anyone here is using the Go scalar >

Re: [DISCUSSION] C-Data API for Non-CPU Use Cases

2023-05-17 Thread Matt Topol
gt; > > > > > There are libraries today that have encountered this problem and have > > > proposed similar workarounds. > > > > > > * The changes to the stream interface are more than just "metadata" > > > > > > I did not look clo

Re: [DISCUSS] Interest in a 12.0.1 patch?

2023-05-18 Thread Matt Topol
I think it's worthwhile enough to justify the work for the patch. If we do end up doing the patch, then we should also include this [1] change for the Go side which, while significant, I didn't believe to be significant enough to warrant a patch on its own. But it is definitely a good idea to inclu

[VOTE][Format] Add experimental ArrowDeviceArray to C-Data API

2023-05-22 Thread Matt Topol
Hello, Now that there's a rough consensus and a toy example POC[1], I would like to propose an official enhancement to the Arrow C-Data API specification as described in the PR[2]. The new ArrowDeviceArray/ArrowDeviceArrayStream structs would be considered "experimental" and the documentation woul

Re: [VOTE][Format] Add experimental ArrowDeviceArray to C-Data API

2023-05-23 Thread Matt Topol
> > > > > > The C declarations seem fine to me (I'm a bit lukewarm on the reserved > > > bits, but I understand the motivation), however I've posted comments as > > > to how to document the interface. The current PR entirely lacks a prose > > >

Re: [VOTE][Format] Add experimental ArrowDeviceArray to C-Data API

2023-05-26 Thread Matt Topol
t; >> > Regards > > >> > > > >> > Antoine. > > >> > > > >> > > > >> > Le 23/05/2023 à 16:32, Antoine Pitrou a écrit : > > >> > > > > >> > > Depends on what we're voti

Re: [VOTE][Format] Add experimental ArrowDeviceArray to C-Data API

2023-06-05 Thread Matt Topol
solicit input from > > other stakeholder communities. > > > > On Mon, May 22, 2023 at 12:02 PM Matt Topol > wrote: > > > > > Hello, > > > > > > Now that there's a rough consensus and a toy example POC[1], I would > like > &g

Re: [ANNOUNCE] New Arrow committer: Mehmet Ozan Kabak

2023-06-08 Thread Matt Topol
Congrats! Welcome Ozan! On Thu, Jun 8, 2023 at 8:53 AM Raúl Cumplido wrote: > Congratulations and welcome! > > El jue, 8 jun 2023 a las 14:45, Metehan Yıldırım > () escribió: > > > > Congrats Ozan! > > > > On Thu, Jun 8, 2023 at 1:09 PM Andrew Lamb wrote: > > > > > On behalf of the Arrow PMC, I

Re: [ANNOUNCE] New Arrow PMC member: Jie Wen (jakevin / jackwener)

2023-06-12 Thread Matt Topol
Congrats Jie! On Sun, Jun 11, 2023 at 9:20 AM Andrew Lamb wrote: > The Project Management Committee (PMC) for Apache Arrow has invited > Jie Wen to become a PMC member and we are pleased to announce > that Jie Wen has accepted. > > Congratulations and welcome! >

Re: [DISCUSS][Format] Draft implementation of string view array format

2023-06-15 Thread Matt Topol
Based on my understanding, in theory a buffer *could* be shared within a batch since the flatbuffers message just uses an offset and length to identify the buffers. That said, I don't believe any current implementation actually does this or takes advantage of this in any meaningful way. --Matt O

Re: [VOTE] Release Apache Arrow ADBC 0.5.0 - RC0

2023-06-19 Thread Matt Topol
+1 Tested on Pop!_Os (Ubuntu 22.04) x86_64 On Mon, Jun 19, 2023, 10:55 AM Jacob Wujciak-Jens wrote: > +1 (nb) with conda on ubuntu > > On Mon, Jun 19, 2023 at 2:18 PM David Li wrote: > > > My vote: +1 (Ubuntu Linux 20.04/x86_64) > > > > On Fri, Jun 16, 2023, at 05:24, Raúl Cumplido wrote: > >

Re: [ANNOUNCE] New Arrow PMC member: Ben Baumgold,

2023-06-20 Thread Matt Topol
Congrats Ben! On Tue, Jun 20, 2023, 11:00 AM Weston Pace wrote: > Congratulations Ben! > > On Tue, Jun 20, 2023 at 7:38 AM Jacob Quinn > wrote: > > > Yay! Congrats Ben! Love to see more Julia folks here! > > > > -Jacob > > > > On Tue, Jun 20, 2023 at 4:15 AM Andrew Lamb > wrote: > > > > > The

[DISCUSS] ADBC 0.5.1 patch release?

2023-06-21 Thread Matt Topol
Given the upcoming Snowflake Summit talk on ADBC with the Snowflake driver, and potential deadlock condition addressed by [1], it might make sense for us to do a v0.5.1 patch release of ADBC. Unfortunately I only discovered the issue just as the voting for 0.5.0 closed and the release was finalize

Re: [DISCUSS][Format][Flight] Result set expiration support

2023-06-22 Thread Matt Topol
> That said, I think it's reasonable to only have Cancel at the protocol level. I'd be in favor of only having Cancel too. In theory calling Cancel on something that has already completed should just be equivalent to calling Close anyways rather than requiring a client to guess and call Close if C

Re: [ANNOUNCE] New Arrow PMC member: Dewey Dunnington

2023-06-23 Thread Matt Topol
Congrats Dewey!! On Fri, Jun 23, 2023, 9:35 AM Dane Pitkin wrote: > Congrats Dewey! > > On Fri, Jun 23, 2023 at 9:15 AM Nic Crane wrote: > > > Well-deserved Dewey, congratulations! > > > > On Fri, 23 Jun 2023 at 11:53, Vibhatha Abeykoon > > wrote: > > > > > Congratulations Dewey! > > > > > > O

Re: [VOTE] Release Apache Arrow ADBC 0.5.1 - RC1

2023-06-23 Thread Matt Topol
+1 tested on Pop!_Os 22.04 with go 1.19 On Fri, Jun 23, 2023, 4:52 PM Sutou Kouhei wrote: > +1 > > I ran the following on Debian GNU/Linux sid: > > JAVA_HOME=/usr/lib/jvm/default-java \ > dev/release/verify-release-candidate.sh 0.5.1 1 > > with: > > * Python 3.11.4 > * g++ (Debian 12.3

Re: [VOTE][Format][Flight] Result set expiration support

2023-06-28 Thread Matt Topol
+1 Thanks kou! On Wed, Jun 28, 2023, 10:33 AM David Li wrote: > +1 > > Thanks Kou! > > On Tue, Jun 27, 2023, at 21:31, Sutou Kouhei wrote: > > +1 > > > > In <20230628.103017.2111667987485891680@clear-code.com> > > "[VOTE][Format][Flight] Result set expiration support" on Wed, 28 Jun > > 2

Re: [ANNOUNCE] New Arrow committer: Kevin Gurney

2023-07-04 Thread Matt Topol
Welcome! On Tue, Jul 4, 2023, 11:06 AM Joris Van den Bossche < jorisvandenboss...@gmail.com> wrote: > Congrats Kevin! > > On Tue, 4 Jul 2023 at 13:47, David Li wrote: > > > > Welcome Kevin! > > > > On Tue, Jul 4, 2023, at 05:55, Raúl Cumplido wrote: > > > Congratulations Kevin!!! > > > > > > El

Re: Do we need CODEOWNERS ?

2023-07-04 Thread Matt Topol
I've found it useful for me so far since it auto adds me on any Go related PRs so I don't need to sift through the notifications or active PRs, and instead can easily find them in my reviews on GitHub notifications. But if everyone else finds it more detrimental than helpful I can set up a custom

Re: [DISCUSS] Canonical alternative layout proposal

2023-07-13 Thread Matt Topol
I don't have much to add but I do want to second Jacob's comments. I agree that this is a good way to avoid the fragmentation while keeping Arrow relevant, and likely something we need to do so that we can ensure Arrow remains the way to do this data integration and interoperability. On Wed, Jul 1

Re: [QUESTION][BLOG] Contributing a Blog Post

2023-07-14 Thread Matt Topol
I think this would be a great idea! It's been great seeing various organizations posting on the Arrow blog and this would be a great contribution. Assuming that no one objects, you can contribute a PR to https://github.com/apache/arrow-site --Matt On Fri, Jul 14, 2023 at 10:17 AM Christopher Akik

Re: [DISCUSS][Format] Draft implementation of string view array format

2023-07-31 Thread Matt Topol
;>> still > >>>>>> contains data that was supposedly filtered out. My 2 cents is that > >>>> explicit > >>>>>> selection vectors are a less surprising way to defer selection than > >>>> baking > >>>>>&g

Re: [VOTE] Apache Arrow ADBC (API) 1.1.0

2023-08-14 Thread Matt Topol
will be open for at least 72 hours. > > > > [ ] +1 Adopt the ADBC 1.1.0 specification > > [ ] 0 > > [ ] -1 Do not adopt the specification because... > > > > Thanks to Sutou Kouhei, Matt Topol, Dewey Dunnington, Antoine Pitrou, > Will > > Ayd, a

Re: [Format] C data interface format string for run-end encoded arrays

2023-08-15 Thread Matt Topol
I'm in favor of this as the C Data format string. Though since this is technically a format/spec change do others think we should take a vote on this? --Matt On Tue, Aug 15, 2023, 12:19 PM Felipe Oliveira Carvalho wrote: > Hello, > > I'm writing to inform you that I'm proposing "+r" as format s

Re: [Format] C data interface format string for run-end encoded arrays

2023-08-15 Thread Matt Topol
Sounds good, I'll send out an email starting the vote On Tue, Aug 15, 2023 at 2:30 PM Antoine Pitrou wrote: > > I think we should. > > Regards > > Antoine. > > > Le 15/08/2023 à 19:58, Matt Topol a écrit : > > I'm in favor of this as the

[Vote][Format] C Data Interface Format string for REE

2023-08-16 Thread Matt Topol
Hey All, As proposed by Felipe [1] I'm starting a vote on the proposed update to the Format Spec of adding "+r" as the format string for passing Run-End Encoded arrays through the Arrow C Data Interface. A PR containing an update to the C++ Arrow implementation to add support for this format stri

Re: [Vote][Format] C Data Interface Format string for REE

2023-08-16 Thread Matt Topol
It would be nice to get approval from authors of other implementations > such as Rust, C#, Javascript... > > Thanks for doing this! > > > Le 16/08/2023 à 16:16, Matt Topol a écrit : > > Hey All, > > > > As proposed by Felipe [1] I'm starting a vote on the p

Re: [Vote][Format] C Data Interface Format string for REE

2023-08-22 Thread Matt Topol
wrote: > > > > +1 (binding) > > > > Cheers, > > > > -Jacob > > > > On Wed, Aug 16, 2023 at 8:16 AM Matt Topol > > > wrote: > > > > > Hey All, > > > > > > As proposed by Felipe [1] I'm starting a vote on

[VOTE] [Format] Add app_metadata to FlightInfo and FlightEndpoint

2023-09-12 Thread Matt Topol
Hey all, I would like to propose adding a new app_metadata field to both the FlightInfo and FlightEndpoint message types of the Arrow Flight protocol. There has been discussion of doing so for a while and has now been brought back up in regards to [1]. More specifically, this enables adding applic

Re: [VOTE] Release Apache Arrow Flight SQL adapter for PostgreSQL 0.1.0 - RC6

2023-09-12 Thread Matt Topol
+1 Though I ran into the same issue as David, but the verify script ran successfully On Tue, Sep 12, 2023 at 10:56 AM David Li wrote: > +1 > > Though, I couldn't figure out how to get run-postgresql.sh to work for my > setup (postgres installed via Conda), as initdb complained about the > arrow

Re: [VOTE] [Format] Add app_metadata to FlightInfo and FlightEndpoint

2023-09-12 Thread Matt Topol
e > implementation)? > > On Tue, Sep 12, 2023, at 11:48, Matt Topol wrote: > > Hey all, > > > > I would like to propose adding a new app_metadata field to both the > > FlightInfo and FlightEndpoint message types of the Arrow Flight protocol. > > There has been disc

Re: [VOTE] [Format] Add app_metadata to FlightInfo and FlightEndpoint

2023-09-14 Thread Matt Topol
The PR has been updated for a bit with both C++ and Go implementations, hopefully I can get some more votes on this thread? On Tue, Sep 12, 2023 at 12:16 PM Matt Topol wrote: > The C++ code gets auto-generated during build right? Ah, fair point the > C++ still uses it's own objects.

Re: [LAST CALL][DISCUSS] Unsigned integers in Utf8View

2023-09-20 Thread Matt Topol
Just to chime in (and add yet another voice into the mix here), I'd have a preference for it being signed integers for the same reasons as most everyone else: consistency with everything else in the spec. Since we use signed integers everywhere, I'd prefer to keep it consistent rather than introduc

Re: [DISCUSS][C++] Raw pointer string views

2023-09-26 Thread Matt Topol
I believe the motivation is to avoid the cost of the data copy that would have to happen to convert from a pointer based to offset based scenario. Allowing the pointer-based implementation will ensure that we can maintain zero-copy communication with both DuckDB and Velox in a common workflow scena

Re: [VOTE][Format] Add ListView and LargeListView Arrays to Arrow Format

2023-09-29 Thread Matt Topol
+1, thanks Felipe for your perseverance here! On Fri, Sep 29, 2023, 12:55 PM wish maple wrote: > +1 > > LGTM, thanks! > > Ian Cook 于2023年9月30日周六 00:49写道: > > > +1 (non-binding) > > > > Thanks very much Felipe for your persistence and your commitment to > > addressing the numerous questions and

Re: [VOTE][Format] Variable shape tensor canonical extension type

2023-09-29 Thread Matt Topol
+1 Thanks for all the work here! On Fri, Sep 29, 2023 at 11:04 AM Dewey Dunnington wrote: > +1! Thank you for iterating on this with all of us! > > On Fri, Sep 29, 2023 at 11:28 AM Alenka Frim > wrote: > > > > +1 > > Thanks for pushing this through! > > > > On Wed, Sep 27, 2023 at 2:44 PM Rok

Re: [VOTE][Format] Add ListView and LargeListView Arrays to Arrow Format

2023-10-02 Thread Matt Topol
+1 On Mon, Oct 2, 2023 at 8:54 AM Raphael Taylor-Davies wrote: > +1 > > On 02/10/2023 13:53, Antoine Pitrou wrote: > > > > Hello, > > > > +1 and thanks for working on this! > > > > There'll probably be some minor comments to the format PR, but those > > don't deter from accepting these new layou

Re: [VOTE][Format] Add ListView and LargeListView Arrays to Arrow Format

2023-10-02 Thread Matt Topol
Should have expanded my messages, i forgot that i already +1'd this d'oh! Sorry for the spam! --Matt On Mon, Oct 2, 2023 at 2:19 PM Matt Topol wrote: > +1 > > On Mon, Oct 2, 2023 at 8:54 AM Raphael Taylor-Davies > wrote: > >> +1 >> >>

Re: [DISCUSS][C++] Raw pointer string views

2023-10-02 Thread Matt Topol
Given the benchmarks that Ben provided, I think I still have one concern if we only support the offset-based representation: @Raphael: > Conversion between the two view representations is relatively fast, especially for small strings I think this is a bit of an oversimplification given Ben's ass

[RESULT] [VOTE] [Format] Add app_metadata to FlightInfo and FlightEndpoint

2023-10-03 Thread Matt Topol
nding `FlightEndpoint`s and `FlightData` chunks. > > > Le 12/09/2023 à 17:48, Matt Topol a écrit : > > Hey all, > > > > I would like to propose adding a new app_metadata field to both the > > FlightInfo and FlightEndpoint message types of the Arrow Flight protocol. >

[RESULT][VOTE][Format] Add ListView and LargeListView Arrays to Arrow Format

2023-10-04 Thread Matt Topol
38C17-L238C17 > > > On Tue, 3 Oct 2023 at 00:22 Micah Kornfield wrote: > > > Sorry to chime in late. In practice I'm not sure how much LargeList is > > used? Are we doing this just for symmetry purposes? Is there a known > > use-case for it? > > > > On

  1   2   3   >