Re: [C++][Parquet] Support different compression algorithms per row group

2024-03-20 Thread Gang Wu
Hi Andrei, What is your use case? IMHO, exposing this kind of configuration will force users to know how will the writer split row groups, which does not look simple to me. Best, Gang On Thu, Mar 21, 2024 at 2:25 AM Andrei Lazăr wrote: > Hi all, > > I would like proposing adding support for wr

[VOTE] Stateless prepared statements in FlightSQL

2024-03-20 Thread Adam C
Hello, I would like to propose a change to the FlightSQL specification as originally described in this Github issue [1] by Andrew Lamb. The specification change would allow servers to support prepared statements with parameters, without needing to manage state between client requests. There is a

[C++][Parquet] Support different compression algorithms per row group

2024-03-20 Thread Andrei Lazăr
Hi all, I would like proposing adding support for writing a Parquet file with different compression algorithms for every row group. In my understanding, the Parquet format allows this, however it seems to me that there is no way to achieve this from the C++ implementation. Does anyone have any t

Re: Apache Arrow Flight - From Rust to Javascript (FlightData)

2024-03-20 Thread Matt Topol
I don't think there is currently a direct equivalent to `FlightRecordBatchStream` in the arrow javascript library, but you should be able to combine the data header + body and then read it using the `fromIPC` functions since it's just the Arrow IPC format On Fri, Mar 15, 2024 at 5:39 AM Alexander

[ANNOUNCE] Apache Arrow 15.0.2 released

2024-03-20 Thread Raúl Cumplido
The Apache Arrow community is pleased to announce the 15.0.2 release. It includes 8 resolved issues ([1]) since the 15.0.1 release. The release is available now from our website and [2]: http://arrow.apache.org/install/ Read about what's new in the release https://arrow.apache.org/blog/2024/0

Re: ADBC - OS-level driver manager

2024-03-20 Thread Antoine Pitrou
Also, with ADBC driver implementations currently in flux (none of them has reached the "stable" status in https://arrow.apache.org/adbc/main/driver/status.html), it might be a disservice to users to implicitly fetch drivers from potentially outdated DLLs on the current system. Regards Ant

Re: ADBC - OS-level driver manager

2024-03-20 Thread Wenbo Hu
Wenbo Hu 于2024年3月20日周三 22:03写道: > > Hi David, > > I've been working on xDBC with Arrow for a while. I have some thoughts on > ODBC. > > We connect to the DBMS in Arrow stream using Python through four > different methods: JDBC, ADBC, ODBC, and the Python DB client library. > Typically, ADBC is t

Re: ADBC - OS-level driver manager

2024-03-20 Thread Ian Cook
I have given this some thought and discussed it with some colleagues at Voltron Data. Something like this could be valuable in managed environments where there is a need to centrally define data sources across a fleet of systems. Perhaps it would also be valuable for individual system-level configu

Re: ADBC - OS-level driver manager

2024-03-20 Thread Matt Topol
> it seems like the current driver manager work has been largely targeting an app-specific implementation. Yup, that was the intention. So far discussions of ADBC having a system-wide driver registration paradigm like ODBC have mostly been to discuss how much we dislike that paradigm and would pre

Re: ADBC - OS-level driver manager

2024-03-20 Thread Wenbo Hu
Hi David, I've been working on xDBC with Arrow for a while. I have some thoughts on ODBC. We connect to the DBMS in Arrow stream using Python through four different methods: JDBC, ADBC, ODBC, and the Python DB client library. Typically, ADBC is the preferred method as it provides a "native" way f

ADBC - OS-level driver manager

2024-03-20 Thread David Coe
ODBC has different OS-level driver managers available on their respective systems. It seems like the current driver manager work has been largely targeting an app-specific implementation. Have there been any discussions of ADBC having

[DISCUSS][DataFusion] make dfschema wrap schemaref

2024-03-20 Thread huaijin hao
Hi, We want to make dfschema wrap arrow schema to improve datafusion planner performance, as discussed at [1], [2], [3]. We welcome anyone interested in participating in the reviews. Please take a look and add comments/suggestions to the PR [3]. Thanks, huaijin [1]: https://github.com/apache/ar