Re: Transactional semantics in Acero

2022-09-11 Thread Jayjeet Chakraborty
Thanks a lot, Micah On Sun, Sep 11, 2022 at 10:11 PM Micah Kornfield wrote: > Delta Lake has rust bindings which should in theory be linkable into > native code. > > Iceberg is actively developing a python library and there has been talk of > Rust/native bindings. I'd like to see a C++ impleme

Re: Transactional semantics in Acero

2022-09-11 Thread Micah Kornfield
Delta Lake has rust bindings which should in theory be linkable into native code. Iceberg is actively developing a python library and there has been talk of Rust/native bindings. I'd like to see a C++ implementation that can be incorporated into Datasets but don't currently have bandwidth to wor

Re: Transactional semantics in Acero

2022-09-09 Thread Jayjeet Chakraborty
Thanks a lot everyone for your comments. Sorry, I meant to say adding transaction/update/append functionalities in the Dataset API, but it seems like it would be a duplication of work as in Apache Iceberg. The only problem with Iceberg/Delta Lake is that it is heavily locked into the JVM ecosystem,

Re: Transactional semantics in Acero

2022-09-09 Thread Weston Pace
I'd agree with Micah. I'm also not aware of anyone working on this. The docs clarify a bit more on the details[1]. I think we'd need a bit more thinking around an "update/append" workflow too. That being said, updates, transactions, and appends are something that the Iceberg project has thought

Re: Transactional semantics in Acero

2022-09-09 Thread Micah Kornfield
I would think any transaction concerns would live at the peripheries? e.g. the Datasets? Or at least that is where compatibility would have to be built first. On Fri, Sep 9, 2022 at 12:01 PM Sasha Krassovsky wrote: > Hi Jayjeet, > Transactions are currently out of scope for Acero - Acero is on

Re: Transactional semantics in Acero

2022-09-09 Thread Sasha Krassovsky
Hi Jayjeet, Transactions are currently out of scope for Acero - Acero is only meant to be a query execution engine. That said, it can definitely be used as a component for building a full database engine, which could implement its own locking of rows while Acero executes on them. You could also

Transactional semantics in Acero

2022-09-09 Thread Jayjeet Chakraborty
Hi Arrow Community, Since Acero is developing very fast into a full fledged compute engine, are there plans to add transaction semantics to acero, so that it can also be used as a database layer over already supported storage backends ? What I am referring to is like a Delta Lake/Iceberg kind of i