Re: [WEBSITE] Blog posts on representing Structured Data with Parquet and Arrow

2022-10-05 Thread Andrew Lamb
Let's just say the posting process has been a comedy of errors by me 🤦‍♂️. I apologize for the noise. On Wed, Oct 5, 2022 at 4:17 PM Will Jones wrote: > https://arrow.apache.org/blog/2022/10/05/arrow-parquet-encoding-part-1/ > > On Wed, Oct 5, 2022 at 12:48 PM Sasha Krassovsky < > krassovskysa...

Re: Parser for expressions

2022-10-05 Thread Sasha Krassovsky
Hi David, Could you elaborate on which part of my proposal overlaps with Substrait? I don’t see anything in Substrait that allows me to do something along the lines of Expression e = Expression::FromString("(add !.a $int32:1)"); in the code. Sasha > On Oct 5, 2022, at 1:35 PM, Lee, David
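For readers unfamiliar with this style of expression string, here is a minimal, hypothetical sketch of how a string such as "(add !.a $int32:1)" could be parsed into a tree. It is not the parser proposed in this thread and does not use any Arrow API. The names FieldRef, Literal, Call and parse_expression are invented for illustration, and reading `!.a` as a field reference and `$int32:1` as a typed literal is an assumption based only on the one example above.

```python
import re
from dataclasses import dataclass


# Hypothetical node types for illustration only; these are not Arrow classes.
@dataclass
class FieldRef:
    name: str


@dataclass
class Literal:
    type_name: str
    value: str  # kept as text; a real parser would convert it using type_name


@dataclass
class Call:
    function: str
    args: list


# A token is an opening paren, a closing paren, or a run of other characters.
TOKEN = re.compile(r"\(|\)|[^\s()]+")


def parse_expression(text):
    """Parse a prefix-notation expression string into a small tree."""
    tokens = TOKEN.findall(text)
    node, pos = _parse(tokens, 0)
    if pos != len(tokens):
        raise ValueError("unexpected trailing tokens")
    return node


def _parse(tokens, pos):
    if pos >= len(tokens):
        raise ValueError("unexpected end of input")
    tok = tokens[pos]
    if tok == "(":
        # Assumed form: "(" function-name argument* ")"
        if pos + 1 >= len(tokens) or tokens[pos + 1] in ("(", ")"):
            raise ValueError("expected a function name after '('")
        function = tokens[pos + 1]
        pos += 2
        args = []
        while pos < len(tokens) and tokens[pos] != ")":
            arg, pos = _parse(tokens, pos)
            args.append(arg)
        if pos >= len(tokens):
            raise ValueError("missing closing parenthesis")
        return Call(function, args), pos + 1
    if tok == ")":
        raise ValueError("unexpected ')'")
    if tok.startswith("!."):
        # Assumption: "!.name" refers to a column called "name".
        return FieldRef(tok[2:]), pos + 1
    if tok.startswith("$") and ":" in tok:
        # Assumption: "$type:value" is a literal of the given type.
        type_name, value = tok[1:].split(":", 1)
        return Literal(type_name, value), pos + 1
    raise ValueError(f"unrecognized token: {tok!r}")


if __name__ == "__main__":
    print(parse_expression("(add !.a $int32:1)"))
```

Nested calls such as "(add !.a (multiply !.b $int32:2))" go through the same recursive path, which is the main reason a prefix notation like this is easy to parse by hand.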

RE: Parser for expressions

2022-10-05 Thread Lee, David
I believe this is what substrait.io is trying to accomplish. Here's some additional info: https://substrait.io/ https://www.youtube.com/watch?v=5JjaB7p3Sjk -Original Message- From: Sasha Krassovsky Sent: Wednesday, October 5, 2022 11:29 AM To: dev@arrow.apache.org Subject: Parser for

Re: Integration between ibis-substrait and Acero

2022-10-05 Thread Will Jones
Some of these issues may be fixed in ARROW-17915 [1]. [1] https://github.com/apache/arrow/pull/14295 On Wed, Oct 5, 2022 at 12:07 PM Will Jones wrote: > I can confirm that fixes that issue in the simple case. But if result is > either of these, we get an error: > > result = compiler.compile(t.s

Re: [WEBSITE] Blog posts on representing Structured Data with Parquet and Arrow

2022-10-05 Thread Will Jones
https://arrow.apache.org/blog/2022/10/05/arrow-parquet-encoding-part-1/ On Wed, Oct 5, 2022 at 12:48 PM Sasha Krassovsky wrote: > Hi, we aren’t able to connect to your localhost 😀 > > > On Oct 5, 2022, at 12:44 PM, Andrew Lamb wrote: > > > > We have published the first post: > > http://localhos

Re: [WEBSITE] Blog posts on representing Structured Data with Parquet and Arrow

2022-10-05 Thread Sasha Krassovsky
Hi, we aren’t able to connect to your localhost 😀 > On Oct 5, 2022, at 12:44 PM, Andrew Lamb wrote: > > We have published the first post: > http://localhost:4000/blog/2022/10/05/arrow-parquet-encoding-part-1/ > > On Sun, Oct 2, 2022 at 7:00 AM Andrew Lamb wrote: > >> We are working on a seri

Re: [WEBSITE] Blog posts on representing Structured Data with Parquet and Arrow

2022-10-05 Thread Andrew Lamb
We have published the first post: http://localhost:4000/blog/2022/10/05/arrow-parquet-encoding-part-1/ On Sun, Oct 2, 2022 at 7:00 AM Andrew Lamb wrote: > We are working on a series of blog posts[1][2][3] that we plan to publish > to the arrow blog about how structured data is represented using

Re: [VOTE] Adopt ADBC database client connectivity specification

2022-10-05 Thread David Li
Kirill (CC'd) mentioned the organization could be improved [1] - I've put up a PR to move the definitions around to make it easier for implementors [2]. I'll leave this thread open for a little while longer for any interested parties/see if anyone has PR comments before I'll merge the PR/vote/cl

Re: [VOTE] Adopt ADBC database client connectivity specification

2022-10-05 Thread Neal Richardson
+1 (I think this makes 4 binding +1s, if I count correctly) On Wed, Oct 5, 2022 at 11:30 AM Antoine Pitrou wrote: > > +1 (binding), with the caveat that I looked mostly at the C API. > > Regards > > Antoine. > > > Le 21/09/2022 à 17:40, David Li a écrit : > > Hello, > > > > We have been discuss

Re: Integration between ibis-substrait and Acero

2022-10-05 Thread Will Jones
I can confirm that fixes that issue in the simple case. But if result is either of these, we get an error: result = compiler.compile(t.select("b")) # leads to: Traceback (most recent call last): File "", line 1, in File "pyarrow/_substrait.pyx", line 140, in pyarrow._substrait.run_query c

Re: Integration between ibis-substrait and Acero

2022-10-05 Thread Li Jin
Ok I think I got a working version now:

    t = ibis.table([("a", "int64"), ("b", "int64")], name="table0")
    test_table_0 = pa.Table.from_pydict({"a": [1, 2, 3], "b": [4, 5, 6]})
    result = self.compiler.compile(t)

    def table_provider(names):
        if not name

Re: Integration between ibis-substrait and Acero

2022-10-05 Thread Li Jin
Hmm. Thanks for the update. Now that I have searched the code more, it seems I should perhaps be using "compile" rather than "translate": https://github.com/ibis-project/ibis-substrait/blob/main/ibis_substrait/compiler/core.py#L82 Let me try some more. On Wed, Oct 5, 2022 at 1:42 PM Will Jones wrote: >

Parser for expressions

2022-10-05 Thread Sasha Krassovsky
Hi everyone, I’ve noticed on the mailing list a few times people asking for a more convenient way to construct an Expression, namely using a string of some sort. I’ve found myself wishing for something like this too when constructing ExecPlans, and so I’ve gone ahead and implemented a parser [0]

Re: Integration between ibis-substrait and Acero

2022-10-05 Thread Will Jones
Hi Li Jin, The original segfault seems to occur because you are passing a Python bytes object and not a PyArrow Buffer object. You can wrap the bytes object using pa.py_buffer():

    pa.substrait.run_query(pa.py_buffer(result_bytes), table_provider)

That being said, when I run your full example wit

Re: [VOTE] Adopt ADBC database client connectivity specification

2022-10-05 Thread Antoine Pitrou
+1 (binding), with the caveat that I looked mostly at the C API. Regards Antoine. Le 21/09/2022 à 17:40, David Li a écrit : Hello, We have been discussing [1] standard interfaces for Arrow-based database access and have been working on implementations of the proposed interfaces [2], all u

Re: [VOTE] Adopt ADBC database client connectivity specification

2022-10-05 Thread Jacob Wujciak
+ 1 (non-binding) On Tue, Oct 4, 2022 at 11:05 PM David Li wrote: > Just another reminder - this has been open for a while with only one > binding vote. I know it's near 10.0.0 and I don't expect this to make the > release, but I'd appreciate if we could get this wrapped up one way or > another.