Re: Question about pyarrow.substrait.run_query

2022-10-13 Thread Li Jin
Thank you Weston!

On Thu, Oct 13, 2022 at 1:05 AM Weston Pace wrote:
> 1. Yes.
> 2. I was going to say yes but...on closer examination...it appears
> that it is not applying backpressure.
>
> The SinkNode accumulates batches in a queue and applies backpressure.
> I thought we were using a sink n

Re: Question about pyarrow.substrait.run_query

2022-10-12 Thread Weston Pace
1. Yes.
2. I was going to say yes but...on closer examination...it appears that it is not applying backpressure.

The SinkNode accumulates batches in a queue and applies backpressure. I thought we were using a sink node since it is the normal "accumulate batches into a queue" sink. However, the Su
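
For readers following along, here is a rough sketch of the general idea behind an "accumulate batches into a queue" sink that also applies backpressure. It is not Acero's SinkNode and not what run_query does internally. The names BoundedSink, push, finish and run_demo are made up for illustration, and blocking the producer thread when the queue is full is an assumption of this sketch that stands in for whatever pause/resume mechanism a real engine uses.

```python
import queue
import threading

_DONE = object()  # sentinel marking end of stream (hypothetical)


class BoundedSink:
    """Hypothetical bridge from a push-based producer to a pull-based reader."""

    def __init__(self, max_batches):
        # A bounded queue: once it holds max_batches items, put() blocks.
        self._q = queue.Queue(maxsize=max_batches)
        self._finished = False

    def push(self, batch):
        # Called by the producer. Blocking on a full queue is how this
        # sketch models backpressure.
        self._q.put(batch)

    def finish(self):
        # Called by the producer once no more batches will arrive.
        self._q.put(_DONE)

    def __iter__(self):
        return self

    def __next__(self):
        # Called by the consumer. Pulls one batch, waiting if none is ready.
        if self._finished:
            raise StopIteration
        item = self._q.get()
        if item is _DONE:
            self._finished = True
            raise StopIteration
        return item


def run_demo(num_batches=10, max_batches=2):
    """Push num_batches toy batches through the sink and pull them back out."""
    sink = BoundedSink(max_batches)

    def producer():
        for i in range(num_batches):
            sink.push([i])  # a toy "batch"
        sink.finish()

    t = threading.Thread(target=producer)
    t.start()
    pulled = list(sink)  # the consumer pulls at its own pace
    t.join()
    return pulled
```

With an unbounded queue (maxsize=0) the same sink would still accumulate batches, but a slow consumer would let memory grow without limit, which is the situation a missing backpressure implies.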

Question about pyarrow.substrait.run_query

2022-10-12 Thread Li Jin
Hello! I have some questions about how "pyarrow.substrait.run_query" works. Currently run_query returns a record batch reader. Since Acero is a push-based model and the reader is pull-based, I'd assume the reader object somehow accumulates the batches that are pushed to it. And I wonder (1) Does