First, an update on progress. Once the PRs for ARROW-9711 and ARROW-9716
are merged, it will be possible to run TPC-H query 1 against a 100 GB dataset
with performance similar to Apache Spark in local mode. I plan on testing
larger datasets over the weekend.
To answer Kirill's question, I wouldn't nec
Sounds interesting as we wanted to start using DataFusion.
Btw, I vaguely remember that in the original repository you had an issue
like "investigate DataFusion with Gandiva". I'm curious why you
decided to give up on it?
On Thu, Aug 13, 2020 at 5:11 PM Andy Grove wrote:
>
> Some of you may