Re: My focus for Rust implementation for 2.0.0

2020-08-14 Thread Andy Grove
First, an update on progress. Once the PRs for ARROW-9711 and ARROW-9716 are merged, it will be possible to run TPC-H query 1 against a 100 GB dataset with performance similar to Apache Spark in local mode. I plan on testing larger datasets over the weekend. To answer Kirill's question, I wouldn't nec

Re: My focus for Rust implementation for 2.0.0

2020-08-14 Thread Kirill Lykov
Sounds interesting, as we wanted to start using DataFusion. By the way, I vaguely remember that in the original repository you had an issue like "investigate DataFusion with Gandiva". I'm curious why you decided to give up on it? On Thu, Aug 13, 2020 at 5:11 PM Andy Grove wrote: > > Some of you may