Hi Wes,

Yes, I have a SQL parser (actually this is a separate crate) and DataFusion
has the query planner and execution engine. Here is a blog post from last
summer with some performance comparisons with Apache Spark:

https://andygrove.io/2018/05/datafusion-aggregate-performance/

I have recently been updating the code to work with my fork of Arrow and
currently it only works with CSV and not Parquet, but adding Parquet
support again will be simple once the Arrow reader is added (others are
working on this already).

I guess I should write this up in more detail and we can open it up to a
vote here to see if there is an appetite to donate and support this code
here?

Thanks,

Andy.

Reply via email to