Hi Wes,

Yes, I have a SQL parser (this is actually a separate crate), and DataFusion has the query planner and execution engine. Here is a blog post from last summer with some performance comparisons with Apache Spark:

https://andygrove.io/2018/05/datafusion-aggregate-performance/

I have recently been updating the code to work with my fork of Arrow. It currently works only with CSV and not Parquet, but adding Parquet support again will be simple once the Arrow reader is added (others are working on this already).

I guess I should write this up in more detail, and then we can put it to a vote to see if there is an appetite for donating and supporting this code here?

Thanks,

Andy.