Re: [DISCUSS][C++] Group by operation for RecordBatch and Table

2020-08-10 Thread Kenta Murata
Hi Wes, Thank you very much giving us the detail explanation of your thoughts. I need the knowledge of the SOTA of query engine you pointed out if I’ll contribute to C++ Query Engine or just write the binding of it. I’m studying the article and the codes. Regards, Kenta Murata On Thu, Aug 6, 2

Re: [DISCUSS][C++] Group by operation for RecordBatch and Table

2020-08-05 Thread Wes McKinney
I see there's a bunch of additional aggregation code in Dremio that might serve as inspiration (some of which is related to distributed aggregation, so may not be relevant) https://github.com/dremio/dremio-oss/tree/master/sabot/kernel/src/main/java/com/dremio/sabot/op/aggregate Maybe Andy or one

Re: [DISCUSS][C++] Group by operation for RecordBatch and Table

2020-08-05 Thread Wes McKinney
hi Kenta, Yes, I think it only makes sense to implement this in the context of the query engine project. Here's a list of assorted thoughts about it: * I have been mentally planning to follow the Vectorwise-type query engine architecture that's discussed in [1] [2] and many other academic papers.