Any update of this task? We are very interested in this feature. Thanks.
Lirong Jian HashData Inc. Chendi.Xue (Jira) <j...@apache.org> 于2019年11月14日周四 下午2:02写道: > Chendi.Xue created ARROW-7165: > --------------------------------- > > Summary: [C++] Arrow Compute Group By Support > Key: ARROW-7165 > URL: https://issues.apache.org/jira/browse/ARROW-7165 > Project: Apache Arrow > Issue Type: New Feature > Components: C++ - Compute > Reporter: Chendi.Xue > > > Not sure if there is any plan to support groupby in arrow? > > Here is some to do in my mind: > # To make current arrow/compute/kernels/hash supporting received a > memo_table as input, so multiple array will be able to get dictencode and > valuecount based on same hashmap with a unified index. > # To add a split array function instead of using take multiple time to > split one array to several ones. > # so the output array can use current funcs under compute/kernels, such > as sum/count/sort to support group by. > > But this is some of my basic idea, wanna know if there is a on going plan > on this? > > > > -- > This message was sent by Atlassian Jira > (v8.3.4#803005) >