[ https://issues.apache.org/jira/browse/ARROW-4748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17661770#comment-17661770 ]
Rok Mihevc commented on ARROW-4748: ----------------------------------- This issue has been migrated to [issue #21267|https://github.com/apache/arrow/issues/21267] on GitHub. Please see the [migration documentation|https://github.com/apache/arrow/issues/14542] for further details. > [Rust] [DataFusion] GROUP BY performance could be optimized > ----------------------------------------------------------- > > Key: ARROW-4748 > URL: https://issues.apache.org/jira/browse/ARROW-4748 > Project: Apache Arrow > Issue Type: Improvement > Components: Rust, Rust - DataFusion > Affects Versions: 0.12.0 > Reporter: Andy Grove > Assignee: Andy Grove > Priority: Major > Labels: pull-request-available > Fix For: 0.16.0 > > Time Spent: 0.5h > Remaining Estimate: 0h > > The logic to build the group by keys is row-based, performing an array > downcast on every single group by value. This could be done in a columnar way > instead. > > I also wonder if it is possible to avoid converting the result map to an > array of map entries. -- This message was sent by Atlassian Jira (v8.20.10#820010)