[ https://issues.apache.org/jira/browse/HIVE-28489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17894391#comment-17894391 ]
Seonggon Namgung commented on HIVE-28489:
-----------------------------------------

[~zabetak], PR#5424 is ready for review. I have also changed the file format. Please let me know if any issues remain with the slides.

> Partitioning the input data of Grouping Set GroupBy operator
> ------------------------------------------------------------
>
>                 Key: HIVE-28489
>                 URL: https://issues.apache.org/jira/browse/HIVE-28489
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Seonggon Namgung
>            Assignee: Seonggon Namgung
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: 2.PartitionDataBeforeGroupingSet.pdf
>
>
> A GroupBy operator with grouping sets often emits too many rows, which becomes
> the bottleneck of query execution. To reduce the number of output rows, this
> JIRA proposes partitioning the input data of such a GroupBy operator.
> Please check out the attached slides for a detailed explanation.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)