[ https://issues.apache.org/jira/browse/HIVE-28489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Shohei Okumiya updated HIVE-28489: ---------------------------------- Fix Version/s: 4.1.0 > Partitioning the input data of Grouping Set GroupBy operator > ------------------------------------------------------------ > > Key: HIVE-28489 > URL: https://issues.apache.org/jira/browse/HIVE-28489 > Project: Hive > Issue Type: New Feature > Components: Physical Optimizer > Reporter: Seonggon Namgung > Assignee: Seonggon Namgung > Priority: Major > Labels: hive-4.1.0-must, pull-request-available > Fix For: 4.1.0 > > Attachments: 2.PartitionDataBeforeGroupingSet.pdf > > > GroupBy operator with grouping sets often emits too many rows, which becomes > the bottleneck of query execution. To reduce the number output rows, this > JIRA proposes partitioning the input data of such GroupBy operator. > Please check out the attached slides for detailed explanation. -- This message was sent by Atlassian Jira (v8.20.10#820010)