[ https://issues.apache.org/jira/browse/HIVE-14803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15675748#comment-15675748 ]
Rajesh Balamohan commented on HIVE-14803: ----------------------------------------- Thanks [~pxiong]. {noformat}org.apache.hadoop.hive.cli.TestHBaseCliDriver.testCliDriver[hbase_bulk]{noformat} is not related to this patch. HIVE-14937 tracks that as it failed for several runs. > S3: Stats gathering for insert queries can be expensive for partitioned > dataset > ------------------------------------------------------------------------------- > > Key: HIVE-14803 > URL: https://issues.apache.org/jira/browse/HIVE-14803 > Project: Hive > Issue Type: Improvement > Components: Metastore > Affects Versions: 2.1.0 > Reporter: Rajesh Balamohan > Assignee: Rajesh Balamohan > Priority: Minor > Attachments: HIVE-14803.1.patch, HIVE-14803.2.patch, > HIVE-14803.3.patch, HIVE-14803.4.patch, HIVE-14803.5.patch, > HIVE-14803.6.patch, HIVE-14803.7.patch > > > StatsTask's aggregateStats populates stats details for all partitions by > checking the file sizes which turns out to be expensive when larger number of > partitions are inserted. -- This message was sent by Atlassian JIRA (v6.3.4#6332)