[ https://issues.apache.org/jira/browse/HIVE-22979?focusedWorklogId=399257&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-399257 ]
ASF GitHub Bot logged work on HIVE-22979:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 06/Mar/20 17:44
            Start Date: 06/Mar/20 17:44
    Worklog Time Spent: 10m
      Work Description: prasanthj commented on pull request #941: HIVE-22979: Support total file size in statistics annotation
URL: https://github.com/apache/hive/pull/941#discussion_r389048647

 ##########
 File path: ql/src/java/org/apache/hadoop/hive/ql/plan/Statistics.java
 ##########
 @@ -53,19 +53,21 @@ public State merge(State otherState) {
    private long numRows;
    private long runTimeNumRows;
    private long dataSize;
 +  private long totalFileSize;

 Review comment:
   Done

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Issue Time Tracking
-------------------

    Worklog Id:     (was: 399257)
    Time Spent: 40m  (was: 0.5h)

> Support total file size in statistics annotation
> ------------------------------------------------
>
>                 Key: HIVE-22979
>                 URL: https://issues.apache.org/jira/browse/HIVE-22979
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 4.0.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Minor
>              Labels: pull-request-available
>         Attachments: HIVE-22979.1.patch
>
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> Hive statistics annotation provides estimated Statistics for each operator.
> The data size provided in TableScanOperator is raw data size (after
> decompression and decoding), but there are some optimizations that can be
> performed based on total file size on disk (scan cost estimation).

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
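
For readers of the archive, the distinction the issue draws between raw data size and on-disk file size can be illustrated with a minimal Java sketch. Everything in it is an assumption made for illustration: the class `ScanStatsSketch`, its methods, and the throughput-based cost model are hypothetical and are not Hive's `Statistics` class or the code in pull request #941. Only the field names `numRows`, `dataSize`, and `totalFileSize` come from the diff above.

```java
// Hypothetical sketch only: this is NOT Hive's Statistics class or its API.
final class ScanStatsSketch {
    private final long numRows;
    private final long dataSize;       // raw bytes, after decompression and decoding
    private final long totalFileSize;  // bytes occupied by the files on disk

    ScanStatsSketch(long numRows, long dataSize, long totalFileSize) {
        if (numRows < 0 || dataSize < 0 || totalFileSize < 0) {
            throw new IllegalArgumentException("statistics must be non-negative");
        }
        this.numRows = numRows;
        this.dataSize = dataSize;
        this.totalFileSize = totalFileSize;
    }

    long getNumRows() { return numRows; }
    long getDataSize() { return dataSize; }
    long getTotalFileSize() { return totalFileSize; }

    // Ratio of raw size to on-disk size; falls back to 1.0 when the
    // file size is unknown (recorded as 0), an assumed convention.
    double expansionRatio() {
        return totalFileSize == 0 ? 1.0 : (double) dataSize / totalFileSize;
    }

    // Assumed cost model: seconds needed to read the on-disk bytes at a
    // given throughput. A scan reads totalFileSize bytes, not dataSize.
    double estimatedScanSeconds(long bytesPerSecond) {
        if (bytesPerSecond <= 0) {
            throw new IllegalArgumentException("throughput must be positive");
        }
        return (double) totalFileSize / bytesPerSecond;
    }
}
```

The point of the sketch is that a scan-cost estimate driven by `dataSize` would overstate I/O for well-compressed files, which is why a separate on-disk figure is useful to an optimizer.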