[jira] [Work logged] (HIVE-22979) Support total file size in statistics annotation

ASF GitHub Bot (Jira) Fri, 06 Mar 2020 09:45:12 -0800


     [ 
https://issues.apache.org/jira/browse/HIVE-22979?focusedWorklogId=399257&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-399257
 ]


ASF GitHub Bot logged work on HIVE-22979:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 06/Mar/20 17:44
            Start Date: 06/Mar/20 17:44
    Worklog Time Spent: 10m 
      Work Description: prasanthj commented on pull request #941: HIVE-22979: 
Support total file size in statistics annotation
URL: https://github.com/apache/hive/pull/941#discussion_r389048647
 
 

 ##########
 File path: ql/src/java/org/apache/hadoop/hive/ql/plan/Statistics.java
 ##########
 @@ -53,19 +53,21 @@ public State merge(State otherState) {
   private long numRows;
   private long runTimeNumRows;
   private long dataSize;
+  private long totalFileSize;
 
 Review comment:
   Done
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 399257)
    Time Spent: 40m  (was: 0.5h)

> Support total file size in statistics annotation
> ------------------------------------------------
>
>                 Key: HIVE-22979
>                 URL: https://issues.apache.org/jira/browse/HIVE-22979
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 4.0.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Minor
>              Labels: pull-request-available
>         Attachments: HIVE-22979.1.patch
>
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> Hive statistics annotation provide estimated Statistics for each operator. 
> The data size provided in TableScanOperator is raw data size (after 
> decompression and decoding), but there are some optimizations that can be 
> performed based on total file size on disk (scan cost estimation).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Work logged] (HIVE-22979) Support total file size in statistics annotation

Reply via email to