The merge criteria on dynamic partitions should be per partition
----------------------------------------------------------------

                 Key: HIVE-1806
                 URL: https://issues.apache.org/jira/browse/HIVE-1806
             Project: Hive
          Issue Type: Bug
            Reporter: Ning Zhang
            Assignee: Ning Zhang


Currently, the criterion for firing a merge job on dynamically generated
partitions is the average file size across all dynamic partitions. It is
very common for some dynamic partitions to contain mostly large files while
others contain mostly small files. Even when the average size of all files
is larger than hive.merge.smallfiles.avgsize, we should still merge the
partitions that contain small files, and only those partitions.
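
To illustrate the difference, here is a minimal Python sketch, not Hive's
actual implementation. The function names, the partition names, and the file
sizes are hypothetical; the threshold simply stands in for whatever value
hive.merge.smallfiles.avgsize is set to.

```python
from statistics import mean


def global_average(file_sizes_by_partition):
    """Average file size over all partitions (the current criterion)."""
    all_sizes = [
        size
        for sizes in file_sizes_by_partition.values()
        for size in sizes
    ]
    return mean(all_sizes)


def partitions_to_merge(file_sizes_by_partition, avg_size_threshold):
    """Partitions whose own average file size is below the threshold
    (the proposed per-partition criterion). Empty partitions are skipped."""
    return [
        partition
        for partition, sizes in file_sizes_by_partition.items()
        if sizes and mean(sizes) < avg_size_threshold
    ]


# Hypothetical file sizes in bytes: one partition with large files,
# one with small files.
example = {
    "ds=2010-11-01": [256_000_000, 256_000_000],
    "ds=2010-11-02": [1_000_000, 2_000_000, 1_000_000],
}
threshold = 16_000_000  # illustrative stand-in for hive.merge.smallfiles.avgsize

# Global criterion: the overall average exceeds the threshold, so no merge.
merge_under_global_rule = global_average(example) < threshold

# Per-partition criterion: only the small-file partition is selected.
selected = partitions_to_merge(example, threshold)
```

With these numbers the global average is far above the threshold, so the
current rule fires no merge job at all, while the per-partition rule selects
only the partition holding small files.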

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
