Mithun Radhakrishnan created HIVE-17612:
-------------------------------------------

             Summary: Hive does not insert dynamic partition-sets atomically
                 Key: HIVE-17612
                 URL: https://issues.apache.org/jira/browse/HIVE-17612
             Project: Hive
          Issue Type: Improvement
          Components: CLI, Hive
    Affects Versions: 2.2.0, 3.0.0
            Reporter: Mithun Radhakrishnan
            Assignee: Mithun Radhakrishnan


If one inserts partitions to a Hive table using a Hive query (e.g. {{INSERT 
OVERWRITE TABLE my_table PARTITION (foo, bar) SELECT * FROM another_table;}}), 
each dynamic partition is added separately, using {{HMSC.append_partition()}}. 
By contrast, Pig/HCatLoader does the same atomically, using 
{{HMSC.add_partitions()}}.

Because of this behaviour, Oozie workflows might kick off when the first 
partition is registered, but before the last partition in the set is available.

This was verified in the metastore-logs, with multiple {{ADD_PARTITION}} events 
fired for the same query (i.e. once per added partition), instead of a single 
event for the set.

It would be ideal for Hive to provide atomic partition-adds.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to