Add timestamp column with index to the partition stats table.
-------------------------------------------------------------

                 Key: HIVE-2471
                 URL: https://issues.apache.org/jira/browse/HIVE-2471
             Project: Hive
          Issue Type: Improvement
            Reporter: Kevin Wilfong
            Assignee: Kevin Wilfong


Occasionally, after entries are added to the partition stats table, the program 
is halted (by an exception, a keyboard interrupt, etc.) before it can delete 
them.  These orphaned entries accumulate until the table becomes very large, 
which hurts the performance of the frequently executed update statement.  To 
fix this, I am adding a column to the table that is auto-populated with the 
current timestamp, along with an index on that column.  This will allow us to 
write scripts that run periodically and clean out old entries from the table.  
The index will keep the runtime of these scripts short, and hence reduce the 
amount of time they need to hold locks on the table and its indexes.
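
The change and the cleanup it enables can be sketched roughly as follows.  This 
is only an illustration under stated assumptions, not the actual patch: the 
table name partition_stats, the columns id, row_count and ts, the index name, 
and the purge_older_than helper are all hypothetical, and an in-memory SQLite 
database stands in for whatever database really backs the stats table.

```python
import sqlite3


def create_schema(conn):
    """Create a stand-in stats table with an auto-populated timestamp column."""
    # Hypothetical schema: the issue does not give the real table or column names.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS partition_stats ("
        " id TEXT PRIMARY KEY,"
        " row_count INTEGER,"
        # Filled in by the database when an INSERT omits the column.
        " ts TIMESTAMP NOT NULL DEFAULT CURRENT_TIMESTAMP)"
    )
    # Index on the timestamp so the cleanup can find old rows without a full scan.
    conn.execute(
        "CREATE INDEX IF NOT EXISTS partition_stats_ts_idx"
        " ON partition_stats (ts)"
    )


def purge_older_than(conn, days):
    """Delete entries older than `days` days and return how many were removed."""
    cur = conn.execute(
        "DELETE FROM partition_stats WHERE ts < datetime('now', ?)",
        (f"-{int(days)} days",),
    )
    conn.commit()
    return cur.rowcount


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    # A fresh entry gets its timestamp from the column default.
    conn.execute("INSERT INTO partition_stats (id, row_count) VALUES ('fresh', 10)")
    # A leftover entry, backdated here to simulate an interrupted run.
    conn.execute(
        "INSERT INTO partition_stats (id, row_count, ts)"
        " VALUES ('stale', 5, '2000-01-01 00:00:00')"
    )
    print("purged:", purge_older_than(conn, 1))
```

A periodic job (cron, for instance) could call something like 
purge_older_than with a retention window long enough that no running query 
still owns the rows being deleted; the right window is a deployment decision 
and is not specified in this issue.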

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
