[jira] [Updated] (HIVE-9938) Add retry logic to DbTxnMgr instead of aborting transactions.

Damien Carol (JIRA) Tue, 16 Jun 2015 07:32:00 -0700

     [ https://issues.apache.org/jira/browse/HIVE-9938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]


Damien Carol updated HIVE-9938:
-------------------------------
    Description: 
Sometimes parallel updates using DbTxnManager result in the following error trace:
{noformat}
5325 [main] INFO org.apache.hadoop.hive.ql.log.PerfLogger - <PERFLOG method=acquireReadWriteLocks from=org.apache.hadoop.hive.ql.Driver>
5351 [main] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: Error in acquiring locks: Error communicating with the metastore
org.apache.hadoop.hive.ql.lockmgr.LockException: Error communicating with the metastore
at org.apache.hadoop.hive.ql.lockmgr.DbLockManager.lock(DbLockManager.java:100)
at org.apache.hadoop.hive.ql.lockmgr.DbTxnManager.acquireLocks(DbTxnManager.java:194)
{noformat}

Looking at the Postgres logs internally, we see:
{noformat}
2015-02-02 06:36:05,632 ERROR org.apache.hadoop.hive.metastore.RetryingHMSHandler: org.apache.thrift.TException: MetaException(message:Unable to update transaction database org.postgresql.util.PSQLException: ERROR: could not serialize access due to concurrent update
{noformat}
Ideally, we should add retry logic that retries the failed transaction instead of aborting it.

  was:
Sometimes parallel updates using DbTxnManager result in the following error trace:

5325 [main] INFO org.apache.hadoop.hive.ql.log.PerfLogger - <PERFLOG method=acquireReadWriteLocks from=org.apache.hadoop.hive.ql.Driver>
5351 [main] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: Error in acquiring locks: Error communicating with the metastore
org.apache.hadoop.hive.ql.lockmgr.LockException: Error communicating with the metastore
at org.apache.hadoop.hive.ql.lockmgr.DbLockManager.lock(DbLockManager.java:100)
at org.apache.hadoop.hive.ql.lockmgr.DbTxnManager.acquireLocks(DbTxnManager.java:194)


Looking at the Postgres logs internally, we see:

2015-02-02 06:36:05,632 ERROR org.apache.hadoop.hive.metastore.RetryingHMSHandler: org.apache.thrift.TException: MetaException(message:Unable to update transaction database org.postgresql.util.PSQLException: ERROR: could not serialize access due to concurrent update


Ideally, we should add retry logic that retries the failed transaction instead of aborting it.


> Add retry logic to DbTxnMgr instead of aborting transactions.
> -------------------------------------------------------------
>
>                 Key: HIVE-9938
>                 URL: https://issues.apache.org/jira/browse/HIVE-9938
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 0.14.0
>            Reporter: bharath v
>
> Sometimes parallel updates using DbTxnManager result in the following error trace:
> {noformat}
> 5325 [main] INFO org.apache.hadoop.hive.ql.log.PerfLogger - <PERFLOG method=acquireReadWriteLocks from=org.apache.hadoop.hive.ql.Driver>
> 5351 [main] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: Error in acquiring locks: Error communicating with the metastore
> org.apache.hadoop.hive.ql.lockmgr.LockException: Error communicating with the metastore
> at org.apache.hadoop.hive.ql.lockmgr.DbLockManager.lock(DbLockManager.java:100)
> at org.apache.hadoop.hive.ql.lockmgr.DbTxnManager.acquireLocks(DbTxnManager.java:194)
> {noformat}
> Looking at the Postgres logs internally, we see:
> {noformat}
> 2015-02-02 06:36:05,632 ERROR org.apache.hadoop.hive.metastore.RetryingHMSHandler: org.apache.thrift.TException: MetaException(message:Unable to update transaction database org.postgresql.util.PSQLException: ERROR: could not serialize access due to concurrent update
> {noformat}
> Ideally, we should add retry logic that retries the failed transaction instead of aborting it.
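
The retry suggested in the description could look roughly like the Java sketch below. It is an illustration under stated assumptions, not the change made in Hive: TxnRetry, withRetry, and isRetryable are hypothetical names. The one fact it relies on is that PostgreSQL reports "could not serialize access due to concurrent update" with SQLSTATE 40001 (serialization_failure). The operation passed in is assumed to re-run the whole database transaction from the start, because the failed transaction has to be rolled back and cannot be continued.

```java
import java.sql.SQLException;
import java.util.concurrent.Callable;

// Hypothetical helper; not part of Hive's API.
final class TxnRetry {
    // SQLSTATE that PostgreSQL uses for serialization failures.
    static final String SERIALIZATION_FAILURE = "40001";

    private TxnRetry() {}

    // True if any exception in the cause chain is a SQLException
    // carrying the serialization-failure SQLSTATE.
    static boolean isRetryable(Throwable t) {
        for (Throwable cur = t; cur != null; cur = cur.getCause()) {
            if (cur instanceof SQLException
                    && SERIALIZATION_FAILURE.equals(((SQLException) cur).getSQLState())) {
                return true;
            }
        }
        return false;
    }

    // Runs op up to maxAttempts times, sleeping between attempts with a
    // delay that doubles each time. Non-retryable errors, and the error
    // from the final attempt, are rethrown to the caller unchanged.
    static <T> T withRetry(Callable<T> op, int maxAttempts, long baseDelayMs)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        long delayMs = baseDelayMs;
        for (int attempt = 1; ; attempt++) {
            try {
                return op.call();
            } catch (Exception e) {
                if (attempt >= maxAttempts || !isRetryable(e)) {
                    throw e;
                }
                Thread.sleep(delayMs);
                delayMs *= 2;
            }
        }
    }
}
```

A caller would wrap the code that updates the transaction tables, for example TxnRetry.withRetry(() -> updateTxnTables(conn), 5, 100), where updateTxnTables is a placeholder for whatever opens, runs, and commits the database transaction. Bounding the number of attempts keeps a persistently conflicting workload from retrying forever.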



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
