[ https://issues.apache.org/jira/browse/HIVE-9938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Damien Carol updated HIVE-9938: ------------------------------- Description: Sometimes parallel updates using DBTxnMgr results in the following error trace {noformat} 5325 [main] INFO org.apache.hadoop.hive.ql.log.PerfLogger - <PERFLOG method=acquireReadWriteLocks from=org.apache.hadoop.hive.ql.Driver> 5351 [main] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: Error in acquiring locks: Error communicating with the metastore org.apache.hadoop.hive.ql.lockmgr.LockException: Error communicating with the metastore at org.apache.hadoop.hive.ql.lockmgr.DbLockManager.lock(DbLockManager.java:100) at org.apache.hadoop.hive.ql.lockmgr.DbTxnManager.acquireLocks(DbTxnManager.java:194) {noformat} Internally looking at the postgres logs we see {noformat} 2015-02-02 06:36:05,632 ERROR org.apache.hadoop.hive.metastore.RetryingHMSHandler: org.apache.thrift.TException: MetaException(message:Unable to update transaction database org.postgresql.util.PSQLException: ERROR: could not serialize access due to concurrent update {noformat} Ideally we should add a retry logic to retry the failed transaction. was: Sometimes parallel updates using DBTxnMgr results in the following error trace 5325 [main] INFO org.apache.hadoop.hive.ql.log.PerfLogger - <PERFLOG method=acquireReadWriteLocks from=org.apache.hadoop.hive.ql.Driver> 5351 [main] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: Error in acquiring locks: Error communicating with the metastore org.apache.hadoop.hive.ql.lockmgr.LockException: Error communicating with the metastore at org.apache.hadoop.hive.ql.lockmgr.DbLockManager.lock(DbLockManager.java:100) at org.apache.hadoop.hive.ql.lockmgr.DbTxnManager.acquireLocks(DbTxnManager.java:194) Internally looking at the postgres logs we see 2015-02-02 06:36:05,632 ERROR org.apache.hadoop.hive.metastore.RetryingHMSHandler: org.apache.thrift.TException: MetaException(message:Unable to update transaction database org.postgresql.util.PSQLException: ERROR: could not serialize access due to concurrent update Ideally we should add a retry logic to retry the failed transaction. > Add retry logic to DbTxnMgr instead of aborting transactions. > ------------------------------------------------------------- > > Key: HIVE-9938 > URL: https://issues.apache.org/jira/browse/HIVE-9938 > Project: Hive > Issue Type: Improvement > Affects Versions: 0.14.0 > Reporter: bharath v > > Sometimes parallel updates using DBTxnMgr results in the following error trace > {noformat} > 5325 [main] INFO org.apache.hadoop.hive.ql.log.PerfLogger - <PERFLOG > method=acquireReadWriteLocks from=org.apache.hadoop.hive.ql.Driver> > 5351 [main] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: Error in > acquiring locks: Error communicating with the metastore > org.apache.hadoop.hive.ql.lockmgr.LockException: Error communicating with the > metastore > at > org.apache.hadoop.hive.ql.lockmgr.DbLockManager.lock(DbLockManager.java:100) > at > org.apache.hadoop.hive.ql.lockmgr.DbTxnManager.acquireLocks(DbTxnManager.java:194) > > {noformat} > Internally looking at the postgres logs we see > {noformat} > 2015-02-02 06:36:05,632 ERROR > org.apache.hadoop.hive.metastore.RetryingHMSHandler: > org.apache.thrift.TException: MetaException(message:Unable to update > transaction database org.postgresql.util.PSQLException: ERROR: could not > serialize access due to concurrent update > {noformat} > Ideally we should add a retry logic to retry the failed transaction. -- This message was sent by Atlassian JIRA (v6.3.4#6332)