[ https://issues.apache.org/jira/browse/HIVE-12481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15217999#comment-15217999 ]
Aihua Xu commented on HIVE-12481: --------------------------------- [~thejas] Yeah. When I made the change, actually I was debating if I should add that for zookeeper service discovery or not since we are already retrying for all the hosts. Do you guys think it makes sense to retry for both cases? I agree that it will be clear to support both cases ( I was thinking that we have tried all the hosts and we are sure that there is a nonrecoverable error. Of course, it's based on the assumption that we would configure multiple hosts for service discovery). Regarding the errors, seems this retry could also help on some intermittent errors other than authentication, like probably due to intermittent network issue. So that's the reason I made it more general. > Occasionally "Request is a replay" will be thrown from HS2 > ---------------------------------------------------------- > > Key: HIVE-12481 > URL: https://issues.apache.org/jira/browse/HIVE-12481 > Project: Hive > Issue Type: Improvement > Components: Authentication > Affects Versions: 2.0.0 > Reporter: Aihua Xu > Assignee: Aihua Xu > Fix For: 2.1.0 > > Attachments: HIVE-12481.2.patch, HIVE-12481.3.patch, HIVE-12481.patch > > > We have seen the following exception thrown from HS2 in secured cluster when > many queries are running simultaneously on single HS2 instance. > The cause I can guess is that it happens that two queries are submitted at > the same time and have the same timestamp. For such case, we can add a retry > for the query. > > {noformat} > 2015-11-18 16:12:33,117 ERROR org.apache.thrift.transport.TSaslTransport: > SASL negotiation failure > javax.security.sasl.SaslException: GSS initiate failed [Caused by > GSSException: Failure unspecified at GSS-API level (Mechanism level: Request > is a replay (34))] > at > com.sun.security.sasl.gsskerb.GssKrb5Server.evaluateResponse(GssKrb5Server.java:177) > at > org.apache.thrift.transport.TSaslTransport$SaslParticipant.evaluateChallengeOrResponse(TSaslTransport.java:539) > at org.apache.thrift.transport.TSaslTransport.open(TSaslTransport.java:283) > at > org.apache.thrift.transport.TSaslServerTransport.open(TSaslServerTransport.java:41) > at > org.apache.thrift.transport.TSaslServerTransport$Factory.getTransport(TSaslServerTransport.java:216) > at > org.apache.hadoop.hive.thrift.HadoopThriftAuthBridge$Server$TUGIAssumingTransportFactory$1.run(HadoopThriftAuthBridge.java:739) > at > org.apache.hadoop.hive.thrift.HadoopThriftAuthBridge$Server$TUGIAssumingTransportFactory$1.run(HadoopThriftAuthBridge.java:736) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:356) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1651) > at > org.apache.hadoop.hive.thrift.HadoopThriftAuthBridge$Server$TUGIAssumingTransportFactory.getTransport(HadoopThriftAuthBridge.java:736) > at > org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:268) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > Caused by: GSSException: Failure unspecified at GSS-API level (Mechanism > level: Request is a replay (34)) > at sun.security.jgss.krb5.Krb5Context.acceptSecContext(Krb5Context.java:788) > at sun.security.jgss.GSSContextImpl.acceptSecContext(GSSContextImpl.java:342) > at sun.security.jgss.GSSContextImpl.acceptSecContext(GSSContextImpl.java:285) > at > com.sun.security.sasl.gsskerb.GssKrb5Server.evaluateResponse(GssKrb5Server.java:155) > ... 14 more > Caused by: KrbException: Request is a replay (34) > at sun.security.krb5.KrbApReq.authenticate(KrbApReq.java:308) > at sun.security.krb5.KrbApReq.<init>(KrbApReq.java:144) > at > sun.security.jgss.krb5.InitSecContextToken.<init>(InitSecContextToken.java:108) > at sun.security.jgss.krb5.Krb5Context.acceptSecContext(Krb5Context.java:771) > ... 17 more > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)