[ 
https://issues.apache.org/jira/browse/SOLR-6969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14274046#comment-14274046
 ] 

Mark Miller commented on SOLR-6969:
-----------------------------------

{noformat}
ERROR - 2015-01-12 17:49:43.992; org.apache.solr.common.SolrException; Failure 
to open existing log file (non fatal) 
hdfs://localhost:8020/solr_test/collection1/core_node1/data/tlog/tlog.0000000000000000000:org.apache.solr.common.SolrException:
 
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.hdfs.protocol.RecoveryInProgressException):
 Failed to close file 
/solr_test/collection1/core_node1/data/tlog/tlog.0000000000000000000. Lease 
recovery is in progress. Try again later.
        at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.recoverLeaseInternal(FSNamesystem.java:2626)
        at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.appendFileInternal(FSNamesystem.java:2462)
        at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.appendFileInt(FSNamesystem.java:2700)
        at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.appendFile(FSNamesystem.java:2663)
        at 
org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.append(NameNodeRpcServer.java:559)
        at 
org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.append(ClientNamenodeProtocolServerSideTranslatorPB.java:388)
        at 
org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
        at 
org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2013)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2009)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2007)

        at 
org.apache.solr.update.HdfsTransactionLog.<init>(HdfsTransactionLog.java:121)
        at org.apache.solr.update.HdfsUpdateLog.init(HdfsUpdateLog.java:190)
        at org.apache.solr.update.UpdateHandler.<init>(UpdateHandler.java:134)
        at org.apache.solr.update.UpdateHandler.<init>(UpdateHandler.java:94)
{noformat}

> Just like we have to retry when the NameNode is in safemode on Solr startup, 
> we also need to retry when opening a transaction log file for append when we 
> get a RecoveryInProgressException.
> --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-6969
>                 URL: https://issues.apache.org/jira/browse/SOLR-6969
>             Project: Solr
>          Issue Type: Bug
>          Components: hdfs
>            Reporter: Mark Miller
>            Assignee: Mark Miller
>
> This can happen after a hard crash and restart. The current workaround is to 
> stop and wait it out and start again. We should retry and wait a given amount 
> of time as we do when we detect safe mode though.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to