Jing Zhao created HDFS-6229:
-------------------------------

             Summary: Race condition in failover can cause RetryCache fail to 
work
                 Key: HDFS-6229
                 URL: https://issues.apache.org/jira/browse/HDFS-6229
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: ha
    Affects Versions: 2.1.0-beta
            Reporter: Jing Zhao
            Assignee: Jing Zhao


Currently when NN failover happens, the old SBN first sets its state to active, 
then starts the active services (including tailing all the remaining editlog 
and building a complete retry cache based on the editlog). If a retry request, 
which has already succeeded in the old ANN (but the client fails to receive the 
response), comes in between, this retry may still get served by the new ANN but 
miss the retry cache.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to