[jira] [Resolved] (HDFS-134) premature end-of-decommission of datanodes

Allen Wittenauer (JIRA) Thu, 17 Jul 2014 22:21:33 -0700

     [ 
https://issues.apache.org/jira/browse/HDFS-134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Allen Wittenauer resolved HDFS-134.
-----------------------------------

    Resolution: Fixed

This has probably been fixed. Gonna close this as stale.

> premature end-of-decommission of datanodes
> ------------------------------------------
>
>                 Key: HDFS-134
>                 URL: https://issues.apache.org/jira/browse/HDFS-134
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: dhruba borthakur
>
> Decommissioning requires that the nodes be listed in the dfs.hosts.excludes 
> file. The administrator runs the "dfsadmin -refreshNodes" command. The 
> decommissioning process starts off. Suppose that one of the datanodes that 
> was being decommisioned has to re-register with the namenode. This can occur 
> if the namenode restarts or if the datanode restarts while the 
> decommissioning was in progress. Now, the namenode refuses to talk to this 
> datanode because it is in the excludes list! This is a premature end of the 
> decommissioning process.
> One way to fix this bug is to make the namenode always accept registration 
> requests, even for datanodes that are in the exclude list. The namenode, 
> however, should set the "being decommissioned" flag for these datanodes. It 
> should then re-start the decommisioning process for these datanodes. When the 
> decommissioning is complete, the namenode will ask the datanodes to shutdown.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Resolved] (HDFS-134) premature end-of-decommission of datanodes

Reply via email to