Add mechanism to restore the removed storage directories
--------------------------------------------------------
Key: HDFS-3075
URL: https://issues.apache.org/jira/browse/HDFS-3075
Project: Hadoop HDFS
Issue Type: Improvement
Components: name-node
Affects Versions: 0.24.0, 1.1.0
Reporter: Brandon Li
Assignee: Brandon Li
When a storage directory is inaccessible, namenode removes it from the valid
storage dir list to a removedStorageDirs list. Those storage directories will
not be restored when they become healthy again.
The proposed solution is to restore the previous failed directories at the
beginning of checkpointing, say, rollEdits, by copying necessary metadata files
from healthy directory to unhealthy ones. In this way, whenever a failed
storage directory is recovered by the administrator, he/she can immediately
force a checkpointing to restored a failed directory.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira