Chris Nauroth created HDFS-7121:
-----------------------------------
Summary: For JournalNode operations that must succeed on all
nodes, attempt to undo the operation on all nodes if it fails on one node.
Key: HDFS-7121
URL: https://issues.apache.org/jira/browse/HDFS-7121
Project: Hadoop HDFS
Issue Type: Improvement
Components: journal-node
Reporter: Chris Nauroth
Several JournalNode operations are not satisfied by a quorum. They must
succeed on every JournalNode in the cluster. If the operation succeeds on some
nodes, but fails on others, then this may leave the nodes in an inconsistent
state and require operations to do manual recovery steps. For example, if
{{doPreUpgrade}} succeeds on 2 nodes and fails on 1 node, then the operator
will need to correct the problem on the failed node and also manually restore
the previous.tmp directory to current on the 2 successful nodes before
reattempting the upgrade.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)