Benjamin Teke created YARN-11463:
------------------------------------
Summary: Node Labels root directory creation doesn't have a retry
logic
Key: YARN-11463
URL: https://issues.apache.org/jira/browse/YARN-11463
Project: Hadoop YARN
Issue Type: Improvement
Components: capacity scheduler
Reporter: Benjamin Teke
When CS is initialized, it'll [try to create the configured node labels root
dir|https://github.com/apache/hadoop/blob/7169ec450957e5602775c3cd6fe1bf0b95773dfb/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/nodelabels/store/AbstractFSNodeStore.java#L69].
This however doesn't implement any kind of retry logic (in contrast to the RM
FS state store or ZK state store), hence if the distributed file system is
unavailable at the exact moment CS tries to start it'll fail. A retry logic
could be implemented to improve the robustness of the startup process.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]