Ferenc Erdelyi created YARN-11715:
-------------------------------------
Summary: NodeManager should recover by itself once the
container-executor can run program again
Key: YARN-11715
URL: https://issues.apache.org/jira/browse/YARN-11715
Project: Hadoop YARN
Issue Type: Improvement
Reporter: Ferenc Erdelyi
This is a continuation of the effort in YARN-11709, 'NodeManager should be
shut down or blacklisted when it cannot run program
"/var/lib/yarn-ce/bin/container-executor"'.
[~zeekling] kindly [reviewed my PR and
suggested|https://github.com/apache/hadoop/pull/6960#discussion_r1707259916]
that "It would be nice if it could automatically recover when
container-executor is ok."
--
This message was sent by Atlassian Jira
(v8.20.10#820010)