chenruotao opened a new pull request, #7861: URL: https://github.com/apache/hadoop/pull/7861
https://issues.apache.org/jira/browse/YARN-11848 ### Description of PR when nodemanager restart, there are too many logs like "2025-03-23 01:53:05,883 WARN [RM Event dispatcher] rmnode.RMNodeImpl (RMNodeImpl.java:handleRunningAppOnNode(791)) - Cannot get RMApp by appId=application_1737511638592_8413719, just added it to finishedApplications list for cleanup" print in resourcemanager. when too many nodemanagers are restarted at one time, resourcemanager will be blocked. I think nodemanager should be able to determine whether an app has fininshed on local and reduce the number of apps reported to resourcemanager. ### How was this patch tested? restart nodemanager and summary the num of app be reported to resourcemanager ### For code changes: only nodemanager -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org