chenruotao opened a new pull request, #7861:
URL: https://github.com/apache/hadoop/pull/7861

   https://issues.apache.org/jira/browse/YARN-11848
   
   ### Description of PR
   when nodemanager restart, there are too many logs like
   
   "2025-03-23 01:53:05,883 WARN [RM Event dispatcher] rmnode.RMNodeImpl 
(RMNodeImpl.java:handleRunningAppOnNode(791)) - Cannot get RMApp by 
appId=application_1737511638592_8413719, just added it to finishedApplications 
list for cleanup"
   
   print in resourcemanager.
   when too many nodemanagers are restarted at one time, resourcemanager will 
be blocked.
   I think nodemanager should be able to determine whether an app has fininshed 
on local and reduce the number of apps reported to resourcemanager.
   
   ### How was this patch tested?
   restart nodemanager and summary the num of app be reported to resourcemanager
   
   ### For code changes:
   only nodemanager
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to