Hi Experts:
     Seems like a bug in ExecutorAllocationManager, because 
numberMaxNeededExecutors value is negative in JMX exporter, which is 
unreasonable
     Spark Version is 3.1.1
    


And the following is the debug log, the numRunningOrPendingTasks get a negative 
value:


22/01/13 09:32:21 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -136, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -171, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -188, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -196, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -209, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -221, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -229, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -234, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -238, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -240, tasksperexecutor: 4

22/01/13 09:32:22 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -246, tasksperexecutor: 4

22/01/13 09:32:23 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -250, tasksperexecutor: 4

22/01/13 09:32:23 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -250, tasksperexecutor: 4

22/01/13 09:32:23 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -250, tasksperexecutor: 4

22/01/13 09:32:23 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -251, tasksperexecutor: 4

22/01/13 09:32:23 DEBUG ExecutorAllocationManager: max needed for rpId: 0 
numpending: -211, tasksperexecutor: 4




     The return value of maxNumExecutorsNeededPerResourceProfile method turns 
to be uncorrect after some hours running in thrift server.




Best 

xu

Reply via email to