Liu created FLINK-25566: --------------------------- Summary: Fail to cancel task if disk is bad for java.lang.NoClassDefFoundError Key: FLINK-25566 URL: https://issues.apache.org/jira/browse/FLINK-25566 Project: Flink Issue Type: Improvement Components: Runtime / Task Reporter: Liu Attachments: image-2022-01-07-19-07-10-968.png, image-2022-01-07-19-08-49-038.png, image-2022-01-07-19-11-39-448.png
When we detecting disk error, we will restart the job to rescale. However, the related task will stuck in cancelling for java.lang.NoClassDefFoundError. !image-2022-01-07-19-08-49-038.png|width=743,height=157! In the TaskManagerRunner's method onFatalError, it will not terminateJVM at once. The process will stuck in the disk. !image-2022-01-07-19-11-39-448.png|width=1085,height=400! In this case, maybe we should terminate the container at once. -- This message was sent by Atlassian Jira (v8.20.1#820001)