[ https://issues.apache.org/jira/browse/FLINK-32972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17759513#comment-17759513 ]
Matthias Pohl edited comment on FLINK-32972 at 11/20/23 7:24 AM: ----------------------------------------------------------------- 1.17: [https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=52682&view=logs&j=4d4a0d10-fca2-5507-8eed-c07f0bdf4887&t=7b25afdf-cc6c-566f-5459-359dc2585798&l=8716] was (Author: sergey nuyanzin): https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=52682&view=logs&j=4d4a0d10-fca2-5507-8eed-c07f0bdf4887&t=7b25afdf-cc6c-566f-5459-359dc2585798&l=8716 > TaskTest.testInterruptibleSharedLockInInvokeAndCancel causes a JVM shutdown > with exit code 239 > ---------------------------------------------------------------------------------------------- > > Key: FLINK-32972 > URL: https://issues.apache.org/jira/browse/FLINK-32972 > Project: Flink > Issue Type: Bug > Components: Runtime / Coordination > Affects Versions: 1.17.2 > Reporter: Sergey Nuyanzin > Priority: Major > Labels: test-stability > > Within this build > [https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=52668&view=logs&j=b0a398c0-685b-599c-eb57-c8c2a771138e&t=747432ad-a576-5911-1e2a-68c6bedc248a&l=8677] > it looks like task > {{1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0}} was > canceled > {noformat} > ================================================================================ > Test > testInterruptibleSharedLockInInvokeAndCancel(org.apache.flink.runtime.taskmanager.TaskTest) > is running. > -------------------------------------------------------------------------------- > 01:30:05,140 [ main] INFO > org.apache.flink.runtime.io.network.NettyShuffleServiceFactory [] - Created a > new FileChannelManager for storing result partitions of BLOCKING shuffles. > Used directories: > /tmp/flink-netty-shuffle-82415974-782a-46db-afbc-8f18f30a4ec5 > 01:30:05,177 [ main] INFO > org.apache.flink.runtime.io.network.buffer.NetworkBufferPool [] - Allocated > 32 MB for network buffer pool (number of memory segments: 1024, bytes per > segment: 32768). > 01:30:05,181 [ Test Task (1/1)#0] INFO > org.apache.flink.runtime.taskmanager.Task [] - Test Task > (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0) > switched from CREATED to DEPLOYING. > 01:30:05,190 [ Test Task (1/1)#0] INFO > org.apache.flink.runtime.taskmanager.Task [] - Loading JAR > files for task Test Task (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0) > [DEPLOYING]. > 01:30:05,192 [ Test Task (1/1)#0] INFO > org.apache.flink.runtime.taskmanager.Task [] - Test Task > (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0) > switched from DEPLOYING to INITIALIZING. > 01:30:05,192 [ Test Task (1/1)#0] INFO > org.apache.flink.runtime.taskmanager.Task [] - Test Task > (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0) > switched from INITIALIZING to RUNNING. > 01:30:05,195 [ main] INFO > org.apache.flink.runtime.taskmanager.Task [] - Attempting > to cancel task Test Task (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0). > 01:30:05,196 [ main] INFO > org.apache.flink.runtime.taskmanager.Task [] - Test Task > (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0) > switched from RUNNING to CANCELING. > 01:30:05,196 [ main] INFO > org.apache.flink.runtime.taskmanager.Task [] - Triggering > cancellation of task code Test Task (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0). > 01:30:05,197 [ Test Task (1/1)#0] INFO > org.apache.flink.runtime.taskmanager.Task [] - Test Task > (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0) > switched from CANCELING to CANCELED. > 01:30:05,198 [ Test Task (1/1)#0] INFO > org.apache.flink.runtime.taskmanager.Task [] - Freeing > task resources for Test Task (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0). > {noformat} > and after that there are records in logs complaining htat task did not react > {noformat} > 01:30:05,337 [Canceler/Interrupts for Test Task (1/1)#0 > (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0).] > WARN org.apache.flink.runtime.taskmanager.Task [] - Task > 'Test Task (1/1)#0' did not react to cancelling signal - interrupting; it is > stuck for 0 seconds in method: > > app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:322) > app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:327) > app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:327) > app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:327) > app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:327) > app//org.apache.flink.runtime.metrics.groups.ComponentMetricGroup.close(ComponentMetricGroup.java:62) > app//org.apache.flink.runtime.metrics.groups.TaskMetricGroup.close(TaskMetricGroup.java:179) > app//org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:866) > app//org.apache.flink.runtime.taskmanager.Task.run(Task.java:562) > java.base@11.0.11/java.lang.Thread.run(Thread.java:829) > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)