[ https://issues.apache.org/jira/browse/IGNITE-25206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17953695#comment-17953695 ]
Vladislav Pyatkov commented on IGNITE-25206: -------------------------------------------- Merged 0b1b6280e3d82251c1a83afd9cb79542ca96aeaf > Lease granted message not handled due to timeout > ------------------------------------------------ > > Key: IGNITE-25206 > URL: https://issues.apache.org/jira/browse/IGNITE-25206 > Project: Ignite > Issue Type: Bug > Reporter: Roman Puchkovskiy > Assignee: Denis Chudov > Priority: Major > Labels: ignite-3 > Attachments: _Integration_Tests_Module_Transactions_17300.log.zip > > Time Spent: 20m > Remaining Estimate: 0h > > LeaseGrantedMessage handling sometimes fails with TimeoutExceptions: > > {code:java} > [14:57:15]W: > [2025-04-21T10:57:15,004][ERROR][CompletableFutureDelayScheduler][FailureManager] > Critical system error detected. Will be handled accordingly to configured > handler [hnd=NoOpFailureHandler [super=AbstractFailureHandler > [ignoredFailureTypes=UnmodifiableSet [SYSTEM_WORKER_BLOCKED, > SYSTEM_CRITICAL_OPERATION_TIMEOUT]]], failureCtx=CRITICAL_ERROR] > [14:57:15]W: org.apache.ignite.internal.failure.StackTraceCapturingException: > Failed to process the lease granted message [msg=LeaseGrantedMessageImpl > [force=false, groupId=19_part_1, leaseExpirationTime=HybridTimestamp > [physical=2025-04-21 10:57:15:003 +0000, logical=0, > composite=114375592181956608], leaseStartTime=HybridTimestamp > [physical=2025-04-21 10:55:15:003 +0000, logical=2, > composite=114375584317636610]]]. > [14:57:15]W: at > org.apache.ignite.internal.failure.FailureManager.process(FailureManager.java:161) > ~[ignite-failure-handler-3.1.0-SNAPSHOT.jar:?] > [14:57:15]W: at > org.apache.ignite.internal.failure.FailureManager.process(FailureManager.java:138) > ~[ignite-failure-handler-3.1.0-SNAPSHOT.jar:?] > [14:57:15]W: at > org.apache.ignite.internal.replicator.PlacementDriverMessageProcessor.lambda$processPlacementDriverMessage$0(PlacementDriverMessageProcessor.java:145) > ~[ignite-replicator-3.1.0-SNAPSHOT.jar:?] > [14:57:15]W: at > java.base/java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:934) > [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:911) > [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510) > [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2162) > [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.CompletableFuture$Timeout.run(CompletableFuture.java:2874) > [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:539) > [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) > [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) > [?:?] > [14:57:15]W: at > java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) > [?:?] > [14:57:15]W: at java.base/java.lang.Thread.run(Thread.java:833) [?:?] > [14:57:15]W: Caused by: java.util.concurrent.CompletionException: > java.util.concurrent.TimeoutException > [14:57:15]W: at > java.base/java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:332) > ~[?:?] > [14:57:15]W: at > java.base/java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:347) > ~[?:?] > [14:57:15]W: at > java.base/java.util.concurrent.CompletableFuture$UniCompose.tryFire(CompletableFuture.java:1141) > ~[?:?] > [14:57:15]W: ... 9 more > [14:57:15]W: Caused by: java.util.concurrent.TimeoutException > [14:57:15]W: ... 7 more{code} > > It doesn't seem to ever fail any test to fail, so currently TimeoutException > is just suppressed if called in > PlacementDriverMessageProcessor#processPlacementDriverMessage() (see the > TODO), but it seems that an investigation is needed to either prove that this > is safe (in such a case, the TODO is to be replaced with a comment explaining > why it's safe), or to fix this if this is a bug. -- This message was sent by Atlassian Jira (v8.20.10#820010)