[ https://issues.apache.org/jira/browse/IGNITE-25374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Roman Puchkovskiy updated IGNITE-25374: --------------------------------------- Description: ESTOP is returned if a Raft node is fully stopped. We should retry our request when getting this error code. [2025-05-12T09:23:55,662][ERROR][%irlt_trisons_20001%MessagingService-inbound-Default-0-0][FailureManager] Critical system error detected. Will be handled accordingly to configured handler [hnd=NoOpFailureHandler [super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet [SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]], failureCtx=CRITICAL_ERROR] org.apache.ignite.internal.failure.StackTraceCapturingException: Unknown error at org.apache.ignite.internal.failure.FailureManager.process(FailureManager.java:161) ~[ignite-failure-handler-3.1.0-SNAPSHOT.jar:?] at org.apache.ignite.internal.failure.FailureManager.process(FailureManager.java:138) ~[ignite-failure-handler-3.1.0-SNAPSHOT.jar:?] at org.apache.ignite.internal.metastorage.server.WatchProcessor.notifyFailureHandlerOnFirstFailureInNotificationChain(WatchProcessor.java:405) ~[ignite-metastorage-3.1.0-SNAPSHOT.jar:?] at org.apache.ignite.internal.metastorage.server.WatchProcessor.lambda$enqueue$3(WatchProcessor.java:233) ~[ignite-metastorage-3.1.0-SNAPSHOT.jar:?] at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863) ~[?:?] at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841) ~[?:?] at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510) ~[?:?] at java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2162) ~[?:?] at org.apache.ignite.internal.raft.RaftGroupServiceImpl.handleErrorResponse(RaftGroupServiceImpl.java:770) ~[ignite-raft-3.1.0-SNAPSHOT.jar:?] at org.apache.ignite.internal.raft.RaftGroupServiceImpl.lambda$sendWithRetry$49(RaftGroupServiceImpl.java:639) ~[ignite-raft-3.1.0-SNAPSHOT.jar:?] at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863) ~[?:?] at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841) ~[?:?] at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510) ~[?:?] at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147) ~[?:?] at org.apache.ignite.internal.network.DefaultMessagingService.onInvokeResponse(DefaultMessagingService.java:587) ~[ignite-network-3.1.0-SNAPSHOT.jar:?] at org.apache.ignite.internal.network.DefaultMessagingService.handleInvokeResponse(DefaultMessagingService.java:480) ~[ignite-network-3.1.0-SNAPSHOT.jar:?] at org.apache.ignite.internal.network.DefaultMessagingService.lambda$handleMessageFromNetwork$4(DefaultMessagingService.java:414) ~[ignite-network-3.1.0-SNAPSHOT.jar:?] at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) [?:?] at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) [?:?] at java.base/java.lang.Thread.run(Thread.java:833) [?:?] Caused by: java.util.concurrent.CompletionException: org.apache.ignite.raft.jraft.rpc.impl.RaftException: IGN-CMN-65535 TraceId:287ed18d-0959-4b28-91c6-90418b9af00a ESTOP:Node is quit. at java.base/java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:332) ~[?:?] at java.base/java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:347) ~[?:?] at java.base/java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:636) ~[?:?] ... 14 more Caused by: org.apache.ignite.raft.jraft.rpc.impl.RaftException: ESTOP:Node is quit. ... 12 more > Retry on ESTOP in RaftGroupServiceImpl > -------------------------------------- > > Key: IGNITE-25374 > URL: https://issues.apache.org/jira/browse/IGNITE-25374 > Project: Ignite > Issue Type: Improvement > Reporter: Roman Puchkovskiy > Assignee: Roman Puchkovskiy > Priority: Major > Labels: ignite-3 > Attachments: _Integration_Tests_Run_All_Other_36062.log.zip > > Time Spent: 10m > Remaining Estimate: 0h > > ESTOP is returned if a Raft node is fully stopped. We should retry our > request when getting this error code. > [2025-05-12T09:23:55,662][ERROR][%irlt_trisons_20001%MessagingService-inbound-Default-0-0][FailureManager] > Critical system error detected. Will be handled accordingly to configured > handler [hnd=NoOpFailureHandler [super=AbstractFailureHandler > [ignoredFailureTypes=UnmodifiableSet [SYSTEM_WORKER_BLOCKED, > SYSTEM_CRITICAL_OPERATION_TIMEOUT]]], failureCtx=CRITICAL_ERROR] > org.apache.ignite.internal.failure.StackTraceCapturingException: Unknown error > at > org.apache.ignite.internal.failure.FailureManager.process(FailureManager.java:161) > ~[ignite-failure-handler-3.1.0-SNAPSHOT.jar:?] > at > org.apache.ignite.internal.failure.FailureManager.process(FailureManager.java:138) > ~[ignite-failure-handler-3.1.0-SNAPSHOT.jar:?] > at > org.apache.ignite.internal.metastorage.server.WatchProcessor.notifyFailureHandlerOnFirstFailureInNotificationChain(WatchProcessor.java:405) > ~[ignite-metastorage-3.1.0-SNAPSHOT.jar:?] > at > org.apache.ignite.internal.metastorage.server.WatchProcessor.lambda$enqueue$3(WatchProcessor.java:233) > ~[ignite-metastorage-3.1.0-SNAPSHOT.jar:?] > at > java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863) > ~[?:?] > at > java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841) > ~[?:?] > at > java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510) > ~[?:?] > at > java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2162) > ~[?:?] > at > org.apache.ignite.internal.raft.RaftGroupServiceImpl.handleErrorResponse(RaftGroupServiceImpl.java:770) > ~[ignite-raft-3.1.0-SNAPSHOT.jar:?] > at > org.apache.ignite.internal.raft.RaftGroupServiceImpl.lambda$sendWithRetry$49(RaftGroupServiceImpl.java:639) > ~[ignite-raft-3.1.0-SNAPSHOT.jar:?] > at > java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863) > ~[?:?] > at > java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841) > ~[?:?] > at > java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510) > ~[?:?] > at > java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147) > ~[?:?] > at > org.apache.ignite.internal.network.DefaultMessagingService.onInvokeResponse(DefaultMessagingService.java:587) > ~[ignite-network-3.1.0-SNAPSHOT.jar:?] > at > org.apache.ignite.internal.network.DefaultMessagingService.handleInvokeResponse(DefaultMessagingService.java:480) > ~[ignite-network-3.1.0-SNAPSHOT.jar:?] > at > org.apache.ignite.internal.network.DefaultMessagingService.lambda$handleMessageFromNetwork$4(DefaultMessagingService.java:414) > ~[ignite-network-3.1.0-SNAPSHOT.jar:?] > at > java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) > [?:?] > at > java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) > [?:?] > at java.base/java.lang.Thread.run(Thread.java:833) [?:?] > Caused by: java.util.concurrent.CompletionException: > org.apache.ignite.raft.jraft.rpc.impl.RaftException: IGN-CMN-65535 > TraceId:287ed18d-0959-4b28-91c6-90418b9af00a ESTOP:Node is quit. > at > java.base/java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:332) > ~[?:?] > at > java.base/java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:347) > ~[?:?] > at > java.base/java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:636) > ~[?:?] > ... 14 more > Caused by: org.apache.ignite.raft.jraft.rpc.impl.RaftException: ESTOP:Node is > quit. > ... 12 more -- This message was sent by Atlassian Jira (v8.20.10#820010)