[ https://issues.apache.org/jira/browse/IGNITE-18610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17728334#comment-17728334 ]
Kirill Tkalenko edited comment on IGNITE-18610 at 6/1/23 12:34 PM:
-------------------------------------------------------------------

The problem remains the same: for in-memory tables, a second rebalance happens, which causes a race on install snapshot. Perhaps we can fix this when we come up with a new replication protocol for in-memory tables, or fix it later in IGNITE-19234.


was (Author: ktkale...@gridgain.com):
Problem remains the same, for in-memory tables a second rebalance happens which brings a race on install snapshot.

> ItTableRaftSnapshotsTest#leaderFeedsFollowerWithSnapshot hangs
> ---------------------------------------------------------------
>
>                 Key: IGNITE-18610
>                 URL: https://issues.apache.org/jira/browse/IGNITE-18610
>             Project: Ignite
>          Issue Type: Task
>            Reporter: Kirill Tkalenko
>            Priority: Major
>              Labels: ignite-3
>             Fix For: 3.0.0-beta2
>
>         Attachments: _Integration_Tests_Module_Runner_7366.log (1).zip
>
>
> It was found that
> *org.apache.ignite.internal.raftsnapshot.ItTableRaftSnapshotsTest#leaderFeedsFollowerWithSnapshot*
> can hang. It is not yet clear why, but the following stack trace was found
> in the logs ([TC link|https://ci.ignite.apache.org/viewLog.html?buildId=7023641&tab=buildResultsDiv&buildTypeId=ApacheIgnite3xGradle_Test_IntegrationTests_ModuleRunner&logTab=]).
> I will also attach [^_Integration_Tests_Module_Runner_7366.log (1).zip].
> To start the analysis, find the line
> *org.apache.ignite.internal.raftsnapshot.ItTableRaftSnapshotsTest.leaderFeedsFollowerWithSnapshot([3] aimem) (running for 25m:11s)*.
> {noformat}
> 2023-01-23 20:09:22:012 +0300 [ERROR][%itrst_lffws_2%JRaft-FSMCaller-Disruptor-_stripe_3-0][StripedDisruptor] Handle disruptor event error [name=%itrst_lffws_2%JRaft-FSMCaller-Disruptor-, event=org.apache.ignite.raft.jraft.core.FSMCallerImpl$ApplyTask@766e6436, hasHandler=false]
> org.apache.ignite.internal.storage.StorageClosedException: IGN-STORAGE-3 TraceId:7e8dadf9-d715-494e-817f-67e2a75eabba Storage is already closed: [table=TEST, partitionId=0]
>     at org.apache.ignite.internal.storage.util.StorageUtils.throwExceptionDependingOnStorageState(StorageUtils.java:81)
>     at org.apache.ignite.internal.storage.pagememory.mv.AbstractPageMemoryMvPartitionStorage.busy(AbstractPageMemoryMvPartitionStorage.java:1037)
>     at org.apache.ignite.internal.storage.pagememory.mv.VolatilePageMemoryMvPartitionStorage.lastAppliedIndex(VolatilePageMemoryMvPartitionStorage.java:102)
>     at org.apache.ignite.internal.table.distributed.raft.snapshot.outgoing.SnapshotAwarePartitionDataStorage.lastAppliedIndex(SnapshotAwarePartitionDataStorage.java:87)
>     at org.apache.ignite.internal.table.distributed.raft.PartitionListener.onConfigurationCommitted(PartitionListener.java:394)
>     at org.apache.ignite.internal.raft.server.impl.JraftServerImpl$DelegatingStateMachine.onRawConfigurationCommitted(JraftServerImpl.java:642)
>     at org.apache.ignite.raft.jraft.core.FSMCallerImpl.doSnapshotLoad(FSMCallerImpl.java:695)
>     at org.apache.ignite.raft.jraft.core.FSMCallerImpl.runApplyTask(FSMCallerImpl.java:403)
>     at org.apache.ignite.raft.jraft.core.FSMCallerImpl$ApplyTaskHandler.onEvent(FSMCallerImpl.java:136)
>     at org.apache.ignite.raft.jraft.core.FSMCallerImpl$ApplyTaskHandler.onEvent(FSMCallerImpl.java:130)
>     at org.apache.ignite.raft.jraft.disruptor.StripedDisruptor$StripeEntryHandler.onEvent(StripedDisruptor.java:217)
>     at org.apache.ignite.raft.jraft.disruptor.StripedDisruptor$StripeEntryHandler.onEvent(StripedDisruptor.java:181)
>     at com.lmax.disruptor.BatchEventProcessor.run(BatchEventProcessor.java:137)
>     at java.base/java.lang.Thread.run(Thread.java:834)
> {noformat}

--
This message was sent by Atlassian Jira
(v8.20.10#820010)