[ https://issues.apache.org/jira/browse/CASSANDRA-20575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17946542#comment-17946542 ]
Ariel Weisberg edited comment on CASSANDRA-20575 at 4/22/25 8:17 PM:
---------------------------------------------------------------------

The test is sending requests to Accord before Accord is ready. The read is at {{ALL}} in this case. If you add a 30-second sleep after the replacement host is started, you will see Accord finish logging all the syncs and then become ready to execute requests.

Accord should probably not claim to be started when it's not ready, and I think regular Cassandra blocks startup during host replacement on streaming until everything is ready?

[~dcapwell] [~benedict]

was (Author: aweisberg):
The test is sending requests to Accord before Accord is ready. The read is at {{ALL}} in this case. If you add a 30-second sleep after the replacement host is started, you will see Accord finish logging all the syncs and then become ready to execute requests.

Accord should probably not claim to be started when it's not ready; I think regular Cassandra blocks startup until everything is ready?

[~dcapwell] [~benedict]

> Test failure: org.apache.cassandra.distributed.test.accord.AccordHostReplacementTest
> ------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-20575
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-20575
>             Project: Apache Cassandra
>          Issue Type: Bug
>          Components: Accord
>            Reporter: Brandon Williams
>            Priority: Normal
>             Fix For: 5.x
>
>
> https://app.circleci.com/pipelines/github/driftx/cassandra/1858/workflows/d65d081e-7abb-4af3-a28b-660a30ed4e1a/jobs/115464/tests
> {noformat}
> junit.framework.AssertionFailedError: Caught an exception at seed:8547030211730L
>     at org.apache.cassandra.harry.checker.TestHelper.withRandom(TestHelper.java:46)
>     at org.apache.cassandra.harry.checker.TestHelper.withRandom(TestHelper.java:34)
>     at org.apache.cassandra.distributed.test.accord.AccordHostReplacementTest.hostReplace(AccordHostReplacementTest.java:70)
>     at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:77)
>     at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> Caused by: java.lang.AssertionError: Caught an exception while validating Operation{ lts=1806 pd=-7561896253598983740, kind=SELECT_PARTITION}: CompiledStatement{cql=execute("SELECT * FROM distributed_test_keyspace.host_replace0 WHERE pk0 = ? AND pk1 = ? AND pk2 = ? AND pk3 = ? AND pk4 = ? AND pk5 = ? AND pk6 = ? AND pk7 = ? AND pk8 = ?;", "hthfocoup",(short)-2953,(int)388860433,"geudranla",java.util.UUID.fromString("e5b2e3ea-0b4c-4b8c-a3ba-5e2403e4e905"),(short)3616,(int)2065317688,"bufsve",(int)1324995027}
>     at org.apache.cassandra.harry.execution.RingAwareInJvmDTestVisitExecutor.executeValidatingVisit(RingAwareInJvmDTestVisitExecutor.java:113)
>     at org.apache.cassandra.harry.execution.CQLVisitExecutor.execute(CQLVisitExecutor.java:154)
>     at org.apache.cassandra.harry.execution.CQLVisitExecutor.executeVisit(CQLVisitExecutor.java:70)
>     at org.apache.cassandra.harry.dsl.ReplayingHistoryBuilder$1.build(ReplayingHistoryBuilder.java:49)
>     at org.apache.cassandra.harry.dsl.SingleOperationVisitBuilder.selectPartition(SingleOperationVisitBuilder.java:500)
>     at org.apache.cassandra.harry.dsl.HistoryBuilder.selectPartition(HistoryBuilder.java:238)
>     at org.apache.cassandra.distributed.test.accord.AccordHostReplacementTest.lambda$hostReplace$4(AccordHostReplacementTest.java:90)
>     at org.apache.cassandra.harry.checker.TestHelper.withRandom(TestHelper.java:42)
> Caused by: org.apache.cassandra.exceptions.ReadFailureException: Operation failed - received 0 responses and 0 failures
>     at org.apache.cassandra.service.accord.RequestBookkeeping.newFailed(RequestBookkeeping.java:106)
>     at org.apache.cassandra.service.accord.RequestBookkeeping.newFailed(RequestBookkeeping.java:80)
>     at org.apache.cassandra.service.accord.AccordResult.accept(AccordResult.java:145)
>     at org.apache.cassandra.service.accord.AccordResult.accept(AccordResult.java:51)
>     at accord.utils.async.AsyncResults$AbstractResult.notify(AsyncResults.java:85)
>     at accord.utils.async.AsyncResults$AbstractResult.trySetResult(AsyncResults.java:116)
>     at accord.utils.async.AsyncResults$AbstractResult.tryFailure(AsyncResults.java:129)
>     at accord.utils.async.AsyncResults$SettableResult.tryFailure(AsyncResults.java:255)
>     at accord.coordinate.AbstractCoordinatePreAccept.tryFailure(AbstractCoordinatePreAccept.java:109)
>     at accord.coordinate.AbstractCoordinatePreAccept.accept(AbstractCoordinatePreAccept.java:151)
>     at accord.coordinate.AbstractCoordinatePreAccept.accept(AbstractCoordinatePreAccept.java:42)
>     at org.apache.cassandra.service.accord.interop.AccordInteropExecution.lambda$start$6(AccordInteropExecution.java:406)
>     at accord.utils.async.AsyncChains$FlatMap.accept(AsyncChains.java:256)
>     at accord.utils.async.AsyncChains$FlatMap.accept(AsyncChains.java:246)
>     at accord.utils.async.AsyncResults$AbstractResult.notify(AsyncResults.java:85)
>     at accord.utils.async.AsyncResults$AbstractResult.trySetResult(AsyncResults.java:116)
>     at accord.utils.async.AsyncResults$AbstractResult.setResult(AsyncResults.java:147)
>     at accord.utils.async.AsyncChains.lambda$encapsulate$0(AsyncChains.java:504)
>     at org.apache.cassandra.concurrent.FutureTask$2.call(FutureTask.java:124)
>     at org.apache.cassandra.concurrent.FutureTask.call(FutureTask.java:61)
>     at org.apache.cassandra.concurrent.FutureTask.run(FutureTask.java:71)
>     at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:145)
>     at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30)
>     at java.base/java.lang.Thread.run(Thread.java:833)
>     Suppressed: java.lang.RuntimeException: org.apache.cassandra.exceptions.ReadTimeoutException: Operation timed out - received only 2 responses.
>         at accord.utils.WrappableException.wrap(WrappableException.java:56)
>     Caused by: org.apache.cassandra.exceptions.ReadTimeoutException: Operation timed out - received only 2 responses.
>         at org.apache.cassandra.service.reads.ReadCallback.awaitResults(ReadCallback.java:212)
>         at org.apache.cassandra.service.reads.AbstractReadExecutor.awaitResponses(AbstractReadExecutor.java:428)
>         at org.apache.cassandra.service.StorageProxy.fetchRows(StorageProxy.java:2625)
>         at org.apache.cassandra.service.accord.interop.AccordInteropExecution.lambda$keyReadChains$0(AccordInteropExecution.java:291)
>         at accord.utils.async.AsyncChains.lambda$encapsulate$0(AsyncChains.java:498)
> {noformat}

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
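
The comment above diagnoses the failure by adding a fixed 30-second sleep after the replacement host starts. As a rough sketch of how a test could wait for readiness instead of sleeping for a fixed time, the helper below polls a caller-supplied condition until it holds or a timeout expires. The class name {{AwaitReady}}, the method {{awaitReady}}, and the readiness predicate are all hypothetical and not part of Cassandra or Accord; the source does not say which signal Accord exposes for "ready to execute requests", so the predicate is left to the caller.

```java
import java.time.Duration;
import java.util.concurrent.TimeoutException;
import java.util.function.BooleanSupplier;

// Hypothetical test helper (not a Cassandra or Accord API): polls a readiness
// condition with a bounded timeout instead of sleeping for a fixed interval.
final class AwaitReady {
    private AwaitReady() {}

    /**
     * Blocks until isReady returns true, checking every pollInterval.
     * Throws TimeoutException if the condition is still false once timeout has elapsed.
     */
    static void awaitReady(BooleanSupplier isReady, Duration timeout, Duration pollInterval)
            throws InterruptedException, TimeoutException {
        long deadline = System.nanoTime() + timeout.toNanos();
        while (!isReady.getAsBoolean()) {
            // Overflow-safe comparison of nanoTime values.
            if (System.nanoTime() - deadline >= 0) {
                throw new TimeoutException("condition not met within " + timeout);
            }
            Thread.sleep(pollInterval.toMillis());
        }
    }
}
```

In a test, the predicate would wrap whatever readiness check is available on the replacement node; that check is an assumption here. Compared with a fixed sleep, a bounded poll returns as soon as the condition holds and fails with a clear timeout when it never does.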