Hi Sébastien,

The error message you shared came from the repair coordinator node's log, and it is the result of failures reported by 3 other nodes. If you look at the logs on the 3 nodes listed in the error message (135.181.222.100, 135.181.217.109 and 135.181.221.180), you should find that between the start time of the repair and 2021-05-26 15:54:19 (the timestamp on the error message), each of them has at least one error message indicating the exact reason why it failed. There are many reasons a repair can fail, and schema disagreement is only one of them. By looking at the logs on the other nodes, you should be able to find the root cause and then come up with a solution.
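
If it helps, here is a rough Python sketch for narrowing down a node's log to that time window. It assumes the log lines follow the default Cassandra logback layout (level, thread in brackets, then a timestamp like 2021-05-26 15:54:19,981); if your logback.xml uses a different pattern, the regular expression will need adjusting. The function name `find_errors` and the start time used below are hypothetical, and the log file path is whatever you pass on the command line.

```python
import re
import sys
from datetime import datetime

# Assumed line shape (default Cassandra logback pattern, may differ on your nodes):
# ERROR [Some-Thread:1] 2021-05-26 15:54:19,981 SomeFile.java:123 - message
LINE_RE = re.compile(
    r"^(?P<level>ERROR|WARN)\s+\[(?P<thread>[^\]]*)\]\s+"
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\s+(?P<rest>.*)$"
)
TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def find_errors(lines, start, end):
    """Return (timestamp, level, rest) for ERROR/WARN lines with start <= timestamp <= end."""
    hits = []
    for line in lines:
        m = LINE_RE.match(line)
        if not m:
            # Skips INFO/DEBUG lines and stack-trace continuation lines.
            continue
        ts = datetime.strptime(m.group("ts"), TS_FORMAT)
        if start <= ts <= end:
            hits.append((ts, m.group("level"), m.group("rest")))
    return hits


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical repair start time: replace with when the repair actually began.
    start = datetime(2021, 5, 26, 15, 0, 0)
    # Just after the timestamp on the coordinator's error message.
    end = datetime(2021, 5, 26, 15, 54, 20)
    with open(sys.argv[1], encoding="utf-8", errors="replace") as f:
        for ts, level, rest in find_errors(f, start, end):
            print(ts, level, rest)
```

You would run it once on each of the 3 nodes, passing the path to that node's system log as the only argument. It only prints the first line of each matching entry, so open the log at the reported timestamp to read the full stack trace.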


Regards,

Bowen

On 26/05/2021 15:05, Sébastien Rebecchi wrote:
Hi,

I have an issue with repairing my Cassandra cluster. It was already the case with Cassandra 3, and the issue is not solved with Cassandra 4 RC1.

I run the following command in a for loop, one node at a time:

nodetool -h THE_NODE -u jTHE_USER -pw THE_PASSWORD repair --full -pr

and I always get the following error; see the message and stack trace for Cassandra 4 RC1 at the bottom of this message (it is the same for C3).

I don't know what to do with that. Are there mistakes I could have made in my table design that would explain it? I have heard, for example, that big partitions are not recommended, so I changed my data model to remove the clustering keys I had before and split the big partitions into many independent ones. The partitions are now at most 500KB each (the vast majority are at most 100KB), but it did not change anything. My partition key was also a compound of 9 columns, and I changed it to a single column by generating ids myself. The result is the same, no improvement.

Thank you for your help,

Sébastien

--

error: Repair job has failed with the error message: [2021-05-26 15:54:19,981] Repair command #2 failed with error Got negative replies from endpoints [135.181.222.100, 135.181.217.109, 135.181.221.180]
-- StackTrace --
java.lang.RuntimeException: Repair job has failed with the error message: [2021-05-26 15:54:19,981] Repair command #2 failed with error Got negative replies from endpoints [135.181.222.100, 135.181.217.109, 135.181.221.180]
at org.apache.cassandra.tools.RepairRunner.progress(RepairRunner.java:116)
at org.apache.cassandra.utils.progress.jmx.JMXNotificationProgressListener.handleNotification(JMXNotificationProgressListener.java:77)
at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.dispatchNotification(ClientNotifForwarder.java:583)
at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.doRun(ClientNotifForwarder.java:533)
at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.run(ClientNotifForwarder.java:452)
at com.sun.jmx.remote.internal.ClientNotifForwarder$LinearExecutor$1.run(ClientNotifForwarder.java:108)
