[ 
https://issues.apache.org/jira/browse/KAFKA-19022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17938478#comment-17938478
 ] 

Lorcan edited comment on KAFKA-19022 at 3/26/25 8:20 AM:
---------------------------------------------------------

Hi [~rsamudrala], I've checked the KafkaRaftClient class for the 
Inconsistent_Cluster_Id error and it looks like there are several instances 
where a customised error message with the cluster ids is not being set:

VoteResponseData
BeginQuorumEpochResponseData
EndQuorumEpochResponseData
FetchResponseData
FetchSnapshotResponseData

These all implement the ApiMessage, just like AddRaftVoterResponseData does. It 
looks like this would require a new schema for the above classes, given the 
logic when new fields have been added and I'm not sure if this would require a 
KIP or not.


was (Author: JIRAUSER308669):
Hi [~rsamudrala], I've checked the KafkaRaftClient class for the 
Inconsistent_Cluster_Id error and it looks like there are several instances 
where a customised error message with the cluster ids is not being set:

VoteResponseData
BeginQuorumEpochResponseData
EndQuorumEpochResponseData
FetchResponseData
FetchSnapshotResponseData

These all implement the ApiMessage, just like AddRaftVoterResponseData does. I 
can create a PR to add an errorMessage field to FetchResponseData to get 
feedback and see if this is a viable approach.

> Display cluster IDs being compared when encountering INCONSISTENT_CLUSTER_ID 
> error
> ----------------------------------------------------------------------------------
>
>                 Key: KAFKA-19022
>                 URL: https://issues.apache.org/jira/browse/KAFKA-19022
>             Project: Kafka
>          Issue Type: Improvement
>          Components: kraft, logging
>    Affects Versions: 3.9.0
>            Reporter: Ranganath Samudrala
>            Assignee: Lorcan
>            Priority: Major
>
> While migrating Kafka from zookeeper to kraft, we see errors in logs like
> {{INCONSISTENT_CLUSTER_ID in FETCH response }}
> or
> {{INCONSISTENT_CLUSTER_ID in VOTER response }}
> But cluster IDs being compared is not displayed in logs so there is not 
> enough information to see where the issue is. Is the class data *clusterId* 
> empty (which could potentially be a bug?) or incoming *clusterId* empty or 
> incorrect?
> [KafkaRaftClient|https://github.com/apache/kafka/blob/31e1a57c41cf9cb600751669dc71bcd9596b45f9/raft/src/main/java/org/apache/kafka/raft/KafkaRaftClient.java#L1459]
> {code:java}
>     private boolean hasValidClusterId(String requestClusterId) {
>         // We don't enforce the cluster id if it is not provided.
>         if (requestClusterId == null)Unknown macro: {
>                       return true;
>                  }
>                 return clusterId.equals(requestClusterId);
>      }
> .
> .
>     private CompletableFuture<FetchResponseData> handleFetchRequest(
>         RaftRequest.Inbound requestMetadata,
>         long currentTimeMs
>     ) {
>         FetchRequestData request = (FetchRequestData) requestMetadata.data();
>         if (!hasValidClusterId(request.clusterId())) {
>                      return completedFuture(new 
> FetchResponseData().setErrorCode(Errors.INCONSISTENT_CLUSTER_ID.code()));
>                  }
> .
> .
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to