[ https://issues.apache.org/jira/browse/KAFKA-9171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rajini Sivaram resolved KAFKA-9171. ----------------------------------- Reviewer: Ismael Juma Resolution: Fixed > DelayedFetch completion may throw exception, causing successful produce to be > failed > ------------------------------------------------------------------------------------ > > Key: KAFKA-9171 > URL: https://issues.apache.org/jira/browse/KAFKA-9171 > Project: Kafka > Issue Type: Bug > Components: core > Affects Versions: 2.4.0 > Reporter: Rajini Sivaram > Assignee: Rajini Sivaram > Priority: Major > Fix For: 2.4.0 > > > I was looking at the logs of the system test failure of > ReassignPartitionsTest. > Logs show produce error ReplicaNotAvailableException for two records in the > producer log, but the data logs of all the brokers contain the records. The > offsets of these records are returned as successful produce for two > subsequent records which don't appear in the logs and hence the test failed. > Broker logs of the leader at the time of the reassignment and leader change > show: > > {{[2019-11-11 07:23:17,727] ERROR [ReplicaManager broker=3] Error processing > append operation on partition test_topic-17 (kafka.server.ReplicaManager) > org.apache.kafka.common.errors.ReplicaNotAvailableException: Partition > test_topic-5 is not available}} > This is failing the append operation on `test_topic-17` when a different > partition `test_topic-5` was unavailable for fetch. I think it is fetch since > produce would have thrown NotLeaderForPartitionException rather than > ReplicaNotAvailableException. > We don't expect DelayedFetch to throw exceptions and it looks like we are not > handling `ReplicaNotAvailableException`. > I am not sure if this fixes the issues with ReassignPartitionsTest, but this > seems to a scenario that we should fix. -- This message was sent by Atlassian Jira (v8.3.4#803005)