Justine Olshan created KAFKA-18137:
--------------------------------------

             Summary: Unloading transaction state incorrectly removes loading 
partitions
                 Key: KAFKA-18137
                 URL: https://issues.apache.org/jira/browse/KAFKA-18137
             Project: Kafka
          Issue Type: Task
            Reporter: Justine Olshan
            Assignee: Justine Olshan


When there is a become follower transition on a transaction coordinator state 
partition, we intend to unload the state partition. However, we pass the new 
epoch to the method that does the unloading. In that method, we create a 
`TransactionPartitionAndLeaderEpoch` object comprising of the topic partition 
and the epoch that we use as a key to remove the partition from loading. 
However, we wouldn't ever expect to see this epoch in that map since we only 
load on the leader. See the code snippet: 
[https://github.com/apache/kafka/blob/d00f0ecf1a1a082c97564f4b807e7a342472b57a/core/src/main/scala/kafka/coordinator/transaction/TransactionStateManager.scala#L602]

We could have a partition load after the unloading occurs, and that partition 
will be stuck storing stale state on the broker until it restarts. While this 
may not immediately cause a correctness issue, we should try to properly clean 
up state.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to