Daniel Urban created KAFKA-14053:
------------------------------------

             Summary: Transactional producer should bump the epoch when a batch 
encounters delivery timeout
                 Key: KAFKA-14053
                 URL: https://issues.apache.org/jira/browse/KAFKA-14053
             Project: Kafka
          Issue Type: Bug
            Reporter: Daniel Urban
            Assignee: Daniel Urban


When a batch fails due to a delivery timeout, the batch may still be in flight. 
Because of underlying infrastructure issues, an EndTxnRequest and a 
WriteTxnMarkerRequest can be processed before the in-flight batch is processed 
on the leader. As a result, transactional batches can be appended to the log 
after the corresponding abort marker.

This can block the LSO (last stable offset) of the partition indefinitely, or 
can even violate processing guarantees, because the out-of-order batch can 
become part of the next transaction.
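
The race described above can be illustrated with a small toy model. This is 
only a sketch, not Kafka code: PartitionLog, appendBatch, appendAbortMarker and 
leavesTransactionOpen are hypothetical names, and the model assumes that the 
leader rejects any batch whose producer epoch is lower than the latest epoch it 
has seen for that producer. Under that assumption, it shows why bumping the 
epoch on delivery timeout would keep the late batch out of the log.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model only: none of these types are Kafka classes.
class EpochFencingSketch {

    // Simplified leader-side state of one partition for a single producer id.
    static final class PartitionLog {
        final List<String> entries = new ArrayList<>();
        int latestEpoch = 0;      // highest producer epoch seen so far
        boolean txnOpen = false;  // true while data exists with no marker after it

        // Appends a transactional batch unless its epoch is stale
        // (assumed fencing rule, see the lead-in).
        boolean appendBatch(int epoch, String payload) {
            if (epoch < latestEpoch) {
                return false; // fenced: the batch never reaches the log
            }
            latestEpoch = epoch;
            entries.add("DATA[" + payload + ", epoch=" + epoch + "]");
            txnOpen = true;
            return true;
        }

        // Appends an abort marker, which closes the currently open transaction.
        void appendAbortMarker(int epoch) {
            latestEpoch = Math.max(latestEpoch, epoch);
            entries.add("ABORT[epoch=" + epoch + "]");
            txnOpen = false;
        }
    }

    // Replays the race from the report. Returns true if the late batch
    // leaves a transaction open after the abort marker.
    static boolean leavesTransactionOpen(boolean bumpEpochOnTimeout) {
        PartitionLog log = new PartitionLog();
        int epoch = 0;

        // 1. The producer sends a batch with epoch 0; it is delayed in flight.
        int delayedBatchEpoch = epoch;

        // 2. The delivery timeout fires and the producer aborts the transaction.
        //    With the proposed behavior, it also bumps its epoch.
        if (bumpEpochOnTimeout) {
            epoch++;
        }

        // 3. The abort marker reaches the leader before the delayed batch.
        log.appendAbortMarker(epoch);

        // 4. The delayed batch finally arrives at the leader.
        log.appendBatch(delayedBatchEpoch, "late-batch");

        return log.txnOpen;
    }

    public static void main(String[] args) {
        System.out.println("without epoch bump, open txn after abort: "
                + leavesTransactionOpen(false));
        System.out.println("with epoch bump, open txn after abort: "
                + leavesTransactionOpen(true));
    }
}
```

In this model, an open transaction with no marker after it stands in for the 
blocked LSO: nothing later in the sketch closes it, and a following transaction 
from the same producer would absorb the stray batch.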



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
