Jason Gustafson created KAFKA-9802:
--------------------------------------

             Summary: Fix flaky system test `TransactionsTest.test_transactions`
                 Key: KAFKA-9802
                 URL: https://issues.apache.org/jira/browse/KAFKA-9802
             Project: Kafka
          Issue Type: Bug
            Reporter: Jason Gustafson
            Assignee: Jason Gustafson


Found when investigating a test failure in 
`kafkatest.tests.core.transactions_test.TransactionsTest.test_transactions`. 
The failure was caused by an unexpected transaction timeout. Looking into the 
logs, I found that the timeout was due to a Produce request which took just 
over 10s to complete. The test case that failed involved a hard kill of the 
broker, so it is certainly possible for some produce requests to reach up to 
the request timeout. 

The problem in this case is that `request.timeout.ms` was set to 30s while the 
`transaction.timeout.ms` was set to only 10s. There is no benefit to allowing a 
larger request timeout since the coordinator is just going to abort the 
transaction. So we should fix this test case to set these timeouts 
consistently. It might also be worth logging a warning to the user when the 
request timeout is larger than the transaction timeout.







--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to