Thanks for the KIP Sanjana. I think the motivation is good, but lack of more quantitative analysis. For instance:
1. How much retries we are saving by applying the exponential retry vs static retry? There should be some mathematical relations between the static retry ms, the initial exponential retry ms, the max exponential retry ms in a given time interval. 2. How does this affect the client timeout? With exponential retry, the client shall be getting easier to timeout on a parent level caller, for instance stream attempts to retry initializing producer transactions with given 5 minute interval. With exponential retry this mechanism could experience more frequent timeout which we should be careful with. 3. With regards to #2, we should have more detailed checklist of all the existing static retry scenarios, and adjust the initial exponential retry ms to make sure we won't get easily timeout in high level due to too few attempts. Boyang On Fri, Mar 13, 2020 at 4:38 PM Sanjana Kaundinya <skaundi...@gmail.com> wrote: > Hi Everyone, > > I’ve written a KIP about introducing exponential backoff for Kafka > clients. Would appreciate any feedback on this. > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-580%3A+Exponential+Backoff+for+Kafka+Clients > > Thanks, > Sanjana >