Alyssa Huang created KAFKA-18345: ------------------------------------ Summary: Investigate if binaryExponentialElectionBackoffMs can be removed or improved Key: KAFKA-18345 URL: https://issues.apache.org/jira/browse/KAFKA-18345 Project: Kafka Issue Type: Improvement Reporter: Alyssa Huang
Right now this exponential backoff method can return a min backoff of 200ms and cap out at the electionBackoffMaxMs (1s default). The logic for calculating this exponential backoff isn't very straightforward, and probably no longer needed now that Candidate transitions to Prospective after election loss. We can simplify states by not tracking "retries" as well. We _do_ still need a bit of randomness to ensure we can get past gridlocked elections, but as Diego's Raft thesis shows, 50ms of random range is sufficient to avoid repeat split votes in elections. > Using more randomness improves worst-case behavior: with a 50 ms random > range, the worst-case completion time (over 1,000 trials) was 513 ms. We should investigate if it would make sense to replace the exponential backoff with a non-exponential backoff that is more straightforward. e.g. {code:java} int randomElectionBackoffMs() { long backoff = random(0, 50ms) return Math.min(state.remainingElectionTimeoutMs, backoff) }{code} Note, it's fine to return state.remainingElectionTimeoutMs because that also has randomness in it. The alternative could also be using a lower default electionTimeoutMs (like 300ms mentioned in the Raft paper) and on election failure, waiting the extent of state.remainingElectionTimeoutMs (which contains some jitter). -- This message was sent by Atlassian Jira (v8.20.10#820010)