Mike Percy created KUDU-2486:
--------------------------------
Summary: Leader should back off heartbeating to failed followers
Key: KUDU-2486
URL: https://issues.apache.org/jira/browse/KUDU-2486
Project: Kudu
Issue Type: Improvement
Components: consensus
Affects Versions: 1.7.1
Reporter: Mike Percy
At the time of writing, the replica leader -> follower heartbeat mechanism does
not have a backoff mechanism built in. Rather it simply sends a heartbeat every
configured period (say, 500ms). If a server is offline this can cause log spam
until that replica is evicted, and if a server is overloaded this lack of a
backoff contributes to the problem.
Since we now have pre-election support, having leaders slow down their
heartbeat attempts when follower requests are returning errors should not cause
unnecessary leader elections, so backing off is feasible.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)