Ron Dagostino created KAFKA-14392:
-------------------------------------
Summary: KRaft should comment controller.socket.timeout.ms <=
broker.session.timeout.ms
Key: KAFKA-14392
URL: https://issues.apache.org/jira/browse/KAFKA-14392
Project: Kafka
Issue Type: Improvement
Reporter: Ron Dagostino
Assignee: Ron Dagostino
KRaft brokers maintain their liveness in the cluster by sending
BROKER_HEARTBEAT requests to the active controller; the active controller
fences a broker if it doesn't receive a heartbeat request from that broker
within the period defined by `broker.session.timeout.ms`. The broker should
use a request timeout for its BROKER_HEARTBEAT requests that is no larger than
the session timeout used by the controller. A larger request timeout creates
the possibility that, upon controller failover, the broker does not cancel an
in-flight heartbeat request soon enough to heartbeat to the new controller
and maintain an uninterrupted session in the cluster. In other
words, a failure of the active controller could result in under-replicated (or
under-min ISR) partitions simply due to a delay in brokers heartbeating to the
new controller.
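
The constraint described above can be sketched as a simple configuration
check. This is only an illustration, not Kafka code: it assumes, as the
summary suggests, that `controller.socket.timeout.ms` is the setting that
governs the heartbeat request timeout. The class and method names are
hypothetical, and the numeric values are illustrative rather than
recommendations.

```java
import java.util.Properties;

// Hypothetical helper for illustration; not part of the Kafka code base.
final class HeartbeatTimeoutCheck {
    // Assumed to govern the BROKER_HEARTBEAT request timeout (per the issue summary).
    static final String REQUEST_TIMEOUT = "controller.socket.timeout.ms";
    static final String SESSION_TIMEOUT = "broker.session.timeout.ms";

    /** Returns true when the request timeout is larger than the session timeout. */
    static boolean exceedsSessionTimeout(Properties props) {
        long requestTimeoutMs = Long.parseLong(props.getProperty(REQUEST_TIMEOUT));
        long sessionTimeoutMs = Long.parseLong(props.getProperty(SESSION_TIMEOUT));
        return requestTimeoutMs > sessionTimeoutMs;
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        // Illustrative values only: a 30 s request timeout against a 9 s session.
        props.setProperty(REQUEST_TIMEOUT, "30000");
        props.setProperty(SESSION_TIMEOUT, "9000");
        // A request stuck on a failed controller could outlive the session here.
        System.out.println(exceedsSessionTimeout(props)); // prints "true"
    }
}
```

With values like these, a heartbeat still pending against the failed
controller might not time out until well after the new controller's session
window has elapsed, which is the fencing scenario described above.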