TisonKun edited a comment on issue #11338: [FLINK-10052][ha] Tolerate temporarily suspended ZooKeeper connections URL: https://github.com/apache/flink/pull/11338#issuecomment-598161630 > Is setting the session timeout to a low value equivalent to not tolerating suspended connections from a users point-of-view? I will say "no" because they're totally different things. - SUSPENDED caused by ZK ConnectionLossException, which means the client-server socket broken. Then the ZK client will automatically retry to connect to other servers in the ensemble. - LOST caused by ZK SessionExpiredException, which (with Curator features) means the session timeout configured elapsed before the client can find a server. Practically, if session timeout set to a low value, with this feature turn on, and client-server connection loss, from a users point-of-view, the suspended connection still in tolerance. But the leader services possibly revoke the leadership soon after the session timeout elapsed. Often user suffers ZK connection issue(with or without this feature) when ZK ensemble unstable or high throughput on JM(said job with large parallelism in scheduling). But whether or not a session timeout and whether or not we tolerate suspended connection have nothing to do with each other.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services