Igniters,

Do we have a feature that checks node liveness on a regular basis?

Scenario:

Precondition
The cluster has no load, but some node's JVM has crashed.

Actual (as I expect it)
The user performs an operation (e.g. a cache put) that involves this node (via another node) and waits for some timeout to learn that the node is dead.
The cluster starts the switch to relocate primary partitions to alive nodes.
Now the user is able to retry the operation.

Desired
Some WatchDog checks node liveness on a regular basis.
Once a failure is detected, the cluster starts the switch.
Later, the user performs an operation on an already fixed cluster and waits for nothing.

It would be good news if the "Desired" case is already the actual behavior.
Can somebody point me to the feature that performs this check?
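
To make the "Desired" case concrete, here is a rough sketch of the kind of periodic check I have in mind. It is only an illustration: WatchDog, isAlive, onFailure and checkOnce are hypothetical names, not Ignite API, and the liveness probe and the "start the switch" hook are passed in as plain callbacks.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;
import java.util.function.Predicate;

/** Hypothetical watchdog; none of these names are Ignite API. */
class WatchDog {
    private final List<String> nodes;
    private final Predicate<String> isAlive;  // assumed liveness probe (e.g. a ping)
    private final Consumer<String> onFailure; // assumed hook that would start the switch

    WatchDog(List<String> nodes, Predicate<String> isAlive, Consumer<String> onFailure) {
        this.nodes = new ArrayList<>(nodes);
        this.isAlive = isAlive;
        this.onFailure = onFailure;
    }

    /** One pass: probe every known node, then drop and report those that fail. */
    synchronized List<String> checkOnce() {
        List<String> failed = new ArrayList<>();
        for (String node : nodes) {
            if (!isAlive.test(node)) {
                failed.add(node);
            }
        }
        nodes.removeAll(failed);
        failed.forEach(onFailure);
        return failed;
    }

    /** Runs checkOnce() every periodMs milliseconds on a background thread. */
    ScheduledExecutorService start(long periodMs) {
        ScheduledExecutorService exec = Executors.newSingleThreadScheduledExecutor();
        exec.scheduleAtFixedRate(this::checkOnce, periodMs, periodMs, TimeUnit.MILLISECONDS);
        return exec;
    }
}
```

The point is only that the failure is noticed by the periodic pass, not by the first user operation that happens to touch the dead node.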