Some level of performance degradation is expected for small duration if the
broker is down. You will have to find out the bottleneck. It could be IOPS
or network bandwidth or some other resource.
When rebalance occurs it spikes the CPU but you are saying the CPU usage is
dropping ..that is quite s
We are using EC2 EBS volume "thoroughput optimized hdd (st1)" from AWS:
https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ebs-volume-types.html
with 3 brokers and replication factor 3.
There is no data lost we simply accept 10% of the messages sent during this
time period and the rest are delayed
If the cluster is busy then it will have lots of data to rebalance once the
broker comes online. What type is your underlying storage ? Are you using
SSD ?
5k/sec and avg size 3kb i.e. 15000Kb (14.6 MB /sec ) . So if your broker
is down for 10 minutes then approx 8 GB data need to rebalance and