Re: [ceph-users] Fwd: HW failure cause client IO drops

2019-04-16 Thread M Ranga Swami Reddy
OSD processes/daemon running as is...So ceph not making those OSD down or out. But as battery failed, which leads temperature high, leads CPU utlization increased - leads OSD response time more, so that other OSDs failed to response on time.. causing the utter slow or no IO... On Tue, Apr 16, 2

Re: [ceph-users] Fwd: HW failure cause client IO drops

2019-04-15 Thread Eugen Block
Good morning, the OSDs are usually marked out after 10 minutes, that's when rebalancing starts. But the I/O should not drop during that time, this could be related to your pool configuration. If you have a replicated pool of size 3 and also set min_size to 3 the I/O would pause if a node