[ceph-users] Re: Ceph cluster not recover after OSD down

Burkhard Linke Wed, 05 May 2021 02:17:58 -0700

Hi,

On 05.05.21 11:07, Andres Rojas Guerrero wrote:

Sorry, I have not understood the problem well, the problem I see is that
once the OSD fails, the cluster recovers but the MDS remains faulty:


*snipsnap*

     pgs:     1.562% pgs not active
              16128 active+clean
              238   incomplete
              18    down

The PGs in down and incomplete state will not allow any I/O, and thisleads to the slow ops and the unavailability of the services. 32 OSDsare currently down; if PG replicates are spread over these OSDs onlythere will be no automatic recover.

You will have to bring the OSDs back online to allow recovery. Are thoselocated on a single node or are multiple hosts involved?


Regards,

Burkhard

_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io

[ceph-users] Re: Ceph cluster not recover after OSD down

Reply via email to