These steps pretty well correspond to http://docs.ceph.com/docs/mimic/cephfs/disaster-recovery/ (http://docs.ceph.com/docs/mimic/cephfs/disaster-recovery/) Were you able to replay journal manually with no issues? IIRC, "cephfs-journal-tool recover_dentries" would lead to OOM in case of MDS doing so, and it has already been discussed on this list. April 2, 2019 1:37 AM, "Pickett, Neale T" <ne...@lanl.gov (mailto:ne...@lanl.gov?to=%22Pickett,%20Neale%20T%22%20<ne...@lanl.gov>)> wrote: Here is what I wound up doing to fix this: * Bring down all MDSes so they stop flapping * Back up journal (as seen in previous message) * Apply journal manually * Reset journal manually * Clear session table * Clear other tables (not sure I needed to do this) * Mark FS down * Mark the rank 0 MDS as failed * Reset the FS (yes, I really mean it) * Restart MDSes * Finally get some sleep
_______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com