Have you ever run the disaster recovery procedure
(http://docs.ceph.com/docs/luminous/cephfs/disaster-recovery/)? Try the
following steps.
Stop mds.a, then run these commands one at a time:
cephfs-table-tool 0 reset session
cephfs-journal-tool event recover_dentries summary
cephfs-data-scan scan_links
Then restart the MDS.
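
If it helps, the same sequence can be wrapped in a small script. This is only a sketch under assumptions that are not from this thread: that the daemon is managed by systemd under the unit name ceph-mds@a, and that rank 0 is the affected rank. The MDS_ID and DRY_RUN variables and the run/recover helpers are hypothetical names of mine. By default it only prints what it would do; set DRY_RUN=0 to actually execute.

```shell
#!/bin/sh
# Sketch only: walks through the recovery steps in order.
# Assumptions (not from the thread): systemd unit ceph-mds@<id>, MDS rank 0.
set -eu

MDS_ID="${MDS_ID:-a}"      # hypothetical variable: MDS daemon id
DRY_RUN="${DRY_RUN:-1}"    # hypothetical variable: 1 = print only, 0 = execute

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

recover() {
    run systemctl stop "ceph-mds@${MDS_ID}"                  # stop the MDS first
    run cephfs-table-tool 0 reset session                    # reset the session table
    run cephfs-journal-tool event recover_dentries summary   # replay dentries from the journal
    run cephfs-data-scan scan_links                          # repair linkage
    run systemctl start "ceph-mds@${MDS_ID}"                 # restart the MDS
}

recover
```

Because set -e is on, the script stops at the first command that fails, so a failed step is not silently followed by the next one.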
Hello all!
My Ceph MDS has crashed and no longer starts.
I think the problem is this:
2018-08-10 16:59:18.147612 7f8d50037700 0 mds.0.cache.dir(0x604)
_fetched badness: got (but i already had) [inode 0x10001b99b20 [2,head]
~mds0/stray1/10001b99b20 auth v100647297 s=540 n(v0 b540