On Sun, Oct 20, 2019 at 1:53 PM Stefan Kooman <ste...@bit.nl> wrote: > > Dear list, > > Quoting Stefan Kooman (ste...@bit.nl): > > > I wonder if this situation is more likely to be hit on Mimic 13.2.6 than > > on any other system. > > > > Any hints / help to prevent this from happening? > > We have had this happening another two times now. In both cases the MDS > recovers, becomes active (for a few seconds), and crashes again. It won't > come out of this loop by itself. When put in deug mode "debug_mds = > 10/10) we won't hit the bug and it stays active. After a few minutes we > disable debug (live, ceph tell mds.* config set debug_mds 0/0) and it > keeps running (Heisenbug)... until hours later when it crashes again and the > story > repeats itself. > > So unfortunately no more debug information available, but at least a > workaround to get it active again. >
delete 'mdsX_openfiles.0' object from cephfs metadata pool. (X is rank of the crashed mds) > Gr. Stefan > > -- > | BIT BV https://www.bit.nl/ Kamer van Koophandel 09090351 > | GPG: 0xD14839C6 +31 318 648 688 / i...@bit.nl > _______________________________________________ > ceph-users mailing list > ceph-users@lists.ceph.com > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com _______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com