Perhaps as a workaround you should just wipe this mon's data dir and remake it? In the past when I upgraded our mons from spinning disks to SSDs, I went through a procedure to remake each mon from scratch (wiping and resyncing each mon's leveldb one at a time).
I did something like this: service ceph stop cd /var/lib/ceph/mon mv mon.4 mon.4.old mkdir mon.4 ceph-mon --mkfs -i 4 --keyring /var/lib/ceph/tmp/keyring.mon.4 (where the keyring in the last step is a copy of the mon keyring). When you start the mon after this it should get the latest leveldb from the other mons. Maybe a procedure like this will work for you?? (Disclaimer: I've only ever tried this on Dumpling). Cheers, Dan ________________________________________ From: ceph-users-boun...@lists.ceph.com [ceph-users-boun...@lists.ceph.com] on behalf of Jens Kristian Søgaard [j...@mermaidconsulting.dk] Sent: 31 March 2014 20:55 To: Gregory Farnum Cc: ceph-users Subject: Re: [ceph-users] Mon hangs when started after Emperor upgrade Hi Gregory, > Is the mon process doing anything (that is, does it have any CPU > usage)? This looks to be an internal leveldb issue, but not one that > we've run into before, so I think there must be something unique about > the leveldb store involved. No, it is not doing anything at all. I'm not sure what could be unique about my leveldb store. The only thing I can think of is that this cluster was initially installed with Argonaut, and has over time been upgraded through Bobtail, Cuttlefish, Dumpling and now Emperor. -- Jens Kristian Søgaard, Mermaid Consulting ApS, j...@mermaidconsulting.dk, http://www.mermaidconsulting.com/ _______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com _______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com