Thanks for the assistance. It appears like the issue you are hitting is due to a failed watch:
2016-08-12 22:29:45.249895 7f6dcffff700 -1 librbd::ImageWatcher: 0x7f6db4003b60 image watch failed: 140096867143248, (107) Transport endpoint is not connected There is a heartbeat that your client is supposed to send to the OSD every 5 seconds to prevent the watch from timing out after 30 seconds. This is indicative of an overloaded client / cluster. The good news is that the fix is already available [1] and should be included in the next Ceph point release. This won't prevent the watch failure but should prevent the race condition between failure and recovery. [1] http://tracker.ceph.com/issues/16923 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1607694 Title: Exclusive-Lock Issue To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/ceph/+bug/1607694/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs