Hi John Thanks for that, life saver! Running on Debian Jessie and I replaced the mail ceph repo in source.d to:
deb http://gitbuilder.ceph.com/ceph-deb-jessie-x86_64-basic/ref/wip-17466-jewel/ jessie main Updated and Upgraded Ceph, tried to manually run my mon which failed as it had already been started during the upgrade! Just to ask about the gitbuilder repo's, is there a way I can track if this patch gets pushed into the mainline (10.2.4 or something)? Are there any gotchas to consider with using them? Thanks again, My Domain Controller thanks you, my mailserver thanks you and my webserver thanks you!!! James On 7 October 2016 at 11:37, John Spray <jsp...@redhat.com> wrote: > On Fri, Oct 7, 2016 at 8:04 AM, James Horner <humankind...@gmail.com> > wrote: > > Hi All > > > > Just wondering if anyone can help me out here. Small home cluster with 1 > > mon, the next phase of the plan called for more but I hadn't got there > yet. > > > > I was trying to setup Cephfs and I ran "ceph fs new" without having an > MDS > > as I was having issues with rank 0 immediately being degraded. My > thinking > > was that I would bring up an MDS and it would be assigned to rank 0. > Anyhoo > > after I did that my mon crashed and I havn't been able to restart it > since, > > its output is: > > > > root@bertie ~ $ /usr/bin/ceph-mon -f --cluster ceph --id bertie > --setuser > > ceph --setgroup ceph 2>&1 | tee /var/log/ceph/mon-temp > > starting mon.bertie rank 0 at 192.168.2.3:6789/0 mon_data > > /var/lib/ceph/mon/ceph-bertie fsid 06e2f4e0-35e1-4f8c-b2a0-bc72c4cd3199 > > terminate called after throwing an instance of 'std::out_of_range' > > what(): map::at > > *** Caught signal (Aborted) ** > > in thread 7fad7f86c480 thread_name:ceph-mon > > ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b) > > 1: (()+0x525737) [0x56219142b737] > > 2: (()+0xf8d0) [0x7fad7eb3c8d0] > > 3: (gsignal()+0x37) [0x7fad7cdc6067] > > 4: (abort()+0x148) [0x7fad7cdc7448] > > 5: (__gnu_cxx::__verbose_terminate_handler()+0x15d) [0x7fad7d6b3b3d] > > 6: (()+0x5ebb6) [0x7fad7d6b1bb6] > > 7: (()+0x5ec01) [0x7fad7d6b1c01] > > 8: (()+0x5ee19) [0x7fad7d6b1e19] > > 9: (std::__throw_out_of_range(char const*)+0x66) [0x7fad7d707b76] > > 10: (FSMap::get_filesystem(int) const+0x7c) [0x56219126ed6c] > > 11: (MDSMonitor::maybe_promote_standby(std::shared_ptr< > Filesystem>)+0x48a) > > [0x56219125b13a] > > 12: (MDSMonitor::tick()+0x4bb) [0x56219126084b] > > 13: (MDSMonitor::on_active()+0x28) [0x562191255da8] > > 14: (PaxosService::_active()+0x60a) [0x5621911d896a] > > 15: (PaxosService::election_finished()+0x7a) [0x5621911d8d7a] > > 16: (Monitor::win_election(unsigned int, std::set<int, std::less<int>, > > std::allocator<int> >&, unsigned long, MonCommand const*, int, > std::set<int, > > std::less<int>, std::allocator<int> > const*)+0x24e) [0x5621911958ce] > > 17: (Monitor::win_standalone_election()+0x20f) [0x562191195d9f] > > 18: (Monitor::bootstrap()+0x91b) [0x56219119676b] > > 19: (Monitor::init()+0x17d) [0x562191196a5d] > > 20: (main()+0x2694) [0x562191106f44] > > 21: (__libc_start_main()+0xf5) [0x7fad7cdb2b45] > > 22: (()+0x257edf) [0x56219115dedf] > > 2016-10-07 06:50:39.049061 7fad7f86c480 -1 *** Caught signal (Aborted) ** > > in thread 7fad7f86c480 thread_name:ceph-mon > > > > ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b) > > 1: (()+0x525737) [0x56219142b737] > > 2: (()+0xf8d0) [0x7fad7eb3c8d0] > > 3: (gsignal()+0x37) [0x7fad7cdc6067] > > 4: (abort()+0x148) [0x7fad7cdc7448] > > 5: (__gnu_cxx::__verbose_terminate_handler()+0x15d) [0x7fad7d6b3b3d] > > 6: (()+0x5ebb6) [0x7fad7d6b1bb6] > > 7: (()+0x5ec01) [0x7fad7d6b1c01] > > 8: (()+0x5ee19) [0x7fad7d6b1e19] > > 9: (std::__throw_out_of_range(char const*)+0x66) [0x7fad7d707b76] > > 10: (FSMap::get_filesystem(int) const+0x7c) [0x56219126ed6c] > > 11: (MDSMonitor::maybe_promote_standby(std::shared_ptr< > Filesystem>)+0x48a) > > [0x56219125b13a] > > 12: (MDSMonitor::tick()+0x4bb) [0x56219126084b] > > 13: (MDSMonitor::on_active()+0x28) [0x562191255da8] > > 14: (PaxosService::_active()+0x60a) [0x5621911d896a] > > 15: (PaxosService::election_finished()+0x7a) [0x5621911d8d7a] > > 16: (Monitor::win_election(unsigned int, std::set<int, std::less<int>, > > std::allocator<int> >&, unsigned long, MonCommand const*, int, > std::set<int, > > std::less<int>, std::allocator<int> > const*)+0x24e) [0x5621911958ce] > > 17: (Monitor::win_standalone_election()+0x20f) [0x562191195d9f] > > 18: (Monitor::bootstrap()+0x91b) [0x56219119676b] > > 19: (Monitor::init()+0x17d) [0x562191196a5d] > > 20: (main()+0x2694) [0x562191106f44] > > 21: (__libc_start_main()+0xf5) [0x7fad7cdb2b45] > > 22: (()+0x257edf) [0x56219115dedf] > > NOTE: a copy of the executable, or `objdump -rdS <executable>` is > needed to > > interpret this. > > > > 0> 2016-10-07 06:50:39.049061 7fad7f86c480 -1 *** Caught signal > > (Aborted) ** > > in thread 7fad7f86c480 thread_name:ceph-mon > > > > ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b) > > 1: (()+0x525737) [0x56219142b737] > > 2: (()+0xf8d0) [0x7fad7eb3c8d0] > > 3: (gsignal()+0x37) [0x7fad7cdc6067] > > 4: (abort()+0x148) [0x7fad7cdc7448] > > 5: (__gnu_cxx::__verbose_terminate_handler()+0x15d) [0x7fad7d6b3b3d] > > 6: (()+0x5ebb6) [0x7fad7d6b1bb6] > > 7: (()+0x5ec01) [0x7fad7d6b1c01] > > 8: (()+0x5ee19) [0x7fad7d6b1e19] > > 9: (std::__throw_out_of_range(char const*)+0x66) [0x7fad7d707b76] > > 10: (FSMap::get_filesystem(int) const+0x7c) [0x56219126ed6c] > > 11: (MDSMonitor::maybe_promote_standby(std::shared_ptr< > Filesystem>)+0x48a) > > [0x56219125b13a] > > 12: (MDSMonitor::tick()+0x4bb) [0x56219126084b] > > 13: (MDSMonitor::on_active()+0x28) [0x562191255da8] > > 14: (PaxosService::_active()+0x60a) [0x5621911d896a] > > 15: (PaxosService::election_finished()+0x7a) [0x5621911d8d7a] > > 16: (Monitor::win_election(unsigned int, std::set<int, std::less<int>, > > std::allocator<int> >&, unsigned long, MonCommand const*, int, > std::set<int, > > std::less<int>, std::allocator<int> > const*)+0x24e) [0x5621911958ce] > > 17: (Monitor::win_standalone_election()+0x20f) [0x562191195d9f] > > 18: (Monitor::bootstrap()+0x91b) [0x56219119676b] > > 19: (Monitor::init()+0x17d) [0x562191196a5d] > > 20: (main()+0x2694) [0x562191106f44] > > 21: (__libc_start_main()+0xf5) [0x7fad7cdb2b45] > > 22: (()+0x257edf) [0x56219115dedf] > > NOTE: a copy of the executable, or `objdump -rdS <executable>` is > needed to > > interpret this. > > > > Fairly sure its a Cephfs Error due to : > > 9: (std::__throw_out_of_range(char const*)+0x66) [0x7fad7d707b76] > > 10: (FSMap::get_filesystem(int) const+0x7c) [0x56219126ed6c] > > It looks like you're hitting this: > http://tracker.ceph.com/issues/17466 > > There is a branch called wip-17466-jewel that has a fix cherry picked > onto 10.2.3 -- hopefully if you install the mon from that branch then > your mons will be happy again. > > Packages: > http://gitbuilder.ceph.com/ceph-deb-trusty-x86_64-basic/ > ref/wip-17466-jewel/ > http://gitbuilder.ceph.com/ceph-rpm-centos7-x86_64-basic/ > ref/wip-17466-jewel/ > > Or of course you can build your own if you're on a platform that isn't > on gitbuilder.ceph.com > > John > > > I have nothing in the CephFS but I had just finished moving all my VMs > into > > rados. I don't care if CephFS gets wiped but I really need the vm images. > > > > If the mon is borked permanently then is there a way I can recover the > > images manually? > > > > Thanks in advance for any help > > > > James > > > > _______________________________________________ > > ceph-users mailing list > > ceph-users@lists.ceph.com > > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com > > >
_______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com