Was that 0.61.4 -> 0.61.5? Our upgrade of all mons and osds on SL6.4 went without incident. -- dan
-- Dan van der Ster CERN IT-DSS On Friday, July 19, 2013 at 9:00 AM, Stefan Priebe - Profihost AG wrote: > crash is this one: > > 2013-07-19 08:59:32.137646 7f484a872780 0 ceph version > 0.61.5-17-g83f8b88 (83f8b88e5be41371cb77b39c0966e79cad92087b), process > ceph-mon, pid 22172 > 2013-07-19 08:59:32.173975 7f484a872780 -1 mon/OSDMonitor.cc > (http://OSDMonitor.cc): In > function 'virtual void OSDMonitor::update_from_paxos(bool*)' thread > 7f484a872780 time 2013-07-19 08:59:32.173506 > mon/OSDMonitor.cc (http://OSDMonitor.cc): 132: FAILED > assert(latest_bl.length() != 0) > > ceph version 0.61.5-17-g83f8b88 (83f8b88e5be41371cb77b39c0966e79cad92087b) > 1: (OSDMonitor::update_from_paxos(bool*)+0x16e1) [0x51d341] > 2: (PaxosService::refresh(bool*)+0xe6) [0x4f2c66] > 3: (Monitor::refresh_from_paxos(bool*)+0x57) [0x48f7b7] > 4: (Monitor::init_paxos()+0xe5) [0x48f955] > 5: (Monitor::preinit()+0x679) [0x4bba79] > 6: (main()+0x36b0) [0x484bb0] > 7: (__libc_start_main()+0xfd) [0x7f48489cec8d] > 8: /usr/bin/ceph-mon() [0x4801e9] > NOTE: a copy of the executable, or `objdump -rdS <executable>` is > needed to interpret this. > > --- begin dump of recent events --- > -13> 2013-07-19 08:59:32.136172 7f484a872780 5 asok(0x131a000) > register_command perfcounters_dump hook 0x1304010 > -12> 2013-07-19 08:59:32.136191 7f484a872780 5 asok(0x131a000) > register_command 1 hook 0x1304010 > -11> 2013-07-19 08:59:32.136194 7f484a872780 5 asok(0x131a000) > register_command perf dump hook 0x1304010 > -10> 2013-07-19 08:59:32.136200 7f484a872780 5 asok(0x131a000) > register_command perfcounters_schema hook 0x1304010 > -9> 2013-07-19 08:59:32.136204 7f484a872780 5 asok(0x131a000) > register_command 2 hook 0x1304010 > -8> 2013-07-19 08:59:32.136206 7f484a872780 5 asok(0x131a000) > register_command perf schema hook 0x1304010 > -7> 2013-07-19 08:59:32.136208 7f484a872780 5 asok(0x131a000) > register_command config show hook 0x1304010 > -6> 2013-07-19 08:59:32.136211 7f484a872780 5 asok(0x131a000) > register_command config set hook 0x1304010 > -5> 2013-07-19 08:59:32.136214 7f484a872780 5 asok(0x131a000) > register_command log flush hook 0x1304010 > -4> 2013-07-19 08:59:32.136216 7f484a872780 5 asok(0x131a000) > register_command log dump hook 0x1304010 > -3> 2013-07-19 08:59:32.136219 7f484a872780 5 asok(0x131a000) > register_command log reopen hook 0x1304010 > -2> 2013-07-19 08:59:32.137646 7f484a872780 0 ceph version > 0.61.5-17-g83f8b88 (83f8b88e5be41371cb77b39c0966e79cad92087b), process > ceph-mon, pid 22172 > -1> 2013-07-19 08:59:32.137967 7f484a872780 1 finished > global_init_daemonize > 0> 2013-07-19 08:59:32.173975 7f484a872780 -1 mon/OSDMonitor.cc > (http://OSDMonitor.cc): In > function 'virtual void OSDMonitor::update_from_paxos(bool*)' thread > 7f484a872780 time 2013-07-19 08:59:32.173506 > mon/OSDMonitor.cc (http://OSDMonitor.cc): 132: FAILED > assert(latest_bl.length() != 0) > > ceph version 0.61.5-17-g83f8b88 (83f8b88e5be41371cb77b39c0966e79cad92087b) > 1: (OSDMonitor::update_from_paxos(bool*)+0x16e1) [0x51d341] > 2: (PaxosService::refresh(bool*)+0xe6) [0x4f2c66] > 3: (Monitor::refresh_from_paxos(bool*)+0x57) [0x48f7b7] > 4: (Monitor::init_paxos()+0xe5) [0x48f955] > 5: (Monitor::preinit()+0x679) [0x4bba79] > 6: (main()+0x36b0) [0x484bb0] > 7: (__libc_start_main()+0xfd) [0x7f48489cec8d] > 8: /usr/bin/ceph-mon() [0x4801e9] > NOTE: a copy of the executable, or `objdump -rdS <executable>` is > needed to interpret this. > > Mi > > Am 19.07.2013 08:58, schrieb Stefan Priebe - Profihost AG: > > All mons do not work anymore: > > > > === mon.a === > > Starting Ceph mon.a on ccad... > > [21207]: (33) Numerical argument out of domain > > failed: 'ulimit -n 8192; /usr/bin/ceph-mon -i a --pid-file > > /var/run/ceph/mon.a.pid -c /etc/ceph/ceph.conf ' > > > > Stefan > > > > Am 19.07.2013 07:59, schrieb Sage Weil: > > > A note on upgrading: > > > > > > One of the fixes in 0.61.5 is with a 32bit vs 64bit bug with the feature > > > bits. We did not realize it before, but the fix will prevent 0.61.4 (or > > > earlier) from forming a quorum with 0.61.5. This is similar to the > > > upgrade > > > from bobtail (and the future upgrade to dumpling). As such, we recommend > > > you upgrade all monitors at once to avoid the potential for discruption > > > in > > > service. > > > > > > I'm adding a note to the release notes. > > > > > > Thanks! > > > sage > > > > > > > > > On Thu, 18 Jul 2013, Sage Weil wrote: > > > > > > > We've prepared another update for the Cuttlefish v0.61.x series. This > > > > release primarily contains monitor stability improvements, although > > > > there > > > > are also some important fixes for ceph-osd for large clusters and a few > > > > important CephFS fixes. We recommend that all v0.61.x users upgrade. > > > > > > > > * mon: misc sync improvements (faster, more reliable, better tuning) > > > > * mon: enable leveldb cache by default (big performance improvement) > > > > * mon: new scrub feature (primarily for diagnostic, testing purposes) > > > > * mon: fix occasional leveldb assertion on startup > > > > * mon: prevent reads until initial state is committed > > > > * mon: improved logic for trimming old osdmaps > > > > * mon: fix pick_addresses bug when expanding mon cluster > > > > * mon: several small paxos fixes, improvements > > > > * mon: fix bug osdmap trim behavior > > > > * osd: fix several bugs with PG stat reporting > > > > * osd: limit number of maps shared with peers (which could cause domino > > > > failures) > > > > * rgw: fix radosgw-admin buckets list (for all buckets) > > > > * mds: fix occasional client failure to reconnect > > > > * mds: fix bad list traversal after unlink > > > > * mds: fix underwater dentry cleanup (occasional crash after mds > > > > restart) > > > > * libcephfs, ceph-fuse: fix occasional hangs on umount > > > > * libcephfs, ceph-fuse: fix old bug with O_LAZY vs O_NOATIME confusion > > > > * ceph-disk: more robust journal device detection on RHEL/CentOS > > > > * ceph-disk: better, simpler locking > > > > * ceph-disk: do not inadvertantely mount over existing osd mounts > > > > * ceph-disk: better handling for unusual device names > > > > * sysvinit, upstart: handle symlinks in /var/lib/ceph/* > > > > > > > > Please also refer to the complete release notes: > > > > > > > > http://ceph.com/docs/master/release-notes/#v0-61-5-cuttlefish > > > > > > > > You can get v0.61.5 from the usual locations: > > > > > > > > * Git at git://github.com/ceph/ceph.git > > > > (http://github.com/ceph/ceph.git) > > > > * Tarball at http://ceph.com/download/ceph-0.61.5.tar.gz > > > > * For Debian/Ubuntu packages, see > > > > http://ceph.com/docs/master/install/debian > > > > * For RPMs, see http://ceph.com/docs/master/install/rpm > > > > > > > > > > _______________________________________________ > > > ceph-users mailing list > > > ceph-users@lists.ceph.com (mailto:ceph-users@lists.ceph.com) > > > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com > > > > > > > > > _______________________________________________ > ceph-users mailing list > ceph-users@lists.ceph.com (mailto:ceph-users@lists.ceph.com) > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com > >
_______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com