Hello Everyone,

I'm upgrading from 18.2.4 to 18.2.6, and I have a 4-node cluster with 8
NVMe's per node.  Each NVMe is split into 2 OSDs.  The upgrade went through
the mgr, mon, crash and began upgrading OSDs.

The OSDs it was upgrading were not coming back online.

I tried rebooting, and no luck.

journalctl -xe shows the following:

░░ The unit
docker-02cb79ef9a657cdaa26b781966aa6d2f1d5e54cdc9efa6c5ff1f0e98c3a866e4.scope
has successfully entered the 'dead' state.
Apr 29 06:24:09 prdhcistonode01 dockerd[2967]:
time="2025-04-29T06:24:09.282073583-04:00" level=info msg="ignoring event"
container=76c56ddd668015de0022bfa2527060e64a9513>
Apr 29 06:24:09 prdhcistonode01 containerd[2797]:
time="2025-04-29T06:24:09.282129114-04:00" level=info msg="shim
disconnected" id=76c56ddd668015de0022bfa2527060e64a95137>
Apr 29 06:24:09 prdhcistonode01 containerd[2797]:
time="2025-04-29T06:24:09.282219664-04:00" level=warning msg="cleaning up
after shim disconnected" id=76c56ddd668015de00>
Apr 29 06:24:09 prdhcistonode01 containerd[2797]:
time="2025-04-29T06:24:09.282242484-04:00" level=info msg="cleaning up dead
shim"
Apr 29 06:24:09 prdhcistonode01 bash[23886]: debug
2025-04-29T10:24:09.287+0000 7f6961ae9740  1 mClockScheduler:
set_osd_capacity_params_from_config: osd_bandwidth_cost_p>
Apr 29 06:24:09 prdhcistonode01 bash[23886]: debug
2025-04-29T10:24:09.287+0000 7f6961ae9740  0 osd.3:0.OSDShard using op
scheduler mclock_scheduler, cutoff=196
Apr 29 06:24:09 prdhcistonode01 bash[23886]: debug
2025-04-29T10:24:09.287+0000 7f6961ae9740  1 bdev(0x56046b4c8000
/var/lib/ceph/osd/ceph-3/block) open path /var/lib/cep>
Apr 29 06:24:09 prdhcistonode01 containerd[2797]:
time="2025-04-29T06:24:09.292047607-04:00" level=warning msg="cleanup
warnings time=\"2025-04-29T06:24:09-04:00\" level=>
Apr 29 06:24:09 prdhcistonode01 dockerd[2967]:
time="2025-04-29T06:24:09.292163618-04:00" level=info msg="ignoring event"
container=02cb79ef9a657cdaa26b781966aa6d2f1d5e54>
Apr 29 06:24:09 prdhcistonode01 containerd[2797]:
time="2025-04-29T06:24:09.292216428-04:00" level=info msg="shim
disconnected" id=02cb79ef9a657cdaa26b781966aa6d2f1d5e54c>
Apr 29 06:24:09 prdhcistonode01 containerd[2797]:
time="2025-04-29T06:24:09.292277279-04:00" level=warning msg="cleaning up
after shim disconnected" id=02cb79ef9a657cdaa2>
Apr 29 06:24:09 prdhcistonode01 containerd[2797]:
time="2025-04-29T06:24:09.292291949-04:00" level=info msg="cleaning up dead
shim"
Apr 29 06:24:09 prdhcistonode01 bash[23886]: debug
2025-04-29T10:24:09.287+0000 7f6961ae9740  1 bdev(0x56046b4c8000
/var/lib/ceph/osd/ceph-3/block) open size 640122932428>
Apr 29 06:24:09 prdhcistonode01 bash[23886]: debug
2025-04-29T10:24:09.287+0000 7f6961ae9740 -1
bluestore(/var/lib/ceph/osd/ceph-3) _set_cache_sizes bluestore_cache_meta_>
Apr 29 06:24:09 prdhcistonode01 bash[23886]: debug
2025-04-29T10:24:09.287+0000 7f6961ae9740  1 bdev(0x56046b4c8000
/var/lib/ceph/osd/ceph-3/block) close
Apr 29 06:24:09 prdhcistonode01 containerd[2797]:
time="2025-04-29T06:24:09.303385220-04:00" level=warning msg="cleanup
warnings time=\"2025-04-29T06:24:09-04:00\" level=>
Apr 29 06:24:09 prdhcistonode01 bash[24158]: debug
2025-04-29T10:24:09.307+0000 7f2c10403740  1 mClockScheduler:
set_osd_capacity_params_from_config: osd_bandwidth_cost_p>
Apr 29 06:24:09 prdhcistonode01 bash[24158]: debug
2025-04-29T10:24:09.307+0000 7f2c10403740  0 osd.0:0.OSDShard using op
scheduler mclock_scheduler, cutoff=196
Apr 29 06:24:09 prdhcistonode01 bash[23144]: debug
2025-04-29T10:24:09.307+0000 7f12f08c5740 -1 osd.15 0 OSD:init: unable to
mount object store
Apr 29 06:24:09 prdhcistonode01 bash[23144]: debug
2025-04-29T10:24:09.307+0000 7f12f08c5740 -1  ** ERROR: osd init failed:
(22) Invalid argument
Apr 29 06:24:09 prdhcistonode01 bash[24158]: debug
2025-04-29T10:24:09.307+0000 7f2c10403740  1 bdev(0x55d5e45f0000
/var/lib/ceph/osd/ceph-0/block) open path /var/lib/cep>
Apr 29 06:24:09 prdhcistonode01 bash[24158]: debug
2025-04-29T10:24:09.307+0000 7f2c10403740  1 bdev(0x55d5e45f0000
/var/lib/ceph/osd/ceph-0/block) open size 640122932428>
Apr 29 06:24:09 prdhcistonode01 bash[24158]: debug
2025-04-29T10:24:09.307+0000 7f2c10403740 -1
bluestore(/var/lib/ceph/osd/ceph-0) _set_cache_sizes bluestore_cache_meta_>
Apr 29 06:24:09 prdhcistonode01 bash[24158]: debug
2025-04-29T10:24:09.307+0000 7f2c10403740  1 bdev(0x55d5e45f0000
/var/lib/ceph/osd/ceph-0/block) close
Apr 29 06:24:09 prdhcistonode01 bash[24328]: debug
2025-04-29T10:24:09.363+0000 7f30b83b1740  1 mClockScheduler:
set_osd_capacity_params_from_config: osd_bandwidth_cost_p>
Apr 29 06:24:09 prdhcistonode01 bash[24328]: debug
2025-04-29T10:24:09.363+0000 7f30b83b1740  0 osd.8:0.OSDShard using op
scheduler mclock_scheduler, cutoff=196
Apr 29 06:24:09 prdhcistonode01 bash[24328]: debug
2025-04-29T10:24:09.363+0000 7f30b83b1740  1 bdev(0x555f40688000
/var/lib/ceph/osd/ceph-8/block) open path /var/lib/cep>
Apr 29 06:24:09 prdhcistonode01 bash[24328]: debug
2025-04-29T10:24:09.363+0000 7f30b83b1740  1 bdev(0x555f40688000
/var/lib/ceph/osd/ceph-8/block) open size 640122932428>
Apr 29 06:24:09 prdhcistonode01 bash[24328]: debug
2025-04-29T10:24:09.363+0000 7f30b83b1740 -1
bluestore(/var/lib/ceph/osd/ceph-8) _set_cache_sizes bluestore_cache_meta_>
Apr 29 06:24:09 prdhcistonode01 bash[24328]: debug
2025-04-29T10:24:09.363+0000 7f30b83b1740  1 bdev(0x555f40688000
/var/lib/ceph/osd/ceph-8/block) close
Apr 29 06:24:09 prdhcistonode01 systemd[1]:
ceph-fbc38f5c-a3a6-11ea-805c-3b954db9ce7a@osd.12.service: Main process
exited, code=exited, status=1/FAILURE


Any help you can offer would be greatly appreciated.  This is running in
docker:

Client: Docker Engine - Community
 Version:           24.0.7
 API version:       1.43
 Go version:        go1.20.10
 Git commit:        afdd53b
 Built:             Thu Oct 26 09:08:01 2023
 OS/Arch:           linux/amd64
 Context:           default

Server: Docker Engine - Community
 Engine:
  Version:          24.0.7
  API version:      1.43 (minimum version 1.12)
  Go version:       go1.20.10
  Git commit:       311b9ff
  Built:            Thu Oct 26 09:08:01 2023
  OS/Arch:          linux/amd64
  Experimental:     false
 containerd:
  Version:          1.6.25
  GitCommit:        d8f198a4ed8892c764191ef7b3b06d8a2eeb5c7f
 runc:
  Version:          1.1.10
  GitCommit:        v1.1.10-0-g18a0cb0
 docker-init:
  Version:          0.19.0
  GitCommit:        de40ad0

Thanks,
Marco
_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io

Reply via email to