[ceph-users] Re: orchestrator not behaving strangely

Eugen Block Fri, 27 Jun 2025 03:27:36 -0700

Hi,

have you retried it after restarting/failing the mgr?


ceph mgr fail

Quite often this (still) helps.

Zitat von Holger Naundorf <naund...@rz.uni-kiel.de>:

Hello,
we are running a ceph cluster at version:

ceph version 19.2.2 (0eceb0defba60152a8182f7bd87d164b639885b8) squid (stable)
and since a few weeks the orchestrator started to misbehave - up tonow we could not identify any root cause, so I am fishing in thecommunity to see if there are any hints.
Problems:

An OSD removal (for disk replacement) gets stuck in the 'purge' step:

ceph orch osd rm 406 --replace

root@aadm01:~# ceph orch osd rm status
OSD HOST STATE PGS REPLACE FORCE ZAPDRAIN STARTED AT406 acn07 done, waiting for purge 0 True False True2025-06-25 09:18:07.650734+00:00
(now for more than 24h in this state)
At the same time the orchestrator is not restarting OSD daemons -i.e. an 'ceph orch daemon restart osd.xxx' claims its queuing uo therestart, but it never happens. Other services continue to becontrolled correctly via 'ceph orch ...'
If anyone has an idea where to poke around or can match this to someknown problem - I would appreciate any pointers.
Regards,
Holger

--
Dr. Holger Naundorf
Christian-Albrechts-Universität zu Kiel
Rechenzentrum / HPC / Server und Storage
Tel: +49 431 880-1990
Fax:  +49 431 880-1523
naund...@rz.uni-kiel.de



_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io

[ceph-users] Re: orchestrator not behaving strangely

Reply via email to