Hi Robert,
Were you able to resolve this issue? I haven't run into that error myself
yet, so I can't really comment, but it would be interesting to know
whether and how you got out of it.
Thanks,
Eugen
Quoting Robert Sander <r.san...@heinlein-support.de>:
Hi,
On 6/30/25 at 16:50, Robert Sander wrote:
By marking the MDS as repaired you mean the command "ceph mds
repaired storage_cluster:0", right?
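(For context, the sequence I have in mind is roughly the following, assuming the filesystem is named storage_cluster and rank 0 is the damaged rank:)

```shell
# Clear the "damaged" flag on rank 0 so the monitors will try to
# bring an MDS back up for it (only safe if the underlying damage
# has actually been addressed):
ceph mds repaired storage_cluster:0

# Then watch whether the rank comes back up or is marked damaged again:
ceph fs status storage_cluster
ceph health detail
```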
It looks like we hit this bug: https://tracker.ceph.com/issues/65094
From the MDS log: No subtrees found for root MDS rank!
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.0.1753398 handle_mds_map i am now mds.0.1753398
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.0.1753398 handle_mds_map state change up:reconnect --> up:rejoin
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.0.1753398 rejoin_start
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.0.1753398 rejoin_joint_start
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.0.1753398 rejoin_done
Jul 01 08:50:56 sn04 ceph-mds[1563881]: log_channel(cluster) log [ERR] : No subtrees found for root MDS rank!
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.beacon.storage_cluster.sn04.cbvzzu set_want_state: up:rejoin -> down:damaged
Jul 01 08:50:56 sn04 ceph-mds[1563881]: log_client log_queue is 1 last_log 1 sent 0 num 1 unsent 1 sending 1
Jul 01 08:50:56 sn04 ceph-mds[1563881]: log_client will send 2025-07-01T06:50:56.151556+0000 mds.storage_cluster.sn04.cbvzzu (mds.0) 1 : cluster [ERR] No subtrees found for root MDS rank!
Jul 01 08:50:56 sn04 ceph-mds[1563881]: monclient: _send_mon_message to mon.sn03 at v2:192.168.91.53:3300/0
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.beacon.storage_cluster.sn04.cbvzzu Sending beacon down:damaged seq 11440
Jul 01 08:50:56 sn04 ceph-mds[1563881]: monclient: _send_mon_message to mon.sn03 at v2:192.168.91.53:3300/0
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.beacon.storage_cluster.sn04.cbvzzu received beacon reply up:rejoin seq 11439 rtt 1.011
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.beacon.storage_cluster.sn04.cbvzzu received beacon reply down:damaged seq 11440 rtt 0.0880002
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu respawn!
Jul 01 08:50:56 sn04 ceph-28ca2bfa-d87e-11ed-83a3-1070fddda30f-mds-storage_cluster-sn04-cbvzzu[1563770]: -9999> 2025-07-01T06:50:56.150+0000 7f5daca13640 -1 log_channel(cluster) log [ERR] : No subtrees found for root MDS rank!
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu e: '/usr/bin/ceph-mds'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 0: '/usr/bin/ceph-mds'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 1: '-n'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 2: 'mds.storage_cluster.sn04.cbvzzu'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 3: '-f'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 4: '--setuser'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 5: 'ceph'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 6: '--setgroup'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 7: 'ceph'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 8: '--default-log-to-file=false'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 9: '--default-log-to-journald=true'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu 10: '--default-log-to-stderr=false'
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu respawning with exe /usr/bin/ceph-mds
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu exe_path /proc/self/exe
Jul 01 08:50:56 sn04 ceph-28ca2bfa-d87e-11ed-83a3-1070fddda30f-mds-storage_cluster-sn04-cbvzzu[1563770]: ignoring --setuser ceph since I am not root
Jul 01 08:50:56 sn04 ceph-28ca2bfa-d87e-11ed-83a3-1070fddda30f-mds-storage_cluster-sn04-cbvzzu[1563770]: ignoring --setgroup ceph since I am not root
Jul 01 08:50:56 sn04 ceph-mds[1563881]: ceph version 18.2.4 (e7ad5345525c7aa95470c26863873b581076945d) reef (stable), process ceph-mds, pid 2
Jul 01 08:50:56 sn04 ceph-mds[1563881]: main not setting numa affinity
Jul 01 08:50:56 sn04 ceph-mds[1563881]: pidfile_write: ignore empty --pid-file
Jul 01 08:50:56 sn04 ceph-28ca2bfa-d87e-11ed-83a3-1070fddda30f-mds-storage_cluster-sn04-cbvzzu[1563770]: starting mds.storage_cluster.sn04.cbvzzu at
Jul 01 08:50:56 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu Updating MDS map to version 1753401 from mon.2
Jul 01 08:50:57 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu Updating MDS map to version 1753402 from mon.2
Jul 01 08:50:57 sn04 ceph-mds[1563881]: mds.storage_cluster.sn04.cbvzzu Monitors have assigned me to become a standby.
And that's it. The MDS journal integrity seems to be OK:
# /usr/bin/cephfs-journal-tool --rank=storage_cluster:all journal inspect
Overall journal integrity: OK
How do we get this filesystem online again?
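Would the disaster-recovery sequence from the CephFS documentation be the right approach here? Something along these lines (untested on our side; the journal reset step is destructive, hence the export first):

```shell
# Back up the rank 0 journal before touching anything:
cephfs-journal-tool --rank=storage_cluster:0 journal export backup.bin

# Replay dentries from the journal back into the metadata store:
cephfs-journal-tool --rank=storage_cluster:0 event recover_dentries summary

# Reset the journal (destructive!) and clear the damaged flag on rank 0:
cephfs-journal-tool --rank=storage_cluster:0 journal reset
ceph mds repaired storage_cluster:0
```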
Regards
--
Robert Sander
Linux Consultant
Heinlein Consulting GmbH
Schwedter Str. 8/9b, 10119 Berlin
https://www.heinlein-support.de
Tel: +49 30 405051 - 0
Fax: +49 30 405051 - 19
Amtsgericht Berlin-Charlottenburg - HRB 220009 B
Geschäftsführer: Peer Heinlein - Sitz: Berlin
_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io