I tried something else, but the result is not really satisfying. I edited the keepalived.conf files that had no peers at all or only one peer, so that they are now all identical. After restarting the daemons, only one virtual IP is assigned, the daemons now communicate with each other, and I see messages like these:

Master received advert from 192.168.168.112 with same priority 80 but higher IP address than ours
Entering BACKUP STATE
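
For reference, this is roughly what the relevant vrrp_instance block in keepalived.conf looks like now, shown for one host (everything except the 192.168.168.112 address from the log above is a placeholder from my setup, and the cephadm-generated file contains a few more options):

vrrp_instance VI_0 {
    state BACKUP
    priority 80                      # same priority on all hosts, per the log above
    interface eth0                   # placeholder, whatever NIC should hold the virtual IP
    virtual_router_id 50
    advert_int 1
    unicast_src_ip 192.168.168.111   # this host's own address
    unicast_peer {
        192.168.168.112              # the other keepalived hosts, as Robert suggested
        192.168.168.113
    }
    virtual_ipaddress {
        192.168.168.200/24 dev eth0  # placeholder virtual IP
    }
}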

So the keepalived instances see each other now, which is good. But powering off the machine with the active NFS daemon doesn't produce the expected result: although keepalived assigns the virtual IP to a different host, the failed NFS daemon is redeployed on the third node, so the virtual IP and the NFS server end up on different hosts and mounting is not possible.

To prevent that from happening, I reduced the number of hosts for both the nfs and the ingress service to two, and that seems to work as expected (after modifying the keepalived.conf again). But all in all, the keepalive_only option seems to involve a bit too much manual work at this point.
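
In case someone wants to reproduce the two-host setup, the specs now look something like this (the cluster name ebl-nfs-cephfs is the real one from the export below, but the hosts and the virtual IP are placeholders):

service_type: nfs
service_id: ebl-nfs-cephfs
placement:
  hosts:
    - ceph01
    - ceph02
spec:
  port: 2049
  virtual_ip: 192.168.168.200        # placeholder VIP, ganesha binds to it directly
---
service_type: ingress
service_id: nfs.ebl-nfs-cephfs
placement:
  hosts:
    - ceph01
    - ceph02
spec:
  backend_service: nfs.ebl-nfs-cephfs
  virtual_ip: 192.168.168.200/24     # same placeholder VIP, with prefix length
  keepalive_only: true               # deploy only keepalived, no haproxy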

And just a side note: I don't see any connected client even though I am writing data into the NFS export. Both the dashboard and the CLI show no clients:

ceph nfs export info ebl-nfs-cephfs /nfsovercephfs
{
  "access_type": "RW",
  "clients": [],
  "cluster_id": "ebl-nfs-cephfs",
...

I only see the active NFS daemon itself as a CephFS client.
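
If anyone wants to cross-check that on their own cluster, the CephFS client sessions can be listed on the MDS, for example (the MDS daemon name is just a placeholder):

# overview including the number of connected CephFS clients
ceph fs status

# list the individual client sessions on the active MDS;
# the nfs daemon shows up here as a regular client
ceph tell mds.cephfs.ceph01.abcdef client ls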


Quoting Eugen Block <ebl...@nde.ag>:

Thanks, I removed the ingress service and redeployed it, with the same result. The interesting part is that the configs are identical to the previous deployment, i.e. the same peers (or missing peers) as before.
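
For the record, the redeploy was essentially just removing and re-applying the ingress spec, along these lines (the exact service name and spec file name are placeholders):

ceph orch rm ingress.nfs.ebl-nfs-cephfs
ceph orch apply -i ingress-nfs.yaml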

Quoting Robert Sander <r.san...@heinlein-support.de>:

On 3/25/25 at 18:55, Eugen Block wrote:
Okay, so I don't see anything in the keepalived logs about the instances communicating with each other. The config files are almost identical: no difference in priority, but they do differ in unicast_peer. ceph03 has no unicast_peer entry at all, ceph02 has only ceph03 in there, while ceph01 has both of the other hosts in its unicast_peer entry. That's weird, isn't it?

They should each have the other two hosts as unicast_peers.
There must have been a glitch in the service generation. Maybe you should try to remove the service and deploy it anew?

Regards
--
Robert Sander
Linux Consultant

Heinlein Consulting GmbH
Schwedter Str. 8/9b, 10119 Berlin

https://www.heinlein-support.de

Tel: +49 30 405051 - 0
Fax: +49 30 405051 - 19

District Court (Amtsgericht) Berlin-Charlottenburg - HRB 220009 B
Managing Director: Peer Heinlein - Registered office: Berlin

