Dear Ricardo,

Perhaps the problem is the syntax of the --mgsnode specification?

Which Lustre version are you running? There might have been changes in the way 
mgsnodes are specified.

And the four NIDs you mentioned, are these all failover partners? Or DNS nodes?

Example from our site:
We have three MDS, each a pair of active server and failover partner.
The format command for the first (MGS+MDT0) read (under Lustre 2.10.6):
> ... --servicenode=10.20.3.0@o2ib5 --servicenode=10.20.3.1@o2ib5 --mgsnode=10.20.3.0@o2ib5 --mgsnode=10.20.3.1@o2ib5 ...
No comma, no colon.
The format command for the second (MDT1) read:
> ... --servicenode=10.20.2.236@o2ib5 --servicenode=10.20.2.237@o2ib5 --mgsnode=10.20.3.0@o2ib5 --mgsnode=10.20.3.1@o2ib5 ...
Here the servicenodes are the IPs of MDT1 and its failover partner, while the 
mgsnodes are again the IPs of the MGS and its partner.
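
As a rough sketch of the pattern above (one --mgsnode option per NID, no comma, no colon), the snippet below turns a comma-separated NID list into repeated --mgsnode options. The helper name mgsnode_opts and the <device> placeholder are made up for illustration and are not part of Lustre. The script only prints a command line and does not run tunefs.lustre. Whether separate options are right for your four NIDs depends on whether they belong to separate failover nodes, and I have not checked this against your Lustre version.

```shell
#!/bin/sh
# Sketch only: prints a candidate command line, does NOT run tunefs.lustre.
# mgsnode_opts is a hypothetical helper, not a Lustre tool.
mgsnode_opts() {
    # $1: comma-separated NID list, e.g. "10.20.3.0@o2ib5,10.20.3.1@o2ib5"
    old_ifs=$IFS
    IFS=,
    set -- $1          # split the list on commas into positional parameters
    IFS=$old_ifs
    out=
    for nid in "$@"; do
        # append one --mgsnode=<nid> per NID, separated by single spaces
        out="$out${out:+ }--mgsnode=$nid"
    done
    printf '%s\n' "$out"
}

# NIDs taken from the site example above (MGS and its failover partner)
opts=$(mgsnode_opts "10.20.3.0@o2ib5,10.20.3.1@o2ib5")

# <device> is a placeholder for the target device or ZFS dataset
echo "tunefs.lustre --erase-param mgsnode --writeconf $opts <device>"
```

The printed line can be compared with the format commands quoted above before anything is run on a real target.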


Regards,
Thomas

On 11/30/21 19:05, Ricardo Brugman wrote:
Hi all,

I’ve seen many questions/issues come by, so I decided to post the issue that I 
encountered.

Recently I tried updating the mgsnode IP address on a Lustre node, and although 
the command executed successfully, the old IP value remained.

Old value: 10.10.10.2 (points to a server that is not a mgsnode)
New value: 
10.10.10.201@o2ib,10.10.10.202@o2ib,10.10.10.203@o2ib,10.10.10.204@o2ib

Please find the command and output below:

[root@xxx ~]# tunefs.lustre --erase-param mgsnode --writeconf --mgsnode=10.10.10.201@o2ib,10.10.10.202@o2ib,10.10.10.203@o2ib,10.10.10.204@o2ib zfs_R10_nvme0-4/dne_mdt1
checking for existing Lustre data: found

    Read previous values:
Target:     neohpfs-MDT0001
Index:      1
Lustre FS:  neohpfs
Mount type: zfs
Flags:      0x1
               (MDT )
Persistent mount opts:
Parameters: mgsnode=10.10.10.2@o2ib

    Permanent disk data:
Target:     neohpfs=MDT0001
Index:      1
Lustre FS:  neohpfs
Mount type: zfs
Flags:      0x141
               (MDT update writeconf )
Persistent mount opts:
Parameters:  
mgsnode=:10.10.10.201@o2ib,10.10.10.202@o2ib,10.10.10.203@o2ib,10.10.10.204@o2ib
[root@xxx ~]#

I restarted the Lustre service, thinking this might load the new value/config. 
The service came up successfully, but it still had not loaded the new value.

I would appreciate any help or suggestions as to why the new value was not 
saved/loaded. If I made a mistake or followed the wrong step(s)/process, 
then please feel free to point that out.

Best Regards,
Ricardo


_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
