We've created a few Lustre FS endpoints in AWS. They were mounted on a system. The Lustre endpoints got terminated soon after that, and others were created instead.

Now the old Lustre filesystems appear to be mounted on that node, and there's automation trying to unmount them, resulting in a very large number of umount processes just hanging. In dmesg I see this message repeated many, many times:

Lustre: 919:0:(client.c:2116:ptlrpc_expire_one_request()) @@@ Request sent has failed due to network error:

What is the recommended procedure to unmount those FSs? Just running umount manually also hangs indefinitely. I would prefer to not reboot that node.

--
Florin Andrei
https://florin.myip.org/
_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to