Re: [lustre-discuss] Connectivity issues after client crash

2024-12-03 Thread Jesse Stroik
Thanks for your reply. I did consider disabling peer discovery on the servers for one of the file systems as a test, but have not yet tested it. I'll keep that idea in my back pocket in case the issue recurs with our workaround or happens on any of our other 2.15 file systems. Di

Re: [lustre-discuss] Connectivity issues after client crash

2024-12-03 Thread Nehring, Shane R [ITS]
Hello Jesse, I think what we may have been hitting in this particular case was https://jira.whamcloud.com/browse/LU-16349, which we worked around in the short term by switching from o2iblnd to ksocklnd. Ultimately we mostly moved away from Omni-Path to InfiniBand. I've seen some weird behavior in our

Re: [lustre-discuss] Connectivity issues after client crash

2024-12-03 Thread Jesse Stroik
Hi Shane, I realize this is quite an old post, but I think it is worth responding for posterity and because I suspect others who upgrade may run into this issue. I'm observing issues similar to what you describe. They started this weekend on two of our servers which were upgraded to