down, etc.
Thanks for the help. I think it will solve the issues I’m having.
From: Kirill 'kkm' Katsnelson [mailto:k...@pobox.com]
Sent: Friday, February 28, 2020 5:56 AM
To: Slurm User Community List
Cc: Carter, Allan
Subject: Re: [slurm-users] How to show state of CLOUD nodes
I
I'm running clusters entirely in Google Cloud. I'm not sure I'm
understanding the issue--do the nodes disappear from view entirely only
when they fail to power up by ResumeTimeout? Failures of this kind are
happening in GCE when resources are momentarily unavailable, but the nodes
are still there,
I'm setting up an EC2 SLURM cluster and when an instance doesn't resume fast
enough I get an error like:
node c7-c5-24xl-464 not resumed by ResumeTimeout(600) - marking down and
power_save
I keep running into issues where my cloud nodes do not show up in sinfo and I
can't display their informa