Re: [slurm-users] How to show state of CLOUD nodes

2020-02-28 Thread Carter, Allan
down, etc. Thanks for the help. I think it will solve the issues I’m having. From: Kirill 'kkm' Katsnelson [mailto:k...@pobox.com] Sent: Friday, February 28, 2020 5:56 AM To: Slurm User Community List Cc: Carter, Allan Subject: Re: [slurm-users] How to show state of CLOUD nodes I

Re: [slurm-users] How to show state of CLOUD nodes

2020-02-28 Thread Kirill 'kkm' Katsnelson
I'm running clusters entirely in Google Cloud. I'm not sure I'm understanding the issue--do the nodes disappear from view entirely only when they fail to power up by ResumeTimeout? Failures of this kind are happening in GCE when resources are momentarily unavailable, but the nodes are still there,

[slurm-users] How to show state of CLOUD nodes

2020-02-27 Thread Carter, Allan
I'm setting up an EC2 SLURM cluster and when an instance doesn't resume fast enough I get an error like: node c7-c5-24xl-464 not resumed by ResumeTimeout(600) - marking down and power_save I keep running into issues where my cloud nodes do not show up in sinfo and I can't display their informa