Re: [slurm-users] [External] Re: Down nodes

2021-07-30 Thread Soichi Hayashi
on the node > 2) something on the network is stopping the communication between the node > and the master (firewall, selinux, congestion, bad nic, routes, etc) > > Brian Andrus > On 7/30/2021 3:51 PM, Soichi Hayashi wrote: > > Brian, > > Thank you for your reply and th

Re: [slurm-users] Down nodes

2021-07-30 Thread Soichi Hayashi
utes until slurm's ping agent kicks in and marking them down again. Thanks!! Soichi On Fri, Jul 30, 2021 at 2:21 PM Soichi Hayashi wrote: > Hello. I need a help with troubleshooting our slurm cluster. > > I am running slurm-wlm 17.11.2 on Ubuntu 20 on a public cloud > infrastruc

[slurm-users] (no subject)

2021-07-30 Thread Soichi Hayashi
RealMemory=60388 PartitionName=cloud LLN=YES Nodes=slurm9-compute[1-15] Default=YES MaxTime=48:00:00 State=UP Shared=YES I appreciate your assistance! Soichi Hayashi Indiana University