You need to use the scontrol reboot_nodes functionality, or something similar, to restart your nodes. I suspect you'll see "Node unexpectedly rebooted" in "sinfo -R" for these nodes when you reboot them. SLURM wasn't aware that this restart was going to happen, so it treats it as a problem. As someone else mentioned, ReturnToService is a slurm.conf parameter that also affects this behavior.

________________________________________
From: David Ramírez <[email protected]>
Sent: Friday, July 29, 2016 8:13 AM
To: slurm-dev
Subject: [slurm-dev] Put node "idle" when node restart

Hi.

Which parameter do I need to set in slurm.conf? When some nodes are restarted, sinfo shows them in the "down" state. Doesn't Slurm automatically put a node back to idle when it detects that slurmd is running?

Thanks
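
For reference, the two suggestions in the reply at the top of this thread could look roughly like the following. This is only a sketch: the node name node01 is hypothetical, and the right ReturnToService value depends on site policy.

    # slurm.conf: let a DOWN node return to service as soon as slurmd
    # registers with a valid configuration, whatever the reason it went down.
    # (With ReturnToService=1 this happens only if the node was set DOWN
    # for being non-responsive; with 0 it always needs manual intervention.)
    ReturnToService=2

    # Reboot through Slurm so the controller expects the restart
    scontrol reboot_nodes node01

    # List down/drained nodes together with the recorded reason
    sinfo -R

    # Manually return a node that is already marked down
    scontrol update NodeName=node01 State=RESUME

A change to slurm.conf needs to be picked up by the daemons (for example with "scontrol reconfigure") before it takes effect.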
