[slurm-dev] Power save support (not working?)

2014-08-28 Thread Uwe Sauter
Hi all, (configuration and scripts below text) I have configured SLURM to power down idle nodes but it is probably misconfigured. I aim for a configuration where idle nodes are powered down after a certain period (say 10 min). As you can see from the configuration below I have SLURM call either

[slurm-dev] Re: Power save support (not working?)

2014-08-28 Thread Franco Broi
We use power saving, so it definitely works. Maybe you should try turning on debugging for the controller daemon with scontrol and checking the log file. On 28 Aug 2014 19:18, Uwe Sauter wrote: Hi all, (configuration and scripts below text) I have configured SLURM to power down idle nodes but

[slurm-dev] make eio_shutdown_time per eio handle

2014-08-28 Thread Hongjia Cao
The patch changes the global eio_shutdown_time to a field in the eio handle to allow multiple eio handles in one process. This will be convenient for a process that launches multiple job steps. From ec0fe6dc208fe1d501da59d0c8f3c6bf20673005 Mon Sep 17 00:00:00 2001 From: Hongjia Cao Date: Mon, 18 Aug 201

[slurm-dev] HPC Basic Profile

2014-08-28 Thread dhvanika.shah
Does SLURM support HPC Basic Profile? Regards Dhvanika

[slurm-dev] Re: Intel MPI Performance inconsistency (and workaround)

2014-08-28 Thread Holmes, Christopher (CMU)
Andy is right. When you restart the slurmd daemon, it inherits the system limits from your login session, which are different from the default system limits when the daemon is started on boot-up. If you modified /etc/security/limits.conf, or made changes in any of the bash startup scripts to im

[slurm-dev] Re: Intel MPI Performance inconsistency (and workaround)

2014-08-28 Thread Jesse Stroik
Yes, I did test this and can confirm it worked. Thanks. Best, Jesse Stroik University of Wisconsin On 8/28/2014 8:54 AM, Holmes, Christopher (CMU) wrote: Andy is right. When you restart the slurmd daemon, it inherits the system limits from your login session, which are different from the defa

[slurm-dev] Feedback on integration tests systemd/slurm and questions

2014-08-28 Thread Rémi Palancher
Hi developers, You should already know that systemd[1] is the fast-growing init alternative that will be the new default on all major GNU/Linux distributions including RHEL7, CentOS, Fedora, Debian, Ubuntu and so on. Among other things, systemd has notably the particularity to put all processes

[slurm-dev] Re: make eio_shutdown_time per eio handle

2014-08-28 Thread David Bigagli
Code committed to 14.03.8. On 08/28/2014 05:24 AM, Hongjia Cao wrote: The patch changes the global eio_shutdown_time to a field in the eio handle to allow multiple eio handles in one process. This will be convenient for a process that launches multiple job steps. -- Thanks, /David/Bigagli Sl

[slurm-dev] Runnable jobs never start

2014-08-28 Thread Franco Broi
Hi, I've seen this a few times now: we have jobs queued that should be able to run, but they won't start unless I restart the controller daemon. Other jobs submitted more recently seem to be working fine. I can see from the slurmctld log file with debug=9 that they are not being tested to see if they are r