[slurm-dev] Re: cgroup freezer throwing "Device or resource busy" upon job cancel or kill - 14.03.6

2014-08-13 Thread David Bigagli
Interesting indeed. Let me have a look at it and experiment with it a bit. On 08/13/2014 04:16 PM, Kilian Cavalotti wrote: On Wed, Aug 13, 2014 at 10:00 AM, David Bigagli wrote: For some reason at the first attempt rmdir(2) returns EBUSY. Would writing to memory.force_empty before calling

[slurm-dev] Re: cgroup freezer throwing "Device or resource busy" upon job cancel or kill - 14.03.6

2014-08-13 Thread Kilian Cavalotti
On Wed, Aug 13, 2014 at 10:00 AM, David Bigagli wrote: > > For some reason at the first attempt rmdir(2) returns EBUSY. Would writing to memory.force_empty before calling rmdir() help? See http://lxr.free-electrons.com/source/Documentation/cgroups/memory.txt?v=2.6.32#L269 Cheers, -- Kilian

[slurm-dev] Account / partition association on heterogeneous clusters

2014-08-13 Thread Jesse Stroik
Our cluster has two primary groups of users. The users groups each have a different account from which we designate shares and for which we provide accounting information. We are in the process of adding nodes for which CPU time has a very different practical value to the end users. If users

[slurm-dev] Re: cgroup freezer throwing "Device or resource busy" upon job cancel or kill - 14.03.6

2014-08-13 Thread David Bigagli
For some reason at the first attempt rmdir(2) returns EBUSY. On 08/12/2014 11:05 PM, Kilian Cavalotti wrote: On Tue, Aug 12, 2014 at 6:56 PM, Trey Dockendorf wrote: This is slurm-14.03.6 running CentOS 6.5 kernel 2.6.32-431.23.3.el6.x86_64 Exact same behavior here, same Slurm version and

[slurm-dev] Re: cgroup freezer throwing "Device or resource busy" upon job cancel or kill - 14.03.6

2014-08-13 Thread Trey Dockendorf
Kilian, Thanks for confirming that others are seeing this. - Trey = Trey Dockendorf Systems Analyst I Texas A&M University Academy for Advanced Telecommunications and Learning Technologies Phone: (979)458-2396 Email: treyd...@tamu.edu Jabber: treyd...@tamu.edu

[slurm-dev] RE: slurm-dev Slurm configuration questions, was Re:

2014-08-13 Thread Williams, Kevin E. (Federal SIP)
Thanks for that. I was seeing a lot of newer messages without the [slurm-dev] header. Very annoying, but as a neophyte, I was mute on the subject… ;-) From: Riebs, Andy Sent: Wednesday, August 13, 2014 10:15 AM To: slurm-dev Subject: [slurm-dev] slurm-dev Slurm configuration questions, was Re

[slurm-dev] Re:

2014-08-13 Thread Andy Riebs
Hi Erica, You'll find much of this discussion takes place frequently, most recently about a week ago. To get started, [*]It looks like Slurm can't find a mail program. Use $ scontrol show config | grep MailProg to see what program Slurm is looking for. [*]You probably

[slurm-dev] slurm-dev Slurm configuration questions, was Re:

2014-08-13 Thread Andy Riebs
Oops; the other essential guideline for getting help is to include a meaningful subject line! On 08/13/2014 10:12 AM, Andy Riebs wrote: Hi Erica, You'll find much of this discussion takes place frequently, most recently about a week ago. To get started, [*]It

[slurm-dev]

2014-08-13 Thread Erica Riello
Hi all, I've installed slurm, and I when I try to start slurmctld, I get these errors: > slurmctld -D - slurmctld: pidfile not locked, assuming no running daemon slurmctld: error: Configured MailProg is invalid slurmctld: error: Job accounting information gathered, but not stored slurmctld: f

[slurm-dev] Allowing users to suspend/resume jobs

2014-08-13 Thread dhvanika.shah
Hi SLURM Users, I need a favor. My customer needs their all users to do suspend/resume as and when required. Can anyone please help how do I configure this in SLURM? Regards Dhvani The information contained in this electronic message and any attachments to this message are intended for th