Re: [slurm-users] Slurm Environment Variable for Memory
On Monday, 20 August 2018 9:21:39 PM AEST Juan A. Cordero Varelaq wrote: > I am just running an interactive job with "srun -I --pty /bin/bash" and > then run "echo $SLURM_MEM_PER_NODE", but it shows nothing. Does it have > to be defined in any conf file? My apologies - I've just tested here (with Slurm 17.11.7) and you are indeed correct, they only appear when launched with sbatch and salloc and not when you launch jobs directly with srun! Also you only ever get ${SLURM_MEM_PER_CPU} *or* ${SLURM_MEM_PER_NODE} but not both together, so you'll need to check for both. Hope this helps! All the best, Chris -- Chris Samuel : http://www.csamuel.org/ : Melbourne, VIC
Re: [slurm-users] Job cannot start on slurm v18.08.0pre2
Hello, I can try to tell from PMIx/UCX perspective. Do you have "MPI=pmix" parameter in your slurm.conf or have you specified "--mpi=pmix" in your srun command? If not - you are not running PMIx and thus UCX (UCX support is only in the PMIx plugin). I think this is confirmed by the log output that you have provided, I don't see any traces of PMIx plugin. пт, 17 авг. 2018 г. в 20:43, zhangtao102...@126.com : > Hi, > I have installed SLURM 18.08.0-0pre2 on a my cluster based on RHEL7.4 > (x86_64). > My configure parameters likes this: > ./configure --prefix=/opt/slurm17 --with-munge=/opt/munge > --with-pmix=/opt/pmix --with-ucx=/opt/openucx --with-hwloc=/usr > (openucx version is 1.5.0, pmix version is 3.0.0, hwloc version is 1.11.8) > > After completing the installation and configuration, it looks like slurm > is working normally. But when I submitted a simple test job with sbatch > sleep.sh(just call srun sleep 30 at single computing node), I found that > the job (ID=1032) state was R, but the job did not start normally on the > computation node (no process found). > > The appendix is the output log of the computing node of the management > node. > I can't tell if the cause of this problem is related to the compilation > parameters I specify (such as pmix, ucx), and I've never seen anything > similar in earlier versions. > Has anyone ever responded to a similar phenomenon with me? How to solve > the problem? > > Best regards > > -- > zhangtao102...@126.com > -- С Уважением, Поляков Артем Юрьевич Best regards, Artem Y. Polyakov
[slurm-users] Slurm release candidate version 18.08.0rc1 available for testing
We are pleased to announce the availability of Slurm release candidate version 18.08.0rc1. This is the first release candidate version of the upcoming 18.08 release series, and represents the end of development for the release cycle, and a finalization of the RPC and state file formats. If any issues are identified with this new release candidate, please report them through https://bugs.schedmd.com against the 18.08.x version and we will address them before the first production 18.08.0 release is made. Please note that the release versions are not intended for production use. Barring any late-discovered issues, the state file formats should not change between now and 18.08.0 and are considered frozen at this time for the 18.08 release. Slurm can be downloaded from https://www.schedmd.com/downloads.php . - Tim -- Tim Wickberg Chief Technology Officer, SchedMD LLC Commercial Slurm Development and Support