Folks,
my goal is to run a parallel job on a cluster of KNL nodes configured
with the same cluster *and* memory mode.
at first, i made a simple prototype with 8 nodes, and the four following
features : north, east, west and south.
each node is part of one quadrant, and there are two node
Mahmood,
fwiw, slurm provides torque compatible commands (qsub, qstat, pbsnodes)
that can help your users to transition from torque to slurm :
your users can submit torque scripts on your slurm cluster
qsub script.pbs
until they move to slurm
sbatch script.slurm
Cheers
Gilles
On 4/16/2
Victor,
in your slurm.conf, you should have a line like this one
NodeName=n[1-4] Feature=knl Sockets=1 CoresPerSocket=68 State=UNKNOWN
at first, make sure your regular Xeon nodes do *not* have the 'knl' feature
i guess an other option is not to have the
NodeFeaturesPlugins=knl_generic
lin
Diego,
this is probably due to a change in the PMI interface.
i suggest you rebuilt your MPI library first, and then try again
Cheers,
Gilles
On 7/19/2017 4:20 PM, Diego Zuccato wrote:
Hello all.
I've just upgraded from Debian 8 to Debian 9, and that upgraded slurm
from 14.03 to 16.05.
what if you manually run the hostname command on all your hosts (e.g. do
not use slurm) ?
do you get the expected result ?
On 7/28/2017 10:51 AM, 허웅 wrote:
I also have tried to run a job at specific node like:
srun --nodelist=sgo2 hostname
and here is my slurmctld log file
[2017-07-28T
what if you use this in your slurm.conf instead ?
# COMPUTE NODES
NodeName=GO[1-5]
# PARTITIONS
PartitionName=party Default=yes Nodes=GO[1-5]
On 7/28/2017 9:28 AM, 허웅 wrote:
=?utf-8?B?V2h5IG15IHNsdXJtIGlzIHJ1bm5pbmcgb24gb25seSBvbmUgbm9kZT8=?= I
have 5 nodes include control node.
and my n
y in slurm.conf. See
man slurm.conf look for frontendname.
-Original Message-
*From:* "Gilles Gouaillardet"
*To:* "slurm-dev";
*Cc:*
*Sent:* 2017-07-28 (금) 11:32:26
*Subject:* [slurm-dev] Re: Why my slurm is running on only one node?
what if you use this in your slurm.
fwiw, with Open MPI, ib0 can be selected with
export OMPI_MCA_btl_openib_if_include=ib0
assuming slurm was not configured not to export this environment variable
Gilles
On 10/25/2017 12:55 PM, Paul Hargrove wrote:
Re: [slurm-dev] Re: Selecting a network interface with srun
The "-iface ib0"