On Thu, Apr 23, 2020 at 1:43 PM Michael Robbert <mrobb...@mines.edu> wrote:

> It looks like you have hyper-threading turned on, but haven’t defined the
> ThreadsPerCore=2. You either need to turn off Hyper-threading in the BIOS
> or changed the definition of ThreadsPerCore in slurm.conf.
>

Nice find. node003 has hyper threading enabled but node001 and node002 do
not:
[root@node001 ~]# dmidecode -t processor | grep -E '(Core Count|Thread
Count)'
        Core Count: 12
        Thread Count: 12
        Core Count: 12
        Thread Count: 12

[root@node003 ~]# dmidecode -t processor | grep -E '(Core Count|Thread
Count)'
        Core Count: 12
        Thread Count: 24
        Core Count: 12
I found a great mini script <https://serverfault.com/a/792264/359447> to
disable hyperthreading without reboot. I did get the following warning but
I don't think it's a big issue:
 WARNING, didn't collect load info for all cpus, balancing is broken

Do I have to restart slurmctl on the head node and/or slurmd on node003?

Side question, are there ways with Slurm to test if hyperthreading improves
performance and job speed?

>

Reply via email to