Hi all! I've successfully managed to configure slurm on one head node and
two different compute nodes, one using "old" consumer RTX cards, a new one
using 4xA100 GPUS (80gb version).
I am now trying to set up a hybrid MIG configuration, where devices 0,1 are
kept as is, while 2 and 3 are split into
Our cluster has some nodes separated to their own partition for running
interactive sessions, which are required to be short and only use a few
nodes.
I've always disliked this approach because I see some of the interactive
nodes being idle while other jobs are waiting on the batch partition.