Thanks for that Chris! :)

Sounds like other than the new requests for gpu specifics, things should just 
work when upgrading to 19.05 as slurm is likely backwards compatible with the 
previous setup gres stuff.

Best,
Chris

—
Christopher Coffey
High-Performance Computing
Northern Arizona University
928-523-1167
 

On 8/12/19, 10:28 PM, "slurm-users on behalf of Chris Samuel" 
<slurm-users-boun...@lists.schedmd.com on behalf of ch...@csamuel.org> wrote:

    On Monday, 12 August 2019 11:42:48 AM PDT Christopher Benjamin Coffey wrote:
    
    > Excuse me if this has been explained somewhere, I did some searching. With
    > 19.05, is there any reason to have gres.conf on the GPU nodes? Is slurm
    > smart enough to enumerate the /dev/nvidia* devices? We are moving to 19.05
    > shortly, any gotchas with GRES and GPUs? Also, I'm guessing now, there is
    > no reason for users to request "--gres:gpu" type stuff anymore and instead
    > use: --gpus=n ?
    
    We do have 19.05 on our GPU nodes, but I've not had time to experiment with 
    the new request syntax just yet.
    
    Regarding configuration it does appear to be that you still need to set 
them 
    up, but if you link Slurm against the nvidia NVML library at compile time 
then 
    there is support for autodetection.
    
    
https://nam05.safelinks.protection.outlook.com/?url=https%3A%2F%2Fslurm.schedmd.com%2Fgres.html&amp;data=02%7C01%7Cchris.coffey%40nau.edu%7Cfc6ede93f45440fdaf1508d71faf0362%7C27d49e9f89e14aa099a3d35b57b2ba03%7C0%7C0%7C637012708851283210&amp;sdata=lUrvaHgA4jSVgvlcSd9GJBBOZ8dSWSHSNl9ee%2Bv4Xo0%3D&amp;reserved=0
    
    # In the case of GPUs, if AutoDetect=nvml in gres.conf and the NVML library
    # is installed on the node and was present during Slurm configuration, the
    # missing configuration details will be automatically gathered using the
    # NVML library. Configuration information about all other generic resource
    # must explicitly be described in the gres.conf file. 
    
    All the best,
    Chris
    -- 
      Chris Samuel  :  
https://nam05.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.csamuel.org%2F&amp;data=02%7C01%7Cchris.coffey%40nau.edu%7Cfc6ede93f45440fdaf1508d71faf0362%7C27d49e9f89e14aa099a3d35b57b2ba03%7C0%7C0%7C637012708851283210&amp;sdata=hnqqFo7C%2FVg60ZmgPZOcianQTcFlcRS5d%2Fl5O4OQCSw%3D&amp;reserved=0
  :  Berkeley, CA, USA
    
    
    
    
    

Reply via email to