Hi, All, I am gathering hardware requirements for head nodes for my next cluster. The new cluster will have ~1500 nodes. We ran 5 million jobs last year. I plan to run the slurmctld on one node and the slurmdbd on another. I also plan to write the StateSaveLocation to an NFS appliance. Does the following configuration look sufficient?
Node1: slurmctld 128GB ram 2TB local disk 12 core high clock rate CPU Node2: slurmdbd slurmctld backup 128GB ram 2TB local disk mirrored 500GB SSD for database 12 core high clock rate CPU Do I need more RAM in either node? Is12 cores enough? Is 500GB large enough for the Slurm database? -- Dan Barker ARC-TS