I'm pretty sure that a single, central slurmdbd service is required for multiple, federated clusters. I think that's what ties multiple clusters together into a single "federation".
You mention a problem with squeue, but you don't list the error messages. Are you sure that all nodes have identical slurm.conf, and that daemons have been restarted after changes? You may want to consult my Slurm Wiki at https://wiki.fysik.dtu.dk/niflheim/SLURM for configuration details.
Caveat: I just heard the talk at the SLUG conference, but I have no intention of working with federated clusters myself. So I can't help you. Commercial support from SchedMD is recommended, see https://www.schedmd.com/services.php
/Ole On 10/31/2017 06:36 PM, zhangtao102...@126.com wrote:
Thank you very much, Ole I have read this PDF document, but i'm not sure about the configuration. I guess the two slurmctld should be configured to use the same slurmdbd. Is it right? Or which is the right way? Thanks,regards ------------------------------------------------------------------------ zhangtao102...@126.com *From:* Ole Holm Nielsen <mailto:ole.h.niel...@fysik.dtu.dk> *Date:* 2017-10-31 19:08 *To:* slurm-dev <mailto:slurm-dev@schedmd.com> *Subject:* [slurm-dev] Re: question about federation On 10/31/2017 09:34 AM, zhangtao102...@126.com wrote:> I have noticed that slurm v17.11 will federated cluster, but i > cann't find detailed documentation about it. > Now, i have 2 question about federated cluster: > (1) When configuring federated cluster, should i configure the two > slurmctld communicated with the same slurmdbd (or make each cluster's > slurmctld/slurmdbd worked with the same mysql database)? Federation support was described at the Slurm User Group Meeting last month. PDFs of the presentations are online at http://slurm.schedmd.com/publications.html See the talk: Technical: Federated Cluster Support, Brian Christiansen and Danny Auble, SchedMD. Maybe this will help you? /Ole