Re: [slurm-users] PMIX with heterogeneous jobs

2019-07-16 Thread Mehlberg, Steve
s 0 slurmstepd: error: porthos [1] pmixp_server.c:930 [_process_server_request] mpi/pmix: ERROR: 0x146fdc016bd0: unexpected contrib from porthos:1, coll->seq=0, seq=0 On Tuesday, July 16, 2019, 09:49:59 AM EDT, Mehlberg, Steve mailto:steve.mehlb...@atos.net>> wrote: Has anyone been able t

[slurm-users] PMIX with heterogeneous jobs

2019-07-16 Thread Mehlberg, Steve
Has anyone been able to run an MPI job using PMIX and heterogeneous jobs successfully with 19.05 (or even 18.08)? I can run without heterogeneous jobs but get all sorts of errors when I try and split the job up. I haven't used MPI/PMIX much so maybe I'm missing something? Any ideas? [slurm@tre

Re: [slurm-users] Heterogeneous job one MPI_COMM_WORLD

2018-10-10 Thread Mehlberg, Steve
I got this same error when testing on older updates (17.11?). Try the Slurm-18.08 branch or master. I'm testing 18.08 now and get this: [slurm@trek6 mpihello]$ srun -phyper -n3 --mpi=pmi2 --pack-group=0-2 ./mpihello-ompi2-rhel7 | sort srun: job 643 queued and waiting for resources srun: job 64