I found a fun head scratcher, with openmpi 1.8.2 with torque 5 built with TM support, on hereto core layouts I get the fun thing: mpirun -report-bindings hostname <-------- Works mpirun -report-bindings -np 64 hostname <--------- Wat? -------------------------------------------------------------------------- A request was made to bind to that would result in binding more processes than cpus on a resource:
Bind to: CORE Node: nyx5518 #processes: 2 #cpus: 1 You can override this protection by adding the "overload-allowed" option to your binding directive. -------------------------------------------------------------------------- I ran with --oversubscribed and got the expected host list, which matched $PBS_NODEFILE and was 64 entires long: mpirun -overload-allowed -report-bindings -np 64 --oversubscribe hostname What did I do wrong? I'm stumped why one works one doesn't but the one that doesn't if your force it appears correct. Brock Palen www.umich.edu/~brockp CAEN Advanced Computing XSEDE Campus Champion bro...@umich.edu (734)936-1985
signature.asc
Description: Message signed with OpenPGP using GPGMail