OK. The next question is how touse it with torque (PBS)? currently we write this directive
Nodes=1:ppn=2 which means 4 threads. Then we omit -np and -hostfile in the mpirun command. On 31 Jul 2017 20:24, "Elken, Tom" <tom.el...@intel.com> wrote: > Hi Mahmood, > > > > With the -hostfile case, Open MPI is trying to helpfully run things faster > by keeping both processes on one host. Ways to avoid this… > > > > On the mpirun command line add: > > > > -pernode (runs 1 process per node), oe > > -npernode 1 , but these two has been deprecated in favor of the > wonderful syntax: > > --map-by ppr:1:node > > > > Or you could change your hostfile to: > > cluster slots=1 > > compute-0-0 slots=1 > > > > > > -Tom > > > > *From:* users [mailto:users-boun...@lists.open-mpi.org] *On Behalf Of *Mahmood > Naderan > *Sent:* Monday, July 31, 2017 6:47 AM > *To:* Open MPI Users <users@lists.open-mpi.org> > *Subject:* [OMPI users] -host vs -hostfile > > > > Hi, > > I have stuck at a problem which I don't remember that on previous > versions. when I run a test program with -host, it works. I mean, the > process spans to the hosts I specified. However, when I specify -hostfile, > it doesn't work!! > > mahmood@cluster:mpitest$ /share/apps/computer/openmpi-2.0.1/bin/mpirun -host > compute-0-0,cluster -np 2 a.out > > **************************************************************************** > > * hwloc 1.11.2 has encountered what looks like an error from the operating > system. > > * > > * Package (P#1 cpuset 0xffff0000) intersects with NUMANode (P#1 cpuset > 0xff00ffff) without inclusion! > > * Error occurred in topology.c line 1048 > > * > > * The following FAQ entry in the hwloc documentation may help: > > * What should I do when hwloc reports "operating system" warnings? > > * Otherwise please report this error message to the hwloc user's mailing list, > > * along with the output+tarball generated by the hwloc-gather-topology script. > > **************************************************************************** > > Hello world from processor cluster.hpc.org, rank 1 out of 2 processors > > Hello world from processor compute-0-0.local, rank 0 out of 2 processors > > mahmood@cluster:mpitest$ cat hosts > > cluster > > compute-0-0 > > > > mahmood@cluster:mpitest$ /share/apps/computer/openmpi-2.0.1/bin/mpirun > -hostfile hosts -np 2 a.out > > **************************************************************************** > > * hwloc 1.11.2 has encountered what looks like an error from the operating > system. > > * > > * Package (P#1 cpuset 0xffff0000) intersects with NUMANode (P#1 cpuset > 0xff00ffff) without inclusion! > > * Error occurred in topology.c line 1048 > > * > > * The following FAQ entry in the hwloc documentation may help: > > * What should I do when hwloc reports "operating system" warnings? > > * Otherwise please report this error message to the hwloc user's mailing list, > > * along with the output+tarball generated by the hwloc-gather-topology script. > > **************************************************************************** > > Hello world from processor cluster.hpc.org, rank 0 out of 2 processors > > Hello world from processor cluster.hpc.org, rank 1 out of 2 processors > > > how can I resolve that? > > Regards, > Mahmood > > > _______________________________________________ > users mailing list > users@lists.open-mpi.org > https://rfd.newmexicoconsortium.org/mailman/listinfo/users >
_______________________________________________ users mailing list users@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/users