On 06/01/2012 05:06 PM, Edmund Sumbar wrote:
Thanks for the tips Gus. I'll definitely try some of these, particularly
the nodes:ppn syntax, and report back.


You can check for Torque support with:

mpicc --showme

It should show, among other things, -ltorque [if it
has Torque support] and -lrdmacm -libverbs [if it
has OpenIB/InfiniBand support].

If Torque is not installed in a standard location
[such as /usr or /usr/local],
which is often the case, you may need
to point configure to the Torque library with:

--with-tm=/path/to/torque

Likewise for InfiniBand/OpenIB if you have it:

--with-openib=/path/to/openib

[I am citing these options from memory.
Do a './configure -help' to check the right syntax, please.]

Making a log file of your configure run may help to
diagnose problems.
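Putting the two options and the log file together could look like the sketch below. The directories are placeholders (the same /path/to/... stand-ins used above), configure_args is a helper name invented for this example, and the option spellings should still be confirmed against the configure help output.

```shell
#!/bin/sh
# Placeholder locations; replace with the real install prefixes on your system.
TORQUE_DIR=${TORQUE_DIR:-/path/to/torque}
OPENIB_DIR=${OPENIB_DIR:-/path/to/openib}

# configure_args: print the configure options discussed above, one per line.
# (configure_args is a hypothetical helper name.)
configure_args() {
    echo "--with-tm=$TORQUE_DIR"
    echo "--with-openib=$OPENIB_DIR"
}

# Run configure only from inside a source tree, keeping a full log.
# The unquoted $(...) is intentional: each option becomes one argument,
# which assumes the paths contain no spaces.
if [ -x ./configure ]; then
    ./configure $(configure_args) 2>&1 | tee configure.log
fi
```

Searching configure.log afterwards for "tm" and "openib" should show whether those components were detected.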

Finally, if I remember right, a problem with the
Intel compilers 12.1 was reported on this list.
[I built 1.4.5 with Intel 11 and it works fine.]
However, that problem may have been fixed in
the latest OpenMPI 1.6.0.
[The release notes will tell, or perhaps Jeff.]

I hope this helps,
Gus Correa

Right now, I'm upgrading the Intel Compilers
and rebuilding Open MPI.


On Fri, Jun 1, 2012 at 2:39 PM, Gus Correa <g...@ldeo.columbia.edu> wrote:

    The [Torque/PBS] syntax '-l procs=48' is somewhat troublesome,
    and may not be understood by the scheduler. [It doesn't
    work correctly with Maui, which is what we have here.  I have read
    people saying it works with pbs_sched and with Moab,
    but that's hearsay.]
    This issue comes up very often on the Torque mailing
    list.

    Have you tried instead this alternate syntax?

    '-l nodes=2:ppn=24'

    [I am assuming here that your
    nodes have 24 cores, i.e. 24 'ppn', each]

    Then in the script:
    mpiexec -np 48 ./your_program


    Also, in your PBS script you could print
    the contents of PBS_NODEFILE.

    cat $PBS_NODEFILE


    A simple troubleshooting test is to launch 'hostname'
    with mpirun:

    mpirun -np 48 hostname
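The suggestions so far can be collected into one small job script. This is a sketch under the same assumption as above (2 nodes with 24 cores each); slots_per_host is a helper name invented for illustration.

```shell
#!/bin/sh
#PBS -l nodes=2:ppn=24
# The request above assumes 24 cores per node, as in the message.

# slots_per_host FILE: print "count hostname" for each distinct host in FILE.
# (slots_per_host is a hypothetical helper name.)
slots_per_host() {
    sort "$1" | uniq -c | awk '{ print $1, $2 }'
}

# Inside a Torque job, PBS_NODEFILE names a file with one line per slot.
if [ -n "${PBS_NODEFILE:-}" ] && [ -r "$PBS_NODEFILE" ]; then
    slots_per_host "$PBS_NODEFILE"
    # Launch a trivial command first, if mpirun is available.
    if command -v mpirun >/dev/null 2>&1; then
        mpirun -np 48 hostname | sort | uniq -c
    fi
fi
```

If the allocation matches the request, both listings should show two host names with a count of 24 each. A different count from mpirun than from the node file would suggest mpirun is not picking up the Torque allocation.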

    Finally, are you sure that the OpenMPI you are using was
    compiled with Torque support?
    If not, I wonder if options like '-bynode' would work at all.
    Jeff may correct me if I am wrong, but if your
    OpenMPI lacks Torque support,
    you may need to pass $PBS_NODEFILE to mpirun
    as your hostfile.
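For the case without Torque support, a hostfile-based launch might be assembled as in the sketch below. hostfile_launch is a made-up helper that only prints the command line rather than running it, and ./your_program is the same placeholder used earlier in this message.

```shell
#!/bin/sh
# hostfile_launch FILE PROGRAM: print an mpirun command line that uses FILE
# as the hostfile and starts one process per line of FILE.
# (hostfile_launch is a hypothetical helper; it only prints the command.)
hostfile_launch() {
    # wc -l counts newline-terminated lines, i.e. one per slot in the node file.
    np=$(wc -l < "$1" | tr -d '[:space:]')
    echo "mpirun -np $np -hostfile $1 $2"
}

# Inside a Torque job, print the launch line for the current allocation.
if [ -n "${PBS_NODEFILE:-}" ] && [ -r "$PBS_NODEFILE" ]; then
    hostfile_launch "$PBS_NODEFILE" ./your_program
fi
```

Deriving the process count from the node file keeps -np consistent with whatever the scheduler actually allocated, instead of hard-coding 48.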




--
Edmund Sumbar
University of Alberta
+1 780 492 9360



_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
