Re: [OMPI users] mpirun hanging followup

2007-07-17 Thread Michael Edwards
On 7/17/07, Bill Johnstone wrote: Thanks for the help. I've replied below. --- "G.O." wrote: > 1- Check to make sure that there are no firewalls blocking > traffic between the nodes. There is no firewall in-between the nodes. If I run jobs directly via ssh, e.g. "ssh node4 env" they wo

Re: [OMPI users] mpirun hanging followup

2007-07-17 Thread Michael Edwards
If you are having difficulty getting openmpi set up yourself, you might look into OSCAR or Rocks, they make setting up your cluster much easier and include various mpi packages as well as other utilities for reducing your management overhead. I can help you (off list) get set up with OSCAR if you

Re: [OMPI users] openmpi fails on mx endpoint busy

2007-07-05 Thread Michael Edwards
If the machine is multi-processor you might want to add the sm btl. That cleared up some similar problems for me, though I don't use mx so your millage may vary. On 7/5/07, SLIM H.A. wrote: Hello I have compiled openmpi-1.2.3 with the --with-mx= configuration and gcc compiler. On testing wi

Re: [OMPI users] Issues with DL POLY

2007-06-07 Thread Michael Edwards
If your problem size is not large enough than any MPI program will perform worse on a "large number" of nodes because of the overhead involved in setting up the problem and network latency. Sometimes that "large number" is as small as two :) I am not at all familiar with DL POLY, but if you make

Re: [OMPI users] Segfault in orted (home directory problem)

2007-06-07 Thread Michael Edwards
That is the default behavior because having common home areas is fairly common, but with some work you can run your code from wherever is convenient. Using the -wd flag you can have the code run from wherever you want, but the code and data has to get there somehow. If you are using a batch sche

[OMPI users] SGE and OFED1.1

2007-06-06 Thread Michael Edwards
I am runing open-mpi 1.1.1-1 compiled from OFED1.1 which I downloaded from their website. I am using SGE installed via OSCAR 5.0 and when running under SGE I get the "mca_mpool_openib_register: ibv_reg_mr(0x59,528384) failed with error: Cannot allocate memory" error discussed at length in you