On Tue, Jul 27, 2010 at 7:29 PM, Gus Correa <g...@ldeo.columbia.edu> wrote:

> Hi Cristobal
>
> Does it run only on the head node alone?
> (Fuego? Agua? Acatenango?)
> Try to put only the head node on the hostfile and execute with mpiexec.
>
--> i will try only with the head node, and post results back

> This may help sort out what is going on.
> Hopefully it will run on the head node.
>
> Also, do you have Infinband connecting the nodes?
> The error messages refer to the openib btl (i.e. Infiniband),
> and complains of


no we are just using normal network 100MBit/s , since i am just testing yet.

>
> "perhaps a missing symbol, or compiled for a different
> version of Open MPI?".
> It sounds as a mixup of versions/builds.
>

--> i agree, somewhere there must be the remains of the older version

>
> Did you configure/build OpenMPI from source, or did you install
> it with apt-get?
> It may be easier/less confusing to install from source.
> If you did, what configure options did you use?
>

-->i installed from source,
./configure --prefix=/opt/openmpi-1.4.2 --with-sge --without-xgid
--disable--static

>
> Also, as for the OpenMPI runtime environment,
> it is not enough to set it on
> the command line, because it will be effective only on the head node.
> You need to either add them to the PATH and LD_LIBRARY_PATH
> on your .bashrc/.cshrc files (assuming these files and your home directory
> are *also* shared with the nodes via NFS),
> or use the --prefix option of mpiexec to point to the OpenMPI main
> directory.
>

yes, all nodes have their PATH and LD_LIBRARY_PATH set up properly inside
the login scripts ( .bashrc in my case  )

>
> Needless to say, you need to check and ensure that the OpenMPI directory
> (and maybe your home directory, and your work directory) is (are)
> really mounted on the nodes.
>

--> yes, doublechecked that they are

>
> I hope this helps,


--> thanks really!

>
> Gus Correa
>
> Update: i just reinstalled openMPI, with the same parameters, and it seems
> that the problem has gone, i couldnt test entirely but when i get back to
> lab ill confirm.
>

best regards!
Cristobal

Reply via email to