Jeff,

An update of what I did. Apparently, one of my lab mates installed another
version of OpenMPI manually and it clashed with the OpenMPI I installed from
the Ubuntu repository. I manually identified the files installed and deleted
them. After I installed OpenMPI from Ubuntu repository, my "mpirun.openmpi"
works!

On Fri, Jul 16, 2010 at 4:41 PM, TH Chew <teong...@gmail.com> wrote:

> Jeff,
>
> Thanks for the suggestion. Been looking into it and although, I installed
> the same OpenMPI version. But somehow, another software (Discovery Studio)
> was installed on birg-desktop-10, causing the mpirun to be messed up (since
> Discovery Studio also install some kind of mpirun/mpiexec). I type
> "mpirun.openmpi --version" on birg-desktop-10, the output is:
>
> ####################################################
> birg@birg-desktop-10:~$ mpirun.openmpi --version
> mpirun.openmpi: symbol lookup error: mpirun.openmpi: undefined symbol:
> orted_cmd_line
> ####################################################
>
> and when I type on other machine
>
> ####################################################
> birg@birg-frontnode:~/Desktop/nfs_shared$ mpirun.openmpi --version
> mpirun.openmpi (OpenRTE) 1.4.1
>
> Report bugs to http://www.open-mpi.org/community/help/
> ####################################################
>
> I am now uninstalling Discovery Studio and see whether it works or not.
>
> Thanks again.
>
>
> On Thu, Jul 15, 2010 at 7:15 PM, Jeff Squyres <jsquy...@cisco.com> wrote:
>
>> This usually means that you have mis-matched versions of Open MPI across
>> your machines.  Double check that you have the same version of Open MPI
>> installed on all the machines that you'll be running (e.g., perhaps
>> birg-desktop-10 has a different version?).
>>
>>
>> On Jul 15, 2010, at 5:18 AM, TH Chew wrote:
>>
>> > Hi all,
>> >
>> > I am setting up a 7+1 nodes cluster for MD simulation, specifically
>> using GROMACS. I am using Ubuntu Lucid 64-bit on all machines. Installed
>> gromacs, gromacs-openmpi, and gromacs-mpich from the repository. MPICH
>> version of gromacs runs fine without any error. However, when I ran OpenMPI
>> version of gromacs by
>> >
>> >
>> ###########################################################################
>> > mpirun.openmpi -np 8 -wdir /home/birg/Desktop/nfs/ -hostfile
>> ~/Desktop/mpi_settings/hostfile mdrun_mpi.openmpi -v
>> >
>> ###########################################################################
>> >
>> > an error occur, something like this
>> >
>> >
>> ###########################################################################
>> > [birg-desktop-10:02101] Error: unknown option "--daemonize"
>> > Usage: orted [OPTION]...
>> > Start an Open RTE Daemon
>> >
>> >    --bootproxy <arg0>    Run as boot proxy for <job-id>
>> > -d|--debug               Debug the OpenRTE
>> > -d|--spin                Have the orted spin until we can connect a
>> debugger
>> >                          to it
>> >    --debug-daemons       Enable debugging of OpenRTE daemons
>> >    --debug-daemons-file  Enable debugging of OpenRTE daemons, storing
>> output
>> >                          in files
>> >    --gprreplica <arg0>   Registry contact information.
>> > -h|--help                This help message
>> >    --mpi-call-yield <arg0>
>> >                          Have MPI (or similar) applications call yield
>> when
>> >                          idle
>> >    --name <arg0>         Set the orte process name
>> >    --no-daemonize        Don't daemonize into the background
>> >    --nodename <arg0>     Node name as specified by host/resource
>> >                          description.
>> >    --ns-nds <arg0>       set sds/nds component to use for daemon
>> (normally
>> >                          not needed)
>> >    --nsreplica <arg0>    Name service contact information.
>> >    --num_procs <arg0>    Set the number of process in this job
>> >    --persistent          Remain alive after the application process
>> >                          completes
>> >    --report-uri <arg0>   Report this process' uri on indicated pipe
>> >    --scope <arg0>        Set restrictions on who can connect to this
>> >                          universe
>> >    --seed                Host replicas for the core universe services
>> >    --set-sid             Direct the orted to separate from the current
>> >                          session
>> >    --tmpdir <arg0>       Set the root for the session directory tree
>> >    --universe <arg0>     Set the universe name as
>> >                          username@hostname:universe_name for this
>> >                          application
>> >    --vpid_start <arg0>   Set the starting vpid for this job
>> >
>> --------------------------------------------------------------------------
>> > A daemon (pid 5598) died unexpectedly with status 251 while attempting
>> > to launch so we are aborting.
>> >
>> > There may be more information reported by the environment (see above).
>> >
>> > This may be because the daemon was unable to find all the needed shared
>> > libraries on the remote node. You may set your LD_LIBRARY_PATH to have
>> the
>> > location of the shared libraries on the remote nodes and this will
>> > automatically be forwarded to the remote nodes.
>> >
>> --------------------------------------------------------------------------
>> >
>> --------------------------------------------------------------------------
>> > mpirun.openmpi noticed that the job aborted, but has no info as to the
>> process
>> > that caused that situation.
>> >
>> --------------------------------------------------------------------------
>> >
>> --------------------------------------------------------------------------
>> > mpirun.openmpi was unable to cleanly terminate the daemons on the nodes
>> shown
>> > below. Additional manual cleanup may be required - please refer to
>> > the "orte-clean" tool for assistance.
>> >
>> --------------------------------------------------------------------------
>> >     birg-desktop-04 - daemon did not report back when launched
>> >     birg-desktop-07 - daemon did not report back when launched
>> >     birg-desktop-10 - daemon did not report back when launched
>> >
>> ###########################################################################
>> >
>> > It is strange that it only happen on one of the compute node
>> (birg-desktop-10). If I remove birg-desktop-10 from the hostfile, I can run
>> OpenMPI gromacs successfully. Any idea?
>> >
>> > Thanks.
>> >
>> > --
>> > Regards,
>> > THChew
>> > _______________________________________________
>> > users mailing list
>> > us...@open-mpi.org
>> > http://www.open-mpi.org/mailman/listinfo.cgi/users
>>
>>
>> --
>> Jeff Squyres
>> jsquy...@cisco.com
>> For corporate legal information go to:
>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>
>>
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>>
>
>
>
> --
> Regards,
> THChew
>



-- 
Regards,
THChew

Reply via email to