Hi,
I am seeing some bizarre behavior on our system and hope that someone can
clarify whether it is related to OMPI or not.

When I issue the mpirun command with -np 2, I can see the output of the
program live on stdout as it is running. However, if I issue the same
command with -np 4, the progress is not shown!


Please see the output below. I ran the 'date' command first and then issued
the command with '-np 4'. After some seconds, I pressed ^C and ran 'date'
again. As you can see, nothing is printed after the "Dump of input data
file" banner. Next, I ran with '-np 2' and after a while I pressed ^C. This
time the progress of the program is shown.




mahmood@cluster:A4$ date
Sat Oct  1 11:26:13 2016
mahmood@cluster:A4$ /share/apps/computer/openmpi-2.0.0/bin/mpirun --hostfile hosts.txt -np 4 /share/apps/chemistry/siesta-4.0/spar/siesta < A.fdf
Siesta Version: siesta-4.0--500
Architecture  : x86_64-unknown-linux-gnu--unknown
Compiler flags: /share/apps/computer/openmpi-2.0.0/bin/mpifort
PP flags      : -DMPI -DFC_HAVE_FLUSH -DFC_HAVE_ABORT
PARALLEL version

* Running on    4 nodes in parallel
>> Start of run:   1-OCT-2016  11:26:23

                           ***********************
                           *  WELCOME TO SIESTA  *
                           ***********************

reinit: Reading from standard input
************************** Dump of input data file ****************************
^CKilled by signal 2.
mahmood@cluster:A4$ date
Sat Oct  1 11:26:30 2016
mahmood@cluster:A4$ /share/apps/computer/openmpi-2.0.0/bin/mpirun --hostfile hosts.txt -np 2 /share/apps/chemistry/siesta-4.0/spar/siesta < A.fdf
Siesta Version: siesta-4.0--500
Architecture  : x86_64-unknown-linux-gnu--unknown
Compiler flags: /share/apps/computer/openmpi-2.0.0/bin/mpifort
PP flags      : -DMPI -DFC_HAVE_FLUSH -DFC_HAVE_ABORT
PARALLEL version

* Running on    2 nodes in parallel
>> Start of run:   1-OCT-2016  11:26:36

                           ***********************
                           *  WELCOME TO SIESTA  *
                           ***********************

reinit: Reading from standard input
************************** Dump of input data file ****************************
SystemLabel      A
NumberOfAtoms    54
NumberOfSpecies  2
%block ChemicalSpeciesLabel
...
...
...
^CKilled by signal 2.
mahmood@cluster:A4$ date
Sat Oct  1 11:26:38 2016
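
For anyone who wants to reproduce the timing without running 'date' by
hand, here is a minimal sketch of a shell helper that prefixes every output
line with the time at which it arrived. The function name `stamp` is my own
and purely illustrative. Note that piping the output may itself change how
it is buffered, so this is only a rough diagnostic.

```shell
# Hypothetical helper: prefix each line read from stdin with the
# wall-clock time (HH:MM:SS) at which the line was received.
stamp() {
  # The "|| [ -n "$line" ]" part keeps a final line that has no trailing newline.
  while IFS= read -r line || [ -n "$line" ]; do
    printf '%s %s\n' "$(date +%H:%M:%S)" "$line"
  done
}

# Intended usage (commented out here, it needs the cluster environment):
#   /share/apps/computer/openmpi-2.0.0/bin/mpirun --hostfile hosts.txt -np 4 \
#       /share/apps/chemistry/siesta-4.0/spar/siesta < A.fdf 2>&1 | stamp
```

Comparing the timestamps of the '-np 2' and '-np 4' runs should show whether
the input dump arrives late or never arrives at all.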





Any idea what causes this? The only thing I change between the two runs is
the mpirun -np switch.

Regards,
Mahmood
