Dear open-mpi user,
I am running a CPMD calculation in parallel. I got the following error and
job got killed. Below I have given the error message. What is this error
and how to fix it ?

/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left
on device
[compute-0-6.local:17488] opal_os_dirpath_create: Error: Unable to create
the sub-directory
(/data/20952.1.all.q/openmpi-sessions-sudhirs@compute-0-6.local_0) of
(/data/20952.1.all.q/openmpi-sessions-sudhirs@compute-0-6.local_0/43063/0/0),
mkdir failed [1]
[compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file
util/session_dir.c at line 101
[compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file
util/session_dir.c at line 425
[compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file
ess_hnp_module.c at line 273
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  orte_session_dir failed
  --> Returned value Error (-1) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
[compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file
runtime/orte_init.c at line 132
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  orte_ess_set_name failed
  --> Returned value Error (-1) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
[compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file
orterun.c at line 473
rm: cannot remove `/data/20952.1.all.q/rsh': No such file or directory

Thanks
-- 
Sudhir Kumar Sahoo
Ph.D Scholar
Dept. Of Chemistry
IIT Kanpur-208016

Reply via email to