Dear open-mpi user, I am running a CPMD calculation in parallel. I got the following error and job got killed. Below I have given the error message. What is this error and how to fix it ?
/opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device /opt/gridengine/mpi/startmpi.sh: line 60: echo: write error: No space left on device [compute-0-6.local:17488] opal_os_dirpath_create: Error: Unable to create the sub-directory (/data/20952.1.all.q/openmpi-sessions-sudhirs@compute-0-6.local_0) of (/data/20952.1.all.q/openmpi-sessions-sudhirs@compute-0-6.local_0/43063/0/0), mkdir failed [1] [compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file util/session_dir.c at line 101 [compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file util/session_dir.c at line 425 [compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file ess_hnp_module.c at line 273 -------------------------------------------------------------------------- It looks like orte_init failed for some reason; your parallel process is likely to abort. There are many reasons that a parallel process can fail during orte_init; some of which are due to configuration or environment problems. This failure appears to be an internal failure; here's some additional information (which may only be relevant to an Open MPI developer): orte_session_dir failed --> Returned value Error (-1) instead of ORTE_SUCCESS -------------------------------------------------------------------------- [compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file runtime/orte_init.c at line 132 -------------------------------------------------------------------------- It looks like orte_init failed for some reason; your parallel process is likely to abort. There are many reasons that a parallel process can fail during orte_init; some of which are due to configuration or environment problems. This failure appears to be an internal failure; here's some additional information (which may only be relevant to an Open MPI developer): orte_ess_set_name failed --> Returned value Error (-1) instead of ORTE_SUCCESS -------------------------------------------------------------------------- [compute-0-6.local:17488] [[43063,0],0] ORTE_ERROR_LOG: Error in file orterun.c at line 473 rm: cannot remove `/data/20952.1.all.q/rsh': No such file or directory Thanks -- Sudhir Kumar Sahoo Ph.D Scholar Dept. Of Chemistry IIT Kanpur-208016