Hi,

Am 23.08.2014 um 16:09 schrieb Pengcheng Wang:

> I need to run a single driver program that only require one proc with the 
> command mpirun -np 1 ./app or ./app. But it will schedule the launch of other 
> executable files including parallel and sequential computing. So I require 
> more than one proc to run it. It can be run smoothly as an interactive job 
> with the command below.
> 
> qrsh -cwd -pe "ompi*" 6 -l h_rt=00:30:00,test=true ./app
> 
> But after I submitted the job, a strange error occurred and it stopped... 
> Please find the job script and error message below:
> 
> • job submission script:
> #$ -S /bin/bash
> #$ -N couple
> #$ -cwd
> #$ -j y
> #$ -l h_rt=05:00:00
> #$ -l h_vmem=2G

Is a simple hello_world program listing the threads working? Does it work 
without the h_vmem limit?


> #$ -o couple.out
> #$ -pe ompi*  6

Which PEs can be addressed here? What are their allocation rules (looks like 
you need "$pe_slots").

What version of SGE?
What version of Open MPI?
Compiled with --with-sge?

For me it's working in either way.

-- Reuti


>     ./app
> 
> error message:
> error: executing task of job 6777095 failed:
> [cl231:23777] ERROR: A daemon on node cl231 failed to start as expected.
> [cl231:23777] ERROR: There may be more information available from
> [cl231:23777] ERROR: the 'qstat -t' command on the Grid Engine tasks.
> [cl231:23777] ERROR: If the problem persists, please restart the
> [cl231:23777] ERROR: Grid Engine PE job
> [cl231:23777] ERROR: The daemon exited unexpectedly with status 1.
> 
> Thanks for any help!
> 
> Pengcheng
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post: 
> http://www.open-mpi.org/community/lists/users/2014/08/25141.php

Reply via email to