Hi, Am 23.08.2014 um 16:09 schrieb Pengcheng Wang:
> I need to run a single driver program that only require one proc with the > command mpirun -np 1 ./app or ./app. But it will schedule the launch of other > executable files including parallel and sequential computing. So I require > more than one proc to run it. It can be run smoothly as an interactive job > with the command below. > > qrsh -cwd -pe "ompi*" 6 -l h_rt=00:30:00,test=true ./app > > But after I submitted the job, a strange error occurred and it stopped... > Please find the job script and error message below: > > • job submission script: > #$ -S /bin/bash > #$ -N couple > #$ -cwd > #$ -j y > #$ -l h_rt=05:00:00 > #$ -l h_vmem=2G Is a simple hello_world program listing the threads working? Does it work without the h_vmem limit? > #$ -o couple.out > #$ -pe ompi* 6 Which PEs can be addressed here? What are their allocation rules (looks like you need "$pe_slots"). What version of SGE? What version of Open MPI? Compiled with --with-sge? For me it's working in either way. -- Reuti > ./app > > error message: > error: executing task of job 6777095 failed: > [cl231:23777] ERROR: A daemon on node cl231 failed to start as expected. > [cl231:23777] ERROR: There may be more information available from > [cl231:23777] ERROR: the 'qstat -t' command on the Grid Engine tasks. > [cl231:23777] ERROR: If the problem persists, please restart the > [cl231:23777] ERROR: Grid Engine PE job > [cl231:23777] ERROR: The daemon exited unexpectedly with status 1. > > Thanks for any help! > > Pengcheng > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2014/08/25141.php