I don't know about the grid engine/ SGE.
However, in Torque, the batch/resource manager I use,
to allow oversubscription, you need to modify the batch server nodes file 
and pretend the nodes have more cores than the physical ones.
[Something like 'node01 np=8' would change to 'node01 np=16' for instance.]
Maybe there is something similar in SGE.

We had bad results [program hanging or aborting] 
when trying to run large programs which include PDE solvers
[climate models] and allowing oversubscription, even when a substantial amount
of  RAM was idle.  
That was a while ago, and I have not pursued the issue any further.
Maybe context switching among the [surplus of] processes is the problem.
Of course for 'hello, world' type of programs oversubscription works well.
Where is the threshold when oversubscription makes a program break down, 
I'd guess only trial and error may tell.

I hope this helps,
Gus Correa

On Dec 23, 2011, at 2:42 PM, Santosh Ansumali wrote:

> Dear All,
>        We are running a PDE solver which is memory bound. Due to
> cache related issue,   smaller  number of grid point per core leads to
> better performance for this code.  Thus, though available memory per
> core is more than 2 GB, we are able to good  performance   by using
> less than 1 GB per core.
> 
> I want to know whether oversubscribing the cores can potentially
> improve performance of such a code.  My thinking is that if I
> oversubscribe the cores,  each thread will be using less than 1 GB so
> cache related problems will be less severe.  Is this logic correct or
> due to cache conflict performance will deteriorate further?
>      In case, over-subscription can help, how shall I modify
> submission file (using sun grid engine) to enable over-subscription of
> cores?
> my current submission file is written as follows
> #!/bin/bash
> #$ -N first
> #$ -S /bin/bash
> #$ -cwd
> #$ -e $JOB_ID.$JOB_NAME.ERROR
> #$ -o $JOB_ID.$JOB_NAME.OUTPUT
> #$ -P faculty_prj
> #$ -p 0
> #$ -pe orte 8
> /opt/mpi/openmpi/1.3.3/gnu/bin/mpirun -np $NSLOTS ./test_vel.out
> 
> Is it possible to allow over-subscription by modifying submission file
> itself?  Or do I need to change hostfiles somehow?
> Thanks for your help!
> Best Regards
> Santosh Ansumali,
> Faculty Fellow,
> Engineering Mechanics Unit
> Jawaharlal Nehru Centre for Advanced Scientific Research (JNCASR)
>  Jakkur, Bangalore-560 064, India
> Tel: + 91 80 22082938
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users


Reply via email to