Dear mailing list
I’m running into trouble in the configuration of the small cluster I’m managing.
I’ve installed openmpi-1.8.1 with gcc 4.7 on a Centos 6.5 with infiniband 
support.
Compile and installation were all ok and i can compile and actually run 
parallel jobs, both directly or by submitting them with the queue manager 
(gridengine).
My problem is that when two different subsets of two job end on the same node, 
they will not spread equally and use all the cores of the node but instead they 
will run on a common subset of cores leaving some other totally empty.
For example two 4 core jobs on a 8 core node will result in only 4 core running 
on the node (all of them being oversubscribed) and the other 4 cores being 
empty.
Clearly there must be an error in the way I’ve configured stuffs but i cannot 
find any hint on how to solve the problem.
I’ve tried to do different map (map by core or by slot) but I’ve never 
succeeded.
Could you give a me suggestion on this issue?
Regards
Antonio

________________________________
[http://www.plymouth.ac.uk/images/email_footer.gif]<http://www.plymouth.ac.uk/worldclass>

This email and any files with it are confidential and intended solely for the 
use of the recipient to whom it is addressed. If you are not the intended 
recipient then copying, distribution or other use of the information contained 
is strictly prohibited and you should not rely on it. If you have received this 
email in error please let the sender know immediately and delete it from your 
system(s). Internet emails are not necessarily secure. While we take every 
care, Plymouth University accepts no responsibility for viruses and it is your 
responsibility to scan emails and their attachments. Plymouth University does 
not accept responsibility for any changes made after it was sent. Nothing in 
this email or its attachments constitutes an order for goods or services unless 
accompanied by an official order form.

Reply via email to