We have a cluster with gridengine 6.5u2 and noticing a strange behavior
when running MPI jobs. Our application will finish, yet the processes
continue to run and use up the CPU. We did configure a parallel
environment for MPI as follows:
pe_namempi
slots 500
user_lis
Reuti wrote on 11/14/18 04:53:
Which version of MPICH are you using? Maybe it's not tightly integrated.
It's MPICH 3.2.1 which I believe is tightly integrated.
Thanks
___
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/
Hay, William wrote on 11/14/18 04:21:
Do you have ENABLE_ADDGRP_KILL set? Can be helpful in killing processes left
behind when a job exits.
We don't have that set yet. I will try setting ENABLE_ADDGRP_KILL=TRUE
in the execd_params for the global configuration and see if it helps.
Thanks
__
Hi all,
on the OGS webpage, it says:
> Grid Engine 2011.11 is compatible with Sun Grid Engine 6.2u5. You can even
> run Grid Engine 2011.11 on a subset of nodes while the rest of the cluster
> runs the older 6.2u5 release - ie. as a drop-in replacement for 6.2u5.
>
So, how is that achieved? I've