Txema Heredia Genestar <txema.llis...@gmail.com> writes:

> Hi all,
>
> I have a cluster in production running rocks-cluster 6.0 using SGE6.2u5.
> SGE6.2u5 has a bug that kills the qmaster when an amount of jobs using
> both -pe and -hold_jid are used. OGE (theoretically) has this bug
> fixed.

OGE is Oracle Grid Engine, which is rumoured to be trademarked.  Do you
mean that?  If so, you presumably want to talk to Oracle.

I don't know much about Rocks, but it seems to throw up a lot of
grid engine problems and doesn't ship an up-to-date SGE.

If someone can tell me how to build/install for Rocks, I'm happy to host
instructions or binaries for SGE 8.1.4 or newer, which has >1000 patches
over SGE6.2u5/OGS, including
<https://arc.liv.ac.uk/trac/SGE/changeset/3511/sge>, which is probably
the fix in question.

> What is the safest/cleanest way to upgrade from SGE to OGE?

If you want to upgrade from 6.2u5 to 8.1.4, you can do it live
<https://arc.liv.ac.uk/repos/darcs/sge/source/README.upgrade> in a
vanilla system, but I don't know if that will work in Rocks.

-- 
Community Grid Engine:  http://arc.liv.ac.uk/SGE/
_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

Reply via email to