-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Am 24.03.2017 um 22:11 schrieb Joshua Baker-LePain:

> On Fri, 24 Mar 2017 at 1:03pm, Reuti wrote
> 
>>> Is this expected behavior?  Or is something wonky with the cgroups here? 
>>> Thanks for any insights.
> 
> And the mystery deepens.  After changing execd_params to turn off 
> "USE_CGROUPS", I tried restarting the exec daemons on the compute nodes (just 
> to make sure the change was propagated, which I see now from the man page 
> isn't necessary).  However, the daemons failed to restart on some of the 
> nodes that aren't also admin hosts (do they have to be now?).  When the 
> testing showed that the commands generated output now on the nodes with 
> restarted exec daemons, I turned "USE_CGROUPS" back on and restarted the 
> daemons again... and the commands *still* work.  So it seems to be restarting 
> the daemons that "fixed" the issue, not the cgroups change. Color me even 
> more confused.
> 
>> You can try to use `strace` to call the two applications in question, maybe 
>> it give some hints about their behavior.
> 
> Good idea.  One result of the above shenanigans is that I currently have 
> nodes where these commands work, and ones where they don't (because those 
> exec daemons never got restarted).  This is the only difference that looks 
> relevant.

In case the execds are restarted, a different environment might be set (the raw 
one during boot, or the one of the root user [in case it got restarted by hand 
later on]). As these are inherited to the execd, they may behave different 
(limits, environment variables like $PATH/$LD_LIBRARY_PATH…).

- -- Reuti
-----BEGIN PGP SIGNATURE-----
Comment: GPGTools - https://gpgtools.org

iEYEARECAAYFAljVmogACgkQo/GbGkBRnRpAeQCgh26Ppnge/1RQw99OEmvs5kIh
phQAoMOs/HtdldAwTeEM9MEn2DbNAItF
=Z1jt
-----END PGP SIGNATURE-----

_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

Reply via email to