-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Am 24.03.2017 um 22:11 schrieb Joshua Baker-LePain: > On Fri, 24 Mar 2017 at 1:03pm, Reuti wrote > >>> Is this expected behavior? Or is something wonky with the cgroups here? >>> Thanks for any insights. > > And the mystery deepens. After changing execd_params to turn off > "USE_CGROUPS", I tried restarting the exec daemons on the compute nodes (just > to make sure the change was propagated, which I see now from the man page > isn't necessary). However, the daemons failed to restart on some of the > nodes that aren't also admin hosts (do they have to be now?). When the > testing showed that the commands generated output now on the nodes with > restarted exec daemons, I turned "USE_CGROUPS" back on and restarted the > daemons again... and the commands *still* work. So it seems to be restarting > the daemons that "fixed" the issue, not the cgroups change. Color me even > more confused. > >> You can try to use `strace` to call the two applications in question, maybe >> it give some hints about their behavior. > > Good idea. One result of the above shenanigans is that I currently have > nodes where these commands work, and ones where they don't (because those > exec daemons never got restarted). This is the only difference that looks > relevant. In case the execds are restarted, a different environment might be set (the raw one during boot, or the one of the root user [in case it got restarted by hand later on]). As these are inherited to the execd, they may behave different (limits, environment variables like $PATH/$LD_LIBRARY_PATH…). - -- Reuti -----BEGIN PGP SIGNATURE----- Comment: GPGTools - https://gpgtools.org iEYEARECAAYFAljVmogACgkQo/GbGkBRnRpAeQCgh26Ppnge/1RQw99OEmvs5kIh phQAoMOs/HtdldAwTeEM9MEn2DbNAItF =Z1jt -----END PGP SIGNATURE----- _______________________________________________ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users