Not sure what exactly you mean by “LA”, but if you use cgroups/taskset, then no matter how many threads the job spawns, they will all be bound to the CPUs you specify. Of course, for an optimal job run, you should allocate CPUs in advance.
O.

From: Mike Serkov <serko...@gmail.com>
Sent: Thursday, August 29, 2019 11:28 AM
To: Ondrej Valousek <ondrej.valou...@adestotech.com>
Cc: Dietmar Rieder <dietmar.rie...@i-med.ac.at>; users@gridengine.org
Subject: Re: [gridengine users] limit CPU/slot resource to the number of reserved slots

Also, something to keep in mind: cgroups will not solve this issue completely. It is just affinity enforcement. If the job spawns multiple threads and they are all active, it will cause LA to grow, as well as some other side effects, regardless of the affinity setting. On big SMP boxes it may actually cause more instability. In any case, jobs should be configured to use exactly the number of threads they request, and this should be monitored.

Best regards,
Mikhail Serkov

On Aug 29, 2019, at 4:16 AM, Ondrej Valousek <ondrej.valou...@adestotech.com> wrote:

Also a quick note: cgroups is the way to _enforce_ CPU affinity. For the vast majority of jobs, I would say a simple taskset configuration (i.e. something like “-l binding linear”) would do as well.

From: Dietmar Rieder <dietmar.rie...@i-med.ac.at>
Sent: Thursday, August 29, 2019 9:37 AM
To: users@gridengine.org; Ondrej Valousek <ondrej.valou...@adestotech.com>; users <users@gridengine.org>
Subject: Re: [gridengine users] limit CPU/slot resource to the number of reserved slots

Great, thanks so much!

Dietmar

On 29 August 2019 09:05:35 CEST, Ondrej Valousek <ondrej.valou...@adestotech.com> wrote:

Nope, SoGE (as of 8.1.9) supports cgroups without any code changes; just add “USE_CGROUPS=yes” to the exec parameter list to make the shepherd use the cgroup cpuset controller.
My patch only extends it to support systemd, and hence the possibility to hard-enforce memory/CPU limits, etc.

Hth,
Ondrej

From: Daniel Povey <dpo...@gmail.com>
Sent: Monday, August 26, 2019 10:12 PM
To: Dietmar Rieder <dietmar.rie...@i-med.ac.at>; Ondrej Valousek <ondrej.valou...@adestotech.com>; users <users@gridengine.org>
Subject: Re: [gridengine users] limit CPU/slot resource to the number of reserved slots

I don't think it's supported in Son of GridEngine. Ondrej Valousek (cc'd) described in the first thread here
http://arc.liv.ac.uk/pipermail/sge-discuss/2019-August/thread.html
how he was able to implement it, but it required code changes, i.e. you would need to figure out how to build and install SGE from source, which is a task in itself.

Dan

On Mon, Aug 26, 2019 at 12:46 PM Dietmar Rieder <dietmar.rie...@i-med.ac.at> wrote:

Hi,

thanks for your reply. This sounds promising. We are using Son of Grid Engine though. Can you point me to the right docs to get cgroups enabled on the exec host (CentOS 7)? I must admit I have no experience with cgroups.

Thanks again
Dietmar

On 8/26/19 4:03 PM, Skylar Thompson wrote:
> At least for UGE, you will want to use the CPU set integration, which will
> assign the job to a cgroup that has one CPU per requested slot. Once you
> have cgroups enabled in the exec host OS, you can then set these options in
> sge_conf:
>
> cgroup_path=/cgroup
> cpuset=1
>
> You can use this mechanism to have the m_mem_free request enforced as well.
>
> On Mon, Aug 26, 2019 at 02:15:22PM +0200, Dietmar Rieder wrote:
>> Hi,
>>
>> maybe this is a stupid question, but I'd like to limit the used/usable
>> number of cores to the number of slots that were reserved for a job.
>>
>> We often see that people reserve 1 slot, e.g. "qsub -pe smp 1 [...]"
>> but their program is then running in parallel on multiple cores. How can
>> this be prevented? Is it possible that with reserving only one slot a
>> process cannot utilize more than this?
>>
>> I was told that this should be possible in Slurm (which we don't have,
>> and to which we don't want to switch currently).
>>
>> Thanks
>> Dietmar
>

--
_________________________________________
D i e t m a r R i e d e r, Mag.Dr.
Innsbruck Medical University
Biocenter - Institute of Bioinformatics
Email: dietmar.rie...@i-med.ac.at
Web: http://www.icbi.at
_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

--
________________________________
D i e t m a r R i e d e r, Mag.Dr.
Innsbruck Medical University
Biocenter - Institute of Bioinformatics
Innrain 80, 6020 Innsbruck
Phone: +43 512 9003 71402
Fax: +43 512 9003 73100
Email: dietmar.rie...@i-med.ac.at
Web: http://www.icbi.at
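
The claim at the top of the thread, that every thread a job spawns stays bound to the CPUs chosen via taskset/cgroups, can be checked with a small sketch. It is Linux-only and is not part of any Grid Engine tooling; the function name pin_and_check is made up for illustration. It pins the calling thread to one of its currently allowed CPUs with the same system call that taskset uses, starts a few threads, and records the affinity mask each one sees.

```python
import os
import threading


def pin_and_check(n_threads=4):
    """Pin the caller to one CPU and return (cpu, masks seen by new threads).

    Hypothetical helper for illustration; Linux only.
    """
    original = os.sched_getaffinity(0)      # CPUs we are allowed to run on now
    cpu = min(original)                     # arbitrary pick among the allowed CPUs
    seen = []
    lock = threading.Lock()

    def worker():
        # For sched_getaffinity, pid 0 means "the calling thread".
        mask = os.sched_getaffinity(0)
        with lock:
            seen.append(mask)

    try:
        os.sched_setaffinity(0, {cpu})      # what `taskset -c <cpu>` does underneath
        threads = [threading.Thread(target=worker) for _ in range(n_threads)]
        for t in threads:
            t.start()
        for t in threads:
            t.join()
    finally:
        os.sched_setaffinity(0, original)   # restore the caller's original mask
    return cpu, seen


if __name__ == "__main__":
    cpu, masks = pin_and_check()
    print("pinned to CPU", cpu, "- thread masks:", masks)
```

Threads started after the call inherit the mask, so all of them compete for the same CPU. That is the enforcement described above, and also why the load still grows when a job starts more threads than it has slots.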
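
On the advice that jobs should use exactly the number of threads they request: one common way to do this, sketched below, is to derive the thread count from the NSLOTS variable that Grid Engine sets in the job environment and pass it on to the threading libraries. The helper name thread_env, the list of variables, and the fallback to 1 when NSLOTS is unset are assumptions for illustration, not something prescribed in this thread.

```python
import os

# Environment variables read by common threading runtimes
# (OpenMP, OpenBLAS, Intel MKL). Extend as needed for your software.
THREAD_VARS = ("OMP_NUM_THREADS", "OPENBLAS_NUM_THREADS", "MKL_NUM_THREADS")


def thread_env(environ=None):
    """Return thread-limit settings matching the slots granted to the job.

    Hypothetical helper: NSLOTS is set by Grid Engine inside a job;
    falling back to 1 outside a job is an assumption made here.
    """
    environ = os.environ if environ is None else environ
    nslots = int(environ.get("NSLOTS", "1"))
    if nslots < 1:
        raise ValueError("NSLOTS must be a positive integer, got %d" % nslots)
    return {var: str(nslots) for var in THREAD_VARS}


if __name__ == "__main__":
    # Apply the limits before importing numerical libraries, since these
    # variables are typically read when the library is loaded.
    os.environ.update(thread_env())
    print({var: os.environ[var] for var in THREAD_VARS})
```

With "qsub -pe smp 4" this would yield 4 for each variable, so the program's thread pool matches the reservation instead of the core count of the host.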