On 29/10/2020 14:06, Qais Yousef wrote: > On 10/29/20 21:02, Yun Hsiang wrote: >> Hi Qais, >> >> On Thu, Oct 29, 2020 at 11:08:18AM +0000, Qais Yousef wrote: >>> Hi Yun >>> >>> Sorry for chipping in late. >>> >>> On 10/25/20 15:36, Yun Hsiang wrote:
[...] >>>> #define SCHED_FLAG_UTIL_CLAMP (SCHED_FLAG_UTIL_CLAMP_MIN | \ >>>> - SCHED_FLAG_UTIL_CLAMP_MAX) >>>> + SCHED_FLAG_UTIL_CLAMP_MAX | \ >>>> + SCHED_FLAG_UTIL_CLAMP_RESET) >>> >>> Is it safe to change this define in a uapi header without a potential >>> consequence? AFAICS, there're 3 occurrences, besides the one in __setscheduler_uclamp(), in which we use SCHED_FLAG_UTIL_CLAMP. 1) call uclamp_validate() in __sched_setscheduler() 2) jump to 'change' label in __sched_setscheduler() 3) check that the uattr->size is SCHED_ATTR_SIZE_VER1 in sched_copy_attr() 2) and 3) require SCHED_FLAG_UTIL_CLAMP_RESET to be part of SCHED_FLAG_UTIL_CLAMP but IMHO 1) needs this change: @@ -1413,8 +1413,14 @@ int sysctl_sched_uclamp_handler(struct ctl_table *table, int write, static int uclamp_validate(struct task_struct *p, const struct sched_attr *attr) { - unsigned int lower_bound = p->uclamp_req[UCLAMP_MIN].value; - unsigned int upper_bound = p->uclamp_req[UCLAMP_MAX].value; + unsigned int lower_bound, upper_bound; + + /* Do not check uclamp attributes values in reset case. */ + if (attr->sched_flags & SCHED_FLAG_UTIL_CLAMP_RESET) + return 0; + + lower_bound = p->uclamp_req[UCLAMP_MIN].value; + upper_bound = p->uclamp_req[UCLAMP_MAX].value; if (attr->sched_flags & SCHED_FLAG_UTIL_CLAMP_MIN) lower_bound = attr->sched_util_min; Otherwise a bogus sa.sched_util_min or sa.sched_util_max with SCHED_FLAG_UTIL_CLAMP_RESET could return -EINVAL. >>> FWIW I still have concerns about this approach. We're doing a reset to >>> control >>> cgroup behavior, I don't see any correlation between the two. Besides the >>> difference between RESET and setting uclamp_min=0 without RESET is not >>> obvious >>> nor intuitive for someone who didn't look at the code. >>> >>> I propose something like the below which is more explicit about what is >>> being >>> requested and delivered here. And if we decide to deprecate this behavior, >>> it'd be much easier to just ignore this flag. >>> >>> You must set this flag with your uclamp request to retain the cgroup >>> inheritance behavior. If the flag is not set, we automatically clear it. >> >> I think this behavior may not meet android requirement. Becasue in >> android there is group like top-app. And we want to boost the >> group by setting group uclamp_min. If group inheritance is explicit, we >> need to set this flag for all the tasks in top-app. This might be >> costly. > > You will not have to set it for every task. It's on by default like it works > now. This behavior doesn't change. > > But if you change the uclamp value of a task but still want it to continue to > inherit the cgroup values if it's attached to one, you must set this flag when > changing the uclamp value. I'm not really fond of this idea because: (1) explicit cgroup(-behavior) related settings through a per-task user interface. (2) uclamp reset makes already sense in the !CONFIG_UCLAMP_TASK_GROUP case. A task can reset its uclamp values here as well, and then 'inheriting' the system defaults again. Already mentioned in https://lkml.kernel.org/r/87362ihxvw.derkl...@matbug.net [...]