On Tue, Sep 08, 2015 at 05:53:31PM +0100, Morten Rasmussen wrote:
> On Tue, Sep 08, 2015 at 03:31:58PM +0100, Morten Rasmussen wrote:

> > On Tue, Sep 08, 2015 at 02:52:05PM +0200, Peter Zijlstra wrote:
> > But if we apply the scaling to the weight instead of time, we would only
> > have to apply it once and not three times like it is now? So maybe we
> > can end up with almost the same number of multiplications.
> > 
> > We might be loosing bits for low priority task running on cpus at a low
> > frequency though.
> 
> Something like the below. We should be saving one multiplication.

> @@ -2577,8 +2575,13 @@ __update_load_avg(u64 now, int cpu, struct sched_avg 
> *sa,
>               return 0;
>       sa->last_update_time = now;
>  
> -     scale_freq = arch_scale_freq_capacity(NULL, cpu);
> -     scale_cpu = arch_scale_cpu_capacity(NULL, cpu);
> +     if (weight || running)
> +             scale_freq = arch_scale_freq_capacity(NULL, cpu);
> +     if (weight)
> +             scaled_weight = weight * scale_freq >> SCHED_CAPACITY_SHIFT;
> +     if (running)
> +             scale_freq_cpu = scale_freq * arch_scale_cpu_capacity(NULL, cpu)
> +                                                     >> SCHED_CAPACITY_SHIFT;
>  
>       /* delta_w is the amount already accumulated against our next period */
>       delta_w = sa->period_contrib;
> @@ -2594,16 +2597,15 @@ __update_load_avg(u64 now, int cpu, struct sched_avg 
> *sa,
>                * period and accrue it.
>                */
>               delta_w = 1024 - delta_w;
> -             scaled_delta_w = cap_scale(delta_w, scale_freq);
>               if (weight) {
> -                     sa->load_sum += weight * scaled_delta_w;
> +                     sa->load_sum += scaled_weight * delta_w;
>                       if (cfs_rq) {
>                               cfs_rq->runnable_load_sum +=
> -                                             weight * scaled_delta_w;
> +                                             scaled_weight * delta_w;
>                       }
>               }
>               if (running)
> -                     sa->util_sum += scaled_delta_w * scale_cpu;
> +                     sa->util_sum += delta_w * scale_freq_cpu;
>  
>               delta -= delta_w;
>  

Sadly that makes the code worse; I get 14 mul instructions where
previously I had 11.

What happens is that GCC gets confused and cannot constant propagate the
new variables, so what used to be shifts now end up being actual
multiplications.

With this, I get back to 11. Can you see what happens on ARM where you
have both functions defined to non constants?

---
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -2551,10 +2551,10 @@ static __always_inline int
 __update_load_avg(u64 now, int cpu, struct sched_avg *sa,
                  unsigned long weight, int running, struct cfs_rq *cfs_rq)
 {
+       unsigned long scaled_weight, scale_freq, scale_freq_cpu;
+       unsigned int delta_w, decayed = 0;
        u64 delta, periods;
        u32 contrib;
-       unsigned int delta_w, decayed = 0;
-       unsigned long scaled_weight = 0, scale_freq, scale_freq_cpu = 0;
 
        delta = now - sa->last_update_time;
        /*
@@ -2575,13 +2575,10 @@ __update_load_avg(u64 now, int cpu, stru
                return 0;
        sa->last_update_time = now;
 
-       if (weight || running)
-               scale_freq = arch_scale_freq_capacity(NULL, cpu);
-       if (weight)
-               scaled_weight = weight * scale_freq >> SCHED_CAPACITY_SHIFT;
-       if (running)
-               scale_freq_cpu = scale_freq * arch_scale_cpu_capacity(NULL, cpu)
-                                                       >> SCHED_CAPACITY_SHIFT;
+       scale_freq = arch_scale_freq_capacity(NULL, cpu);
+
+       scaled_weight = weight * scale_freq >> SCHED_CAPACITY_SHIFT;
+       scale_freq_cpu = scale_freq * arch_scale_cpu_capacity(NULL, cpu) >> 
SCHED_CAPACITY_SHIFT;
 
        /* delta_w is the amount already accumulated against our next period */
        delta_w = sa->period_contrib;
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to