On 2019/7/26 23:21, Julien Desfossez wrote:
> On 25-Jul-2019 10:30:03 PM, Aaron Lu wrote:
>>
>> I tried a different approach based on vruntime with 3 patches following.
> [...]
> 
> We have experimented with this new patchset and indeed the fairness is
> now much better. Interactive tasks with v3 were complete starving when
> there were cpu-intensive tasks running, now they can run consistently.

Yeah, the fairness is much better now.

For two cgroups created, I limited the cpuset to be one core(two siblings)
of these two cgroups, I still run gemmbench and sysbench-mysql, and here
is the mysql result:

Latency:
.----------------------------------------------------------------------------------------------.
|NA/AVX vanilla-SMT     [std% / sem%]     cpu% |coresched-SMT   [std% / sem%]   
  +/-     cpu% |
|----------------------------------------------------------------------------------------------|
|  1/1          6.7     [13.8%/ 1.4%]     2.1% |          6.4   [14.6%/ 1.5%]   
  4.0%    2.0% |
|  2/2          9.1     [ 5.0%/ 0.5%]     4.0% |         11.4   [ 6.8%/ 0.7%]   
-24.9%    3.9% |
'----------------------------------------------------------------------------------------------'

Throughput:
.----------------------------------------------------------------------------------------------.
|NA/AVX vanilla-SMT     [std% / sem%]     cpu% |coresched-SMT   [std% / sem%]   
  +/-     cpu% |
|----------------------------------------------------------------------------------------------|
|  1/1        310.2     [ 4.1%/ 0.4%]     2.1% |        296.2   [ 5.0%/ 0.5%]   
 -4.5%    2.0% | 
|  2/2        547.7     [ 3.6%/ 0.4%]     4.0% |        368.3   [ 4.8%/ 0.5%]   
-32.8%    3.9% | 
'----------------------------------------------------------------------------------------------'

Note: 2/2 case means 4 threads run on one core, which is overloaded.(cpu% is 
overall system report)

Though the latency/throughput has regressions, but standard deviation is much 
better now.

> With my initial test of TPC-C running in large VMs with a lot of
> background noise VMs, the results are pretty similar to v3, I will run
> more thorough tests and report the results back here.

I see something similar. I guess task placement could be another problem.
We don't check cookie matching in load balance and task wakeup, so
- if tasks with different cookie happen to be dispatched onto different cores,
The result should be good
- if tasks with different cookie are unfortunately dispatched onto the same
core, the result should be bad.

This problem is bypassed in my testing setup above, but may be one cause
of my other scenarios, need a while to sort out.

Thanks,
-Aubrey

Reply via email to