Hello, The scheduling problems I reported in the thread: http://lkml.org/lkml/2007/3/3/128 are caused by the set_cyc2ns_scale() function called when the CPU speed changes. Changing the scale causes a warp in the value returned by sched_clock().
The attached patch fixes the problem by adding an offset to the cyc2ns code to smooth CPU frequency transitions. It also makes the cyc2ns parameters per-CPU as cpufreq seems to support SMP but I don't have the hardware to test. If you want a version without the per-CPU or irqsave stuff, just ask. Although it solved all my scheduler issues, it may not be fully satisfactory as for example my TSC can run at a frequency as low as 350 MHz when the CPU is idle and slowed down to the max at 798 MHz by ondemand. So in this case, sched_clock() does not return nanoseconds as it thinks it does. Hopefully this is a non-issue as the scheduler is not stressed when the CPU is idle. For me, this is needed in 2.6.21 if I want to be able to listen to music while compiling a kernel using the ondemand governor. Thanks. Signed-off-by: Guillaume Chazarain <[EMAIL PROTECTED]> --- diff -r fb83e6d92a4c arch/i386/kernel/tsc.c --- a/arch/i386/kernel/tsc.c +++ b/arch/i386/kernel/tsc.c @@ -10,6 +10,7 @@ #include <linux/jiffies.h> #include <linux/init.h> #include <linux/dmi.h> +#include <linux/percpu.h> #include <asm/delay.h> #include <asm/tsc.h> @@ -79,20 +80,57 @@ static inline int check_tsc_unstable(voi * cyc2ns_scale is limited to 10^6 * 2^10, which fits in 32 bits. * ([EMAIL PROTECTED]) * + * ns += offset to avoid sched_clock jumps with cpufreq ([EMAIL PROTECTED]) + * * [EMAIL PROTECTED] "math is hard, lets go shopping!" */ -static unsigned long cyc2ns_scale __read_mostly; - #define CYC2NS_SCALE_FACTOR 10 /* 2^10, carefully chosen */ -static inline void set_cyc2ns_scale(unsigned long cpu_khz) -{ - cyc2ns_scale = (1000000 << CYC2NS_SCALE_FACTOR)/cpu_khz; +struct cyc2ns_params { + unsigned long scale; + unsigned long long offset; +}; +DEFINE_PER_CPU(struct cyc2ns_params, cyc2ns) __read_mostly; + +static inline unsigned long long __cycles_2_ns(struct cyc2ns_params *params, + unsigned long long cyc) +{ + return ((cyc * params->scale) >> CYC2NS_SCALE_FACTOR) + params->offset; } static inline unsigned long long cycles_2_ns(unsigned long long cyc) { - return (cyc * cyc2ns_scale) >> CYC2NS_SCALE_FACTOR; + struct cyc2ns_params *params; + unsigned long flags; + unsigned long long ns; + + params = &get_cpu_var(cyc2ns); + + local_irq_save(flags); + ns = __cycles_2_ns(params, cyc); + local_irq_restore(flags); + + put_cpu_var(cyc2ns); + return ns; +} + +static void set_cyc2ns_scale(unsigned long cpu_khz) +{ + struct cyc2ns_params *params; + unsigned long flags; + unsigned long long tsc_now, ns_now; + + get_scheduled_cycles(tsc_now); + params = &get_cpu_var(cyc2ns); + + local_irq_save(flags); + ns_now = __cycles_2_ns(params, tsc_now); + + params->scale = (1000000 << CYC2NS_SCALE_FACTOR)/cpu_khz; + params->offset += ns_now - __cycles_2_ns(params, tsc_now); + local_irq_restore(flags); + + put_cpu_var(cyc2ns); } /* -- Guillaume - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/