On Thu, Jul 12, 2018 at 11:29:38AM +0800, Yandong.Zhao wrote:
> From: Yandong Zhao <yandong77...@gmail.com>
> 
> It does not matter if the caller of may_use_simd() migrates to
> another cpu after the call, but it is still important that the
> kernel_neon_busy percpu instance that is read matches the cpu the
> task is running on at the time of the read.
> 
> This means that raw_cpu_read() is not sufficient. kernel_neon_busy
> may appear true if the caller migrates during the execution of
> raw_cpu_read() and the next task to be scheduled in on the initial
> cpu calls kernel_neon_begin().
> 
> This patch replaces raw_cpu_read() with this_cpu_read() to protect
> against this race.
> 
> Fixes: cb84d11e1625 ("arm64: neon: Remove support for nested or hardirq 
> kernel-mode NEON")
> Acked-by: Ard Biesheuvel <ard.biesheu...@linaro.org>
> Reviewed-by: Dave Martin <dave.mar...@arm.com>
> Reviewed-by: Mark Rutland <mark.rutl...@arm..com>
> Signed-off-by: Yandong Zhao <yandong77...@gmail.com>
> ---
>  arch/arm64/include/asm/simd.h | 19 +++++++------------
>  1 file changed, 7 insertions(+), 12 deletions(-)
> 
> diff --git a/arch/arm64/include/asm/simd.h b/arch/arm64/include/asm/simd.h
> index fa8b3fe..6495cc5 100644
> --- a/arch/arm64/include/asm/simd.h
> +++ b/arch/arm64/include/asm/simd.h
> @@ -29,20 +29,15 @@
>  static __must_check inline bool may_use_simd(void)
>  {
>       /*
> -      * The raw_cpu_read() is racy if called with preemption enabled.
> -      * This is not a bug: kernel_neon_busy is only set when
> -      * preemption is disabled, so we cannot migrate to another CPU
> -      * while it is set, nor can we migrate to a CPU where it is set.
> -      * So, if we find it clear on some CPU then we're guaranteed to
> -      * find it clear on any CPU we could migrate to.
> -      *
> -      * If we are in between kernel_neon_begin()...kernel_neon_end(),
> -      * the flag will be set, but preemption is also disabled, so we
> -      * can't migrate to another CPU and spuriously see it become
> -      * false.
> +      * kernel_neon_busy is only set while preemption is disabled,
> +      * and is clear whenever preemption is enabled. Since
> +      * this_cpu_read() is atomic w.r.t. preemption, kernel_neon_busy
> +      * cannot change under our feet -- if it's set we cannot be
> +      * migrated, and if it's clear we cannot be migrated to a CPU
> +      * where it is set.
>        */

This new explanation looks fine to me.

[...]

Cheers
---Dave

Reply via email to