I hope this will finally enable thread local support to work in a sane way in gcc so that we can actually use it in kernel space and get rid of all the this_cpu_xxx() macros?
And thread local RMVs primitives may actually be provided by gcc and be usable in user space so that we can write user space code with effective cpu local variable access?