[PATCH V4] arm64: percpu: Implement this_cpu operations
Will Deacon
will.deacon at arm.com
Mon Nov 17 02:40:11 PST 2014
Hi Steve,
On Fri, Nov 14, 2014 at 03:03:33PM +0000, Steve Capper wrote:
> The generic this_cpu operations disable interrupts to ensure that the
> requested operation is protected from pre-emption. For arm64, this is
> overkill and can hurt throughput and latency.
>
> This patch provides arm64 specific implementations for the this_cpu
> operations. Rather than disable interrupts, we use the exclusive
> monitor or atomic operations as appropriate.
>
> The following operations are implemented: add, add_return, and, or,
> read, write, xchg. We also wire up a cmpxchg implementation from
> cmpxchg.h.
>
> Testing was performed using the percpu_test module and hackbench on a
> Juno board running 3.18-rc4.
Looks good. I notice that this change drops the compiler barriers too,
which we used to get via local_irq_{enable/disable}. I *think* that's fine
(at least, I couldn't find a place that breaks due to that) but it would be
nice to know that it was deliberate :)
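To make the concern concrete, here is a rough user-space sketch of what a compiler barrier provides. It is not code from the patch: `barrier()` is defined locally as a stand-in for the kernel macro, and `counter` and `add_with_barriers()` are hypothetical names. It assumes GCC or Clang inline asm syntax, and it models only the compiler-ordering side effect of the IRQ helpers, not interrupt masking.

```c
#include <stdint.h>

/*
 * Local stand-in for the kernel's barrier(): an empty asm statement with a
 * "memory" clobber. It emits no instructions, but the compiler may not
 * cache memory values in registers or reorder memory accesses across it.
 */
#define barrier() __asm__ __volatile__("" ::: "memory")

/* Hypothetical stand-in for a per-CPU variable. */
static uint64_t counter;

/*
 * Models the ordering that the generic implementation got for free from
 * the IRQ disable/enable pair around the update.
 */
static inline uint64_t add_with_barriers(uint64_t delta)
{
	barrier();		/* accesses before this stay before the update */
	counter += delta;
	barrier();		/* accesses after this stay after the update */
	return counter;
}
```

Without the two `barrier()` calls the compiler is free to move unrelated loads and stores across the update, which is the behaviour change being asked about.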
> +static inline void __percpu_write(void *ptr, unsigned long val, int size)
> +{
> +	switch (size) {
> +	case 1:
> +		ACCESS_ONCE(*((u8 *)ptr)) = (u8) val;
> +		break;
> +	case 2:
> +		ACCESS_ONCE(*((u16 *)ptr)) = (u16) val;
> +		break;
> +	case 4:
> +		ACCESS_ONCE(*((u32 *)ptr)) = (u32) val;
> +		break;
> +	case 8:
> +		ACCESS_ONCE(*((u64 *)ptr)) = (u64) val;
> +		break;
I think you've gone a bit overboard with brackets and spacing here. Can't
you just do something like:
	ACCESS_ONCE(*(u64 *)ptr) = (u64)val;
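For illustration, the whole helper written with that spacing might look like the sketch below. It is a user-space approximation, not the code from the patch: `ACCESS_ONCE()` is redefined locally as a volatile cast (relying on the GCC/Clang `__typeof__` extension), the kernel's `u8`..`u64` types are replaced by their `<stdint.h>` equivalents, and `percpu_write_sketch()` is a hypothetical name.

```c
#include <stdint.h>

/*
 * Local stand-in for the kernel's ACCESS_ONCE(): going through a
 * volatile-qualified lvalue makes the compiler emit exactly one access
 * of the given width. Assumes GCC or Clang for __typeof__.
 */
#define ACCESS_ONCE(x) (*(volatile __typeof__(x) *)&(x))

/*
 * Store the low `size` bytes of `val` to `ptr` with a single access.
 * `ptr` must point to an object of the matching width and alignment.
 */
static inline void percpu_write_sketch(void *ptr, unsigned long val, int size)
{
	switch (size) {
	case 1:
		ACCESS_ONCE(*(uint8_t *)ptr) = (uint8_t)val;
		break;
	case 2:
		ACCESS_ONCE(*(uint16_t *)ptr) = (uint16_t)val;
		break;
	case 4:
		ACCESS_ONCE(*(uint32_t *)ptr) = (uint32_t)val;
		break;
	case 8:
		ACCESS_ONCE(*(uint64_t *)ptr) = (uint64_t)val;
		break;
	default:
		/* Unsupported width: do nothing in this sketch. */
		break;
	}
}
```

A caller would pass the address and `sizeof` of the target, for example `percpu_write_sketch(&slot, 42, sizeof(slot))` for a `uint32_t slot`.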
Anyway:
Reviewed-by: Will Deacon <will.deacon at arm.com>
Will