[PATCH v3] arm64: Implement optimised IP checksum helpers
Catalin Marinas
catalin.marinas at arm.com
Fri Jun 17 10:02:38 PDT 2016
On Tue, May 31, 2016 at 06:04:40PM +0100, Robin Murphy wrote:
> AArch64 is capable of 128-bit memory accesses without alignment
> restrictions, which makes it both possible and highly practical to slurp
> up a typical 20-byte IP header in just 2 loads. Implement our own
> version of ip_fast_checksum() to take advantage of that, resulting in
> considerably fewer instructions and memory accesses than the generic
> version. We can also get more optimal code generation for csum_fold() by
> defining it a slightly different way round from the generic version, so
> throw that into the mix too.
>
> Suggested-by: Luke Starrett <luke.starrett at broadcom.com>
> Acked-by: Luke Starrett <luke.starrett at broadcom.com>
> Signed-off-by: Robin Murphy <robin.murphy at arm.com>
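
For readers following along, the approach described in the quoted commit message can be sketched in portable userspace C. This is only an illustrative sketch, not the patch itself: the names sketch_csum_fold() and sketch_ip_header_csum() are hypothetical, memcpy() stands in for the unaligned loads, and two 64-bit loads stand in for the single 128-bit access mentioned above. It assumes ihl is the header length in 32-bit words and is at least 5.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: fold a 32-bit ones' complement partial sum to
 * 16 bits and invert it. Adding the value to itself rotated by 16 bits
 * leaves the end-around-carry sum of both halves in the top 16 bits. */
static uint16_t sketch_csum_fold(uint32_t sum)
{
    sum += (sum >> 16) | (sum << 16);
    return (uint16_t)~(sum >> 16);
}

/* Hypothetical helper: checksum an IPv4 header of ihl 32-bit words
 * (assumed ihl >= 5). The first 16 bytes are read as two 64-bit values,
 * the remaining words (one for a plain 20-byte header) as 32-bit values. */
static uint16_t sketch_ip_header_csum(const void *iph, unsigned int ihl)
{
    const unsigned char *p = iph;
    uint64_t lo, hi, sum;
    uint32_t word;

    memcpy(&lo, p, 8);        /* memcpy avoids alignment assumptions */
    memcpy(&hi, p + 8, 8);
    sum = lo + hi;
    sum += (sum < lo);        /* end-around carry of the 64-bit add */
    p += 16;

    for (ihl -= 4; ihl > 0; ihl--) {
        memcpy(&word, p, 4);
        sum += word;
        sum += (sum < word);  /* end-around carry again */
        p += 4;
    }

    /* Fold 64 bits to 32 with the same rotate-and-add trick, then to 16. */
    sum += (sum >> 32) | (sum << 32);
    return sketch_csum_fold((uint32_t)(sum >> 32));
}
```

The result is in the same byte order as the header in memory, so it can be copied straight into the checksum field. Running the sketch over a header whose checksum field is already correct should give 0.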
I have now applied the correct version. Thanks for pointing that out.
--
Catalin