[PATCH v12 1/5] asm-generic: Improve csum_fold
Arnd Bergmann
arnd at arndb.de
Tue Dec 12 23:52:01 PST 2023
On Wed, Dec 13, 2023, at 02:18, Charlie Jenkins wrote:
> This csum_fold implementation introduced into arch/arc by Vineet Gupta
> is better than the default implementation on at least arc, x86, and
> riscv. Using GCC trunk and compiling a non-inlined version with -O3
> optimization, this implementation has 41.6667% fewer instructions on
> riscv64 and 25% fewer on x86-64. Most implementations override this
> default in asm, but this should be more performant than all of those
> other implementations except for arm, which has barrel shifting, and
> sparc32, which has a carry flag.
>
> Signed-off-by: Charlie Jenkins <charlie at rivosinc.com>
Acked-by: Arnd Bergmann <arnd at arndb.de>
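
For readers of the archive, here is a minimal sketch of the two folding
strategies the commit message compares. The names fold_classic,
fold_rotate and check_folds_agree are hypothetical, and plain
uint32_t/uint16_t stand in for the kernel's __wsum/__sum16 types.
fold_rotate is modeled on the rotate-and-subtract approach as I
understand it from arch/arc; treat it as an illustration under those
assumptions, not as the code from the patch itself.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical name: two-step fold with end-around carry, then invert. */
static inline uint16_t fold_classic(uint32_t sum)
{
	sum = (sum & 0xffff) + (sum >> 16);	/* may carry into bit 16 */
	sum = (sum & 0xffff) + (sum >> 16);	/* fold that carry back in */
	return (uint16_t)~sum;
}

/* Hypothetical name: rotate-and-subtract fold (assumed arc-style). */
static inline uint16_t fold_rotate(uint32_t sum)
{
	uint32_t rot = (sum << 16) | (sum >> 16);	/* rotate by 16 bits */

	/* The upper 16 bits of the difference hold the folded result. */
	return (uint16_t)((~sum - rot) >> 16);
}

/* Hypothetical helper: spot-check that both variants agree. */
static inline void check_folds_agree(void)
{
	static const uint32_t samples[] = {
		0x00000000u, 0x00000001u, 0x0000ffffu, 0x00010000u,
		0x0001ffffu, 0x12345678u, 0xffff0000u, 0xffffffffu,
	};
	unsigned int i;

	for (i = 0; i < sizeof(samples) / sizeof(samples[0]); i++)
		assert(fold_classic(samples[i]) == fold_rotate(samples[i]));
}
```

The rotate variant needs no masking and no second carry fold, which is
presumably where the instruction-count savings come from on targets
that lack a carry flag or barrel shifter.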