[PATCH 1/1] arm64: Accelerate Adler32 using arm64 SVE instructions.
Li Qiang
liqiang64 at huawei.com
Thu Nov 5 04:05:53 EST 2020
在 2020/11/5 15:51, Ard Biesheuvel 写道:
> Note that NEON intrinsics can be compiled for 32-bit ARM as well (with
> a bit of care - please refer to lib/raid6/recov_neon_inner.c for an
> example of how to deal with intrinsics that are only available on
> arm64) and are less error prone, so intrinsics should be preferred if
> feasible.
>
> However, you have still not explained how optimizing Adler32 makes a
> difference for a real-world use case. Where is libdeflate used on a
> hot path?
> .
Sorry :(, I have not specifically searched for the use of this algorithm
in the kernel.
When I used perf to test the performance of the libz library before,
I saw that the adler32 algorithm occupies a lot of hot spots.I just
saw this algorithm used in the kernel code, so I think optimizing this
algorithm may have some positive optimization effects on the kernel.:)
--
Best regards,
Li Qiang
More information about the linux-arm-kernel
mailing list