[PATCH] crypto: arm64/sha256 - fix build when CONFIG_PREEMPT_VOLUNTARY=y

Ard Biesheuvel ardb at kernel.org
Thu May 8 09:16:40 PDT 2025


On Thu, 8 May 2025 at 18:05, Eric Biggers <ebiggers at kernel.org> wrote:
>
> On Thu, May 08, 2025 at 01:12:28PM +0200, Ard Biesheuvel wrote:
> > On Wed, 7 May 2025 at 19:09, Eric Biggers <ebiggers at kernel.org> wrote:
> > >
> > > From: Eric Biggers <ebiggers at google.com>
> > >
> > > Fix the build of sha256-ce.S when CONFIG_PREEMPT_VOLUNTARY=y by passing
> > > the correct label to the cond_yield macro.  Also adjust the code to
> > > execute only one branch instruction when CONFIG_PREEMPT_VOLUNTARY=n.
> > >
> > > Fixes: 6e36be511d28 ("crypto: arm64/sha256 - implement library instead of shash")
> > > Reported-by: kernel test robot <lkp at intel.com>
> > > Closes: https://lore.kernel.org/oe-kbuild-all/202505071811.yYpLUbav-lkp@intel.com/
> > > Signed-off-by: Eric Biggers <ebiggers at google.com>
> > > ---
> > >  arch/arm64/lib/crypto/sha256-ce.S | 7 ++++---
> > >  1 file changed, 4 insertions(+), 3 deletions(-)
> > >
> > > diff --git a/arch/arm64/lib/crypto/sha256-ce.S b/arch/arm64/lib/crypto/sha256-ce.S
> > > index a8461d6dad634..f3e21c6d87d2e 100644
> > > --- a/arch/arm64/lib/crypto/sha256-ce.S
> > > +++ b/arch/arm64/lib/crypto/sha256-ce.S
> > > @@ -121,14 +121,15 @@ CPU_LE(   rev32           v19.16b, v19.16b        )
> > >
> > >         /* update state */
> > >         add             dgav.4s, dgav.4s, dg0v.4s
> > >         add             dgbv.4s, dgbv.4s, dg1v.4s
> > >
> > > +       /* return early if voluntary preemption is needed */
> > > +       cond_yield      1f, x5, x6
> > > +
> >
> > This will yield needlessly when the condition hits during the final iteration.
> >
> > >         /* handled all input blocks? */
> > > -       cbz             x2, 1f
> > > -       cond_yield      3f, x5, x6
> > > -       b               0b
> > > +       cbnz            x2, 0b
>
> cond_yield doesn't actually yield, though.  It just checks whether yielding is
> needed.  So the behavior is the same: on the last iteration this function
> returns 0 (i.e. 0 blocks remaining), regardless of whether it gets to the end by
> jumping there due to TSK_TI_PREEMPT being set or by falling through after seeing
> nblocks==0.  We could keep the nblocks==0 check first, but the cond_yield check
> is lightweight and it's probably better to avoid the extra branch instruction on
> every other iteration.
>

Ah yes, you're right.

Hopefully we'll soon be able to get rid of it entirely.
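
For reference, the equivalence discussed above can be sketched as a small
Python model of the two loop orderings. This is not kernel code. It assumes
the assembly loop decrements the block count once per block and returns the
number of blocks remaining, as described in the quoted reply. The function
names and the resched_needed callback (standing in for the TSK_TI_PREEMPT
check done by cond_yield) are hypothetical.

```python
from typing import Callable


def remaining_old(nblocks: int, resched_needed: Callable[[int], bool]) -> int:
    """Old ordering: test nblocks == 0 first, then the cond_yield check."""
    done = 0
    while True:
        done += 1              # process one block
        nblocks -= 1
        if nblocks == 0:       # models: cbz x2, 1f
            break
        if resched_needed(done):   # models: cond_yield
            break
        # models: b 0b (loop again)
    return nblocks


def remaining_new(nblocks: int, resched_needed: Callable[[int], bool]) -> int:
    """New ordering: cond_yield check first, then loop while nblocks != 0."""
    done = 0
    while True:
        done += 1              # process one block
        nblocks -= 1
        if resched_needed(done):   # models: cond_yield 1f, x5, x6
            break
        if nblocks == 0:       # models: cbnz x2, 0b falling through
            break
    return nblocks


if __name__ == "__main__":
    # Compare both orderings for every point at which the flag could be set.
    for n in range(1, 6):
        for flag_at in range(0, 7):  # 0 means the flag is never set
            check = lambda done, k=flag_at: done == k
            assert remaining_old(n, check) == remaining_new(n, check)
    print("both orderings return the same remaining block count")
```

In this model, a flag that becomes set during the final iteration still
yields a return value of 0, so the caller sees no difference between the
two orderings.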
