[PATCH v2 0/4] riscv: Add Zalasr ISA extension support

Guo Ren guoren at kernel.org
Fri Sep 19 22:59:56 PDT 2025


On Fri, Sep 19, 2025 at 5:24 AM Andrea Parri <parri.andrea at gmail.com> wrote:
>
> [merging replies]
>
> > > I prefer option c) at first, it has fewer modification and influence.
> > Another reason is that store-release-to-load-acquire would give out a
> > FENCE rw, rw according to RVWMO PPO 7th rule instead of FENCE.TSO, which
> > is stricter than the Linux requirement you've mentioned.
>
> I mean, if "fewer modification" and "not-a-full-fence" were the only
> arguments, we would probably just stick with the current scheme (b),
> right?  What other arguments are available?  Don't get me wrong: no a
> priori objection from my end; I was really just wondering about the
> various interests/rationales in the RISC-V kernel community.  (It may
> surprise you, but some communities did consider that "UNLOCK+LOCK is
> not a full memory barrier" a disadvantage, because briefly "locking
> should provide strong ordering guarantees and be easy to reason about";
> in fact, not just "locking" if we consider x86 or arm64...)

The ld.aq is really faster than the "ld + fence r, rw" in microarch. I
don't care about the performance of the "UNLOCK+LOCK" scenario.

>
>
> > > asm volatile(ALTERNATIVE("fence rw, w;\t\nsb %0, 0(%1)\t\n",        \
> > > -                     SB_RL(%0, %1) "\t\nnop\t\n",          \
> > > +                     SB_RL(%0, %1) "\t\n fence.tso;\t\n",  \
> > >                       0, RISCV_ISA_EXT_ZALASR, 1)           \
> > >                       : : "r" (v), "r" (p) : "memory");     \
>
> nit: Why placing the fence after the store?  I imagine that FENCE.TSO
> could precede the store, this way, the store would actually not need
> to have that .RL annotation.  More importantly,
Yes, fence.tso is stricter than fence rw, w, it gives an additional
fence r, r barrier.

>
> That for (part of) smp_store_release().  Let me stress that my option
> (c) was meant to include similar changes for _every releases (that is,
> cmpxchg_release(), atomic_inc_return_release(), and many other), even
> if most of such releases do not currently create "problematic pairs"
> with a corresponding acquire: the concern is that future changes in the
> RISC-V atomics implementation or in generic locking code will introduce
> pairs of the form FENCE RW,W + .AQ or .RL + FENCE R,RW without anyone
> noticing...  In other words, I was suggesting that RISC-V _continues
> to meet the ordering property under discussion "by design" rather than
> "by Andrea or whoever's code auditing/review" (assuming it's feasible,
> i.e. that it doesn't clash with other people's arguments?); options (a)
> and (b) were also "designed" following this same criterion.
>
>   Andrea
--
Best Regards
 Guo Ren



More information about the linux-riscv mailing list