[PATCH] arm64: Add ARM64_HAS_LSE2 CPU capability

Mark Rutland mark.rutland at arm.com
Fri Sep 6 04:09:15 PDT 2024


On Fri, Sep 06, 2024 at 06:58:19PM +0800, tiantao (H) wrote:
> 
> 在 2024/9/6 17:44, Mark Rutland 写道:
> > On Fri, Sep 06, 2024 at 05:08:12PM +0800, Tian Tao wrote:
> > > When FEAT_LSE2 is implemented and Bit 6 of sctlr_elx is nAA, the
> > > full name of the Not-aligned access. nAA bit has two values:
> > > 0b0 Unaligned accesses by the specified instructions generate an
> > > Alignment fault.
> > > 0b1 Unaligned accesses by the specified instructions do not generate
> > > an Alignment fault.
> > > 
> > > this patch sets the nAA bit to 1,The following instructions will not
> > > generate an Alignment fault if all bytes being accessed are not within
> > > a single 16-byte quantity:
> > > • LDAPR, LDAPRH, LDAPUR, LDAPURH, LDAPURSH, LDAPURSW, LDAR, LDARH,LDLAR,
> > > LDLARH.
> > > • STLLR, STLLRH, STLR, STLRH, STLUR, and STLURH
> > > 
> > > Signed-off-by: Tian Tao <tiantao6 at hisilicon.com>
> > What is going to depend on this? Nothing in the kernel depends on being
> > able to make unaligned accesses with these instructions, and (since you
> > haven't added a HWCAP), userspace has no idea that these accesses won't
> > generate an alignment fault.
> > 
> > Mark.
> 
> I've come across a situation where the simplified code is as follows:
> 
>  long  address = (long) mmap(NULL,1024*1024*2,PROT_READ|PROT_WRITE,
> MAP_PRIVATE|MAP_ANONYMOUS,-1,0);
> 
> long new_address = address + 9;
> 
>  long *p = (long*) new_address;
>  long v = -1;
> 
>  __atomic_store(p, &v, __ATOMIC_RELEASE);
> 
> coredump occurs after executing __atomic_store, but the user code can't be
> changed,

Where is that code and why can't it be changed? That code has never
worked and would always have generated a coredump, and even with this
patch it cannot work on hardware without LSE2.

> so I'm trying to enable NAA to solve this problem.

AFAICT that's making a kernel change to paper over a userspace bug.
As-is I don't think that's a good justification for changing nAA.

Mark.

> 
> > > ---
> > >   arch/arm64/Kconfig              | 10 ++++++++++
> > >   arch/arm64/include/asm/sysreg.h |  1 +
> > >   arch/arm64/kernel/cpufeature.c  | 18 ++++++++++++++++++
> > >   arch/arm64/tools/cpucaps        |  1 +
> > >   4 files changed, 30 insertions(+)
> > > 
> > > diff --git a/arch/arm64/Kconfig b/arch/arm64/Kconfig
> > > index 77d7ef0b16c2..7afe73ebcd79 100644
> > > --- a/arch/arm64/Kconfig
> > > +++ b/arch/arm64/Kconfig
> > > @@ -2023,6 +2023,16 @@ config ARM64_TLB_RANGE
> > >   	  The feature introduces new assembly instructions, and they were
> > >   	  support when binutils >= 2.30.
> > > +config ARM64_LSE2_NAA
> > > +	bool "Enable support for not-aligned access"
> > > +	depends on AS_HAS_ARMV8_4
> > > +	help
> > > +	 LSE2 is an extension to the original LSE (Large System Extensions) feature,
> > > +	 introduced in ARMv8.4.
> > > +
> > > +	 Enable this feature will not generate an Alignment fault if all bytes being
> > > +	 accessed are not within a single 16-byte quantity.
> > > +
> > >   endmenu # "ARMv8.4 architectural features"
> > >   menu "ARMv8.5 architectural features"
> > > diff --git a/arch/arm64/include/asm/sysreg.h b/arch/arm64/include/asm/sysreg.h
> > > index 8cced8aa75a9..42e3a1959aa8 100644
> > > --- a/arch/arm64/include/asm/sysreg.h
> > > +++ b/arch/arm64/include/asm/sysreg.h
> > > @@ -854,6 +854,7 @@
> > >   #define SCTLR_ELx_ENDB	 (BIT(13))
> > >   #define SCTLR_ELx_I	 (BIT(12))
> > >   #define SCTLR_ELx_EOS	 (BIT(11))
> > > +#define SCTLR_ELx_nAA    (BIT(6))
> > >   #define SCTLR_ELx_SA	 (BIT(3))
> > >   #define SCTLR_ELx_C	 (BIT(2))
> > >   #define SCTLR_ELx_A	 (BIT(1))
> > > diff --git a/arch/arm64/kernel/cpufeature.c b/arch/arm64/kernel/cpufeature.c
> > > index 646ecd3069fd..558869a7c7f0 100644
> > > --- a/arch/arm64/kernel/cpufeature.c
> > > +++ b/arch/arm64/kernel/cpufeature.c
> > > @@ -2299,6 +2299,14 @@ static void cpu_enable_mte(struct arm64_cpu_capabilities const *cap)
> > >   }
> > >   #endif /* CONFIG_ARM64_MTE */
> > > +#ifdef CONFIG_ARM64_LSE2_NAA
> > > +static void cpu_enable_lse2(const struct arm64_cpu_capabilities *__unused)
> > > +{
> > > +	sysreg_clear_set(sctlr_el2, SCTLR_ELx_nAA, SCTLR_ELx_nAA);
> > > +	isb();
> > > +}
> > > +#endif
> > > +
> > >   static void user_feature_fixup(void)
> > >   {
> > >   	if (cpus_have_cap(ARM64_WORKAROUND_2658417)) {
> > > @@ -2427,6 +2435,16 @@ static const struct arm64_cpu_capabilities arm64_features[] = {
> > >   		ARM64_CPUID_FIELDS(ID_AA64ISAR0_EL1, ATOMIC, IMP)
> > >   	},
> > >   #endif /* CONFIG_ARM64_LSE_ATOMICS */
> > > +#ifdef CONFIG_ARM64_LSE2_NAA
> > > +	{
> > > +		.desc = "Support for not-aligned access",
> > > +		.capability = ARM64_HAS_LSE2,
> > > +		.type = ARM64_CPUCAP_SYSTEM_FEATURE,
> > > +		.matches = has_cpuid_feature,
> > > +		.cpu_enable = cpu_enable_lse2,
> > > +		ARM64_CPUID_FIELDS(ID_AA64MMFR2_EL1, AT, IMP)
> > > +	},
> > > +#endif
> > >   	{
> > >   		.desc = "Virtualization Host Extensions",
> > >   		.capability = ARM64_HAS_VIRT_HOST_EXTN,
> > > diff --git a/arch/arm64/tools/cpucaps b/arch/arm64/tools/cpucaps
> > > index ac3429d892b9..0c7c0a293574 100644
> > > --- a/arch/arm64/tools/cpucaps
> > > +++ b/arch/arm64/tools/cpucaps
> > > @@ -41,6 +41,7 @@ HAS_HCX
> > >   HAS_LDAPR
> > >   HAS_LPA2
> > >   HAS_LSE_ATOMICS
> > > +HAS_LSE2
> > >   HAS_MOPS
> > >   HAS_NESTED_VIRT
> > >   HAS_PAN
> > > -- 
> > > 2.33.0
> > > 
> > > 
> > .
> > 



More information about the linux-arm-kernel mailing list