[RFC PATCH v2 24/41] arm64/sve: Discard SVE state on system call
Mark Rutland
mark.rutland at arm.com
Wed Mar 22 10:03:30 PDT 2017
On Wed, Mar 22, 2017 at 02:50:54PM +0000, Dave Martin wrote:
> The base procedure call standard for the Scalable Vector Extension
> defines all of the SVE programmer's model state (Z0-31, P0-15, FFR)
> as caller-save, except for that subset of the state that aliases
> FPSIMD state.
>
> System calls from userspace will almost always be made through C
> library wrappers -- as a consequence of the PCS there will thus
> rarely if ever be any live SVE state at syscall entry in practice.
>
> This gives us an opportinity to make SVE explicitly caller-save
> around SVC and so stop carrying around the SVE state for tasks that
> use SVE only occasionally (say, by calling a library).
>
> Note that FPSIMD state will still be preserved around SVC.
>
> As a crude heuristic to avoid pathological cases where a thread
> that uses SVE frequently has to fault back into the kernel again to
> re-enable SVE after a syscall, we switch the thread back to
> FPSIMD-only context tracking only if the context is actually
> switched out before returning to userspace.
>
> Signed-off-by: Dave Martin <Dave.Martin at arm.com>
> ---
> arch/arm64/kernel/fpsimd.c | 17 +++++++++++++++++
> 1 file changed, 17 insertions(+)
>
> diff --git a/arch/arm64/kernel/fpsimd.c b/arch/arm64/kernel/fpsimd.c
> index 5fb5585..8c18384 100644
> --- a/arch/arm64/kernel/fpsimd.c
> +++ b/arch/arm64/kernel/fpsimd.c
> @@ -250,6 +250,23 @@ static void task_fpsimd_save(struct task_struct *task)
> BUG_ON(task != current);
>
> if (IS_ENABLED(CONFIG_ARM64_SVE) &&
> + task_pt_regs(task)->syscallno != ~0UL &&
> + test_tsk_thread_flag(task, TIF_SVE)) {
> + unsigned long tmp;
> +
> + clear_tsk_thread_flag(task, TIF_SVE);
> +
> + /* Trap if the task tries to use SVE again: */
> + asm volatile (
> + "mrs %[tmp], cpacr_el1\n\t"
> + "bic %[tmp], %[tmp], %[mask]\n\t"
> + "msr cpacr_el1, %[tmp]"
> + : [tmp] "=r" (tmp)
> + : [mask] "i" (CPACR_EL1_ZEN_EL0EN)
> + );
Given we're poking this bit in a few places, I think it would make more
sense to add enable/disable helpers. Those can also subsume the lazy
writeback used for the context switch, e.g.
static inline void sve_el0_enable(void)
}
unsigned long cpacr = read_sysreg(cpacr_el1);
if ((cpacr & CPACR_EL1_ZEN_EL0EN)
return;
cpacr |= CPACR_EL1_ZEN_EL0EN;
write_sysreg(cpacr, cpacr_el1);
}
static inline void sve_el0_disable(void)
{
unsigned long cpacr = read_sysreg(cpacr_el1);
if (!(cpacr & CPACR_EL1_ZEN_EL0EN)
return;
cpacr &= ~CPACR_EL1_ZEN_EL0EN;
write_sysreg(cpacr, cpacr_el1);
}
Thanks,
Mark.
More information about the linux-arm-kernel
mailing list