[RFC PATCH v2 2/8] arm64: Implement frame types
Madhavan T. Venkataraman
madvenka at linux.microsoft.com
Thu Mar 18 22:22:49 GMT 2021
On 3/18/21 12:40 PM, Mark Brown wrote:
> On Mon, Mar 15, 2021 at 11:57:54AM -0500, madvenka at linux.microsoft.com wrote:
>
>> To summarize, pt_regs->stackframe is used (or will be used) as a marker
>> frame in stack traces. To enable the unwinder to detect these frames, tag
>> each pt_regs->stackframe with a type. To record the type, use the unused2
>> field in struct pt_regs and rename it to frame_type. The types are:
>
> Unless I'm misreading what's going on here this is more trying to set a
> type for the stack as a whole than for a specific stack frame. I'm also
> finding this a bit confusing as the unwinder already tracks things it
> calls frame types and it handles types that aren't covered here like
> SDEI. At the very least there's a naming issue here.
>
When the unwinder gets to EL1 pt_regs->stackframe, it needs to be sure that
it is indeed a frame inside an EL1 pt_regs structure. It performs the
following checks:

    FP == pt_regs->regs[29]
    PC == pt_regs->pc
    type == EL1_FRAME

to confirm that the frame is EL1 pt_regs->stackframe.

Similarly, for EL0, the type is EL0_FRAME.

Both these frames are on the task stack. So, it is not a stack type.

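To make the three checks concrete, here is a minimal userspace sketch. It is
not the patch code: struct mock_pt_regs, is_el1_pt_regs_frame() and the enum
values are hypothetical stand-ins. Only regs[29], pc, stackframe and
frame_type (the renamed unused2) come from the description above.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical tag values; the real values are whatever the patch defines. */
enum frame_type { EL0_FRAME = 1, EL1_FRAME = 2 };

/* Simplified stand-in for struct pt_regs, reduced to the fields used here. */
struct mock_pt_regs {
    uint64_t regs[31];      /* x0..x30; regs[29] is the frame pointer */
    uint64_t pc;            /* PC at the time of the exception */
    uint64_t frame_type;    /* the renamed unused2 field */
    uint64_t stackframe[2]; /* FP/PC pair the unwinder walks through */
};

/*
 * fp and pc are the values the unwinder read from the frame record it
 * just reached. The frame is accepted as EL1 pt_regs->stackframe only if
 * all three checks pass.
 */
static bool is_el1_pt_regs_frame(const struct mock_pt_regs *regs,
                                 uint64_t fp, uint64_t pc)
{
    return fp == regs->regs[29] &&
           pc == regs->pc &&
           regs->frame_type == EL1_FRAME;
}
```

The EL0 case would be the same comparison with EL0_FRAME as the expected
type.
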
> Taking a step back though do we want to be tracking this via pt_regs?
> It's reliant on us robustly finding the correct pt_regs and on having
> the things that make the stack unreliable explicitly go in and set the
> appropriate type. That seems like it will be error prone, I'd been
> expecting to do something more like using sections to filter code for
> unreliable features based on the addresses of the functions we find on
> the stack or similar. This could still go wrong of course but there's
> fewer moving pieces, and especially fewer moving pieces specific to
> reliable stack trace.
>
In that case, I suggest doing both. That is, check the type as well
as specific functions. For instance, in the EL1 pt_regs, in addition
to the above checks, check the PC against el1_sync(), el1_irq() and
el1_error(). I have suggested this in the cover letter.

If this is OK with you, we could do that. We want to make really sure that
nothing goes wrong with detecting the exception frame.

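The additional PC check could look roughly like the sketch below. It assumes
that each of el1_sync(), el1_irq() and el1_error() can be described by a
[start, end) address range. struct code_range and pc_in_ranges() are
hypothetical names, and how the kernel would actually obtain those ranges
(symbol lookup, a dedicated section, etc.) is left open.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Half-open address range [start, end) covering one function's text. */
struct code_range {
    uint64_t start;
    uint64_t end;
};

/* Return true if pc falls inside any of the n ranges. */
static bool pc_in_ranges(uint64_t pc, const struct code_range *ranges,
                         size_t n)
{
    for (size_t i = 0; i < n; i++) {
        if (pc >= ranges[i].start && pc < ranges[i].end)
            return true;
    }
    return false;
}
```

The unwinder would treat the frame as an EL1 exception frame only if this
check and the FP/PC/type checks all pass.
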
> I'm wary of tracking data that only ever gets used for the reliable
> stack trace path given that it's going to be fairly infrequently used
> and hence tested, especially things that only crop up in cases that are
> hard to provoke reliably. If there's a way to detect things that
> doesn't use special data that seems safer.
>
If you dislike the frame type, I could remove it and just do the
following checks:

    FP == pt_regs->regs[29]
    PC == pt_regs->pc
    and the address check against el1_*() functions

and similar changes for EL0 as well.

I still think that the frame type check makes it more robust.

>> EL1_FRAME
>> EL1 exception frame.
>
> We do trap into EL2 as well, the patch will track EL2 frames as EL1
> frames. Even if we can treat them the same the naming ought to be
> clear.
>
Are you referring to the ARMv8.1 Virtualization Host Extensions (VHE),
where the kernel can run at EL2? Could you elaborate? I thought that EL2
was basically for hypervisors.

Thanks.

>> FTRACE_FRAME
>> FTRACE frame.
>
> This is implemented later in the series. If using this approach I'd
> suggest pulling the change in entry-ftrace.S that sets this into this
> patch, it's easier than adding a note about this being added later and
> should help with any bisect issues.
>
OK. Good point.
Madhavan