[PATCH v2] KVM: arm64: Initialize VCPU mdcr_el2 before loading it
Marc Zyngier
maz at kernel.org
Tue Mar 30 20:57:21 BST 2021
On Tue, 30 Mar 2021 18:49:54 +0100,
Alexandru Elisei <alexandru.elisei at arm.com> wrote:
>
> Hi Marc,
>
> On 3/30/21 6:13 PM, Alexandru Elisei wrote:
> > [..]
> >>> +}
> >>> +
> >>> /**
> >>> * kvm_arm_reset_debug_ptr - reset the debug ptr to point to the vcpu state
> >>> */
> >>> @@ -83,12 +137,7 @@ void kvm_arm_reset_debug_ptr(struct kvm_vcpu *vcpu)
> >>> * @vcpu: the vcpu pointer
> >>> *
> >>> * This is called before each entry into the hypervisor to setup any
> >>> - * debug related registers. Currently this just ensures we will trap
> >>> - * access to:
> >>> - * - Performance monitors (MDCR_EL2_TPM/MDCR_EL2_TPMCR)
> >>> - * - Debug ROM Address (MDCR_EL2_TDRA)
> >>> - * - OS related registers (MDCR_EL2_TDOSA)
> >>> - * - Statistical profiler (MDCR_EL2_TPMS/MDCR_EL2_E2PB)
> >>> + * debug related registers.
> >>> *
> >>> * Additionally, KVM only traps guest accesses to the debug registers if
> >>> * the guest is not actively using them (see the KVM_ARM64_DEBUG_DIRTY
> >>> @@ -100,27 +149,14 @@ void kvm_arm_reset_debug_ptr(struct kvm_vcpu *vcpu)
> >>>
> >>> void kvm_arm_setup_debug(struct kvm_vcpu *vcpu)
> >>> {
> >>> - bool trap_debug = !(vcpu->arch.flags & KVM_ARM64_DEBUG_DIRTY);
> >>> unsigned long mdscr, orig_mdcr_el2 = vcpu->arch.mdcr_el2;
> >>>
> >>> trace_kvm_arm_setup_debug(vcpu, vcpu->guest_debug);
> >>>
> >>> - /*
> >>> - * This also clears MDCR_EL2_E2PB_MASK to disable guest access
> >>> - * to the profiling buffer.
> >>> - */
> >>> - vcpu->arch.mdcr_el2 = __this_cpu_read(mdcr_el2) & MDCR_EL2_HPMN_MASK;
> >>> - vcpu->arch.mdcr_el2 |= (MDCR_EL2_TPM |
> >>> - MDCR_EL2_TPMS |
> >>> - MDCR_EL2_TPMCR |
> >>> - MDCR_EL2_TDRA |
> >>> - MDCR_EL2_TDOSA);
> >>> + kvm_arm_setup_mdcr_el2(vcpu, __this_cpu_read(mdcr_el2));
> >>>
> >>> /* Is Guest debugging in effect? */
> >>> if (vcpu->guest_debug) {
> >>> - /* Route all software debug exceptions to EL2 */
> >>> - vcpu->arch.mdcr_el2 |= MDCR_EL2_TDE;
> >>> -
> >>> /* Save guest debug state */
> >>> save_guest_debug_regs(vcpu);
> >>>
> >>> @@ -174,7 +210,6 @@ void kvm_arm_setup_debug(struct kvm_vcpu *vcpu)
> >>>
> >>> vcpu->arch.debug_ptr = &vcpu->arch.external_debug_state;
> >>> vcpu->arch.flags |= KVM_ARM64_DEBUG_DIRTY;
> >>> - trap_debug = true;
> >> There is something that slightly worries me here: there is now a
> >> disconnect between flagging debug as dirty and setting the
> >> trapping. And actually, you now check for KVM_ARM64_DEBUG_DIRTY and
> >> set the trap bits *before* setting the dirty bit itself.
> >>
> >> Here, I believe you end up with guest/host confusion of breakpoints,
> >> which isn't great. Or did I miss something?
> > I'm sorry, but I don't understand what you mean. This is my understanding of what
> > is happening.
> >
> > Without this patch, trap_debug is set to true and the KVM_ARM64_DEBUG_DIRTY flag
> > is set if vcpu->guest_debug & KVM_GUESTDBG_USE_HW. Further down, trap_debug is
> > only used when computing mdcr_el2.
> >
> > With this patch, trap_debug is set to true if vcpu->guest_debug &
> > KVM_GUESTDBG_USE_HW and it's also used for computing mdcr_el2, but this happens in
> > kvm_arm_setup_mdcr_el2(), which is called at the start of kvm_arm_setup_debug().
> > The KVM_ARM64_DEBUG_DIRTY flag is still set in kvm_arm_setup_debug() if
> > vcpu->guest_debug & KVM_GUESTDBG_USE_HW, like before.
> >
> > The guest never runs with the value computed in kvm_vcpu_first_run_init() unless
> > it's identical with the value recomputed in kvm_arm_setup_debug().
> >
> > The only difference I see is that mdcr_el2 is computed at the start of
> > kvm_arm_setup_debug(). I get the feeling I'm also missing something.
>
> I think I understand what you mean, you are worried that we won't
> set the bit in mdcr_el2 to trap debug in the same place where we set
> the debug dirty flag.
Yes, that's what I mean. The code is conceptually as such ATM:

    debug_trap = (something based on vcpu->flags);
    if (something else) {
        check stuff;
        vcpu->flags |= stuff;
        debug_trap = true;
    }
    if (debug_trap)
        set trap conditions;

You are turning this into:

    debug_trap = (something based on vcpu->flags);
    if (debug_trap) {
        set trap conditions;
    }
    if (something else) {
        check stuff;
        vcpu->flags |= stuff;
    }

which isn't the same thing. In your case, it probably works because of
KVM_GUESTDBG_USE_HW, but that's really hard to follow, and we have had
so many bugs in the debug code that it really needs to be kept as
stupid as possible.
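
To make the difference between the two shapes concrete, here is a minimal
user-space sketch. It is a toy model, not the kernel code: struct toy_vcpu,
FLAG_DEBUG_DIRTY, TRAP_DEBUG_ACCESS and both functions are hypothetical
stand-ins, and the second function models the plain reordering without the
extra KVM_GUESTDBG_USE_HW check that the patch relies on.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins for this sketch; not the kernel's definitions. */
#define FLAG_DEBUG_DIRTY  (1UL << 0)  /* plays the role of KVM_ARM64_DEBUG_DIRTY */
#define TRAP_DEBUG_ACCESS (1UL << 9)  /* plays the role of MDCR_EL2_TDA */

struct toy_vcpu {
	unsigned long flags;  /* models vcpu->arch.flags */
	unsigned long mdcr;   /* models vcpu->arch.mdcr_el2 */
	bool use_hw_debug;    /* models the "something else" condition */
};

/* Current shape: the trap decision is applied after the flag update. */
static void setup_debug_current(struct toy_vcpu *v)
{
	bool debug_trap = !(v->flags & FLAG_DEBUG_DIRTY);

	v->mdcr = 0;
	if (v->use_hw_debug) {
		v->flags |= FLAG_DEBUG_DIRTY;
		debug_trap = true;
	}
	if (debug_trap)
		v->mdcr |= TRAP_DEBUG_ACCESS;
}

/* Reordered shape: the trap decision is applied before the flag update. */
static void setup_debug_reordered(struct toy_vcpu *v)
{
	bool debug_trap = !(v->flags & FLAG_DEBUG_DIRTY);

	v->mdcr = 0;
	if (debug_trap)
		v->mdcr |= TRAP_DEBUG_ACCESS;
	if (v->use_hw_debug)
		v->flags |= FLAG_DEBUG_DIRTY;
}

/* Returns true if both shapes compute the same trap bits for this input. */
static bool orderings_agree(unsigned long flags, bool use_hw_debug)
{
	struct toy_vcpu a = { .flags = flags, .use_hw_debug = use_hw_debug };
	struct toy_vcpu b = a;

	setup_debug_current(&a);
	setup_debug_reordered(&b);
	return a.mdcr == b.mdcr;
}

/* Walks all four input combinations of the toy model. */
void check_orderings(void)
{
	assert(orderings_agree(0, false));
	assert(orderings_agree(0, true));
	assert(orderings_agree(FLAG_DEBUG_DIRTY, false));
	/* Flag already dirty and hardware debug requested: the shapes diverge. */
	assert(!orderings_agree(FLAG_DEBUG_DIRTY, true));
}
```

In this model the two shapes only disagree when the dirty flag is already set
on entry and the second condition is true. Whether that state is reachable in
KVM is a separate question, but it shows why the reordered form needs an extra
condition to stay equivalent.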
> If that's the case, then I can move kvm_arm_setup_mdcr_el2 right
> after the BUG_ON() and remove the KVM_GUESTDBG_USE_HW check because
> the KVM_ARM64_DEBUG_DIRTY flag would already be set.
Yes, I think that'd be better.
> Question though, if mdcr_el2 is tied to the debug dirty flag, we
> ignore the flag here (code without this patch):
>
>     BUG_ON(!vcpu->guest_debug &&
>            vcpu->arch.debug_ptr != &vcpu->arch.vcpu_debug_state);
>
>     /* Trap debug register access */
>     if (trap_debug)
>         vcpu->arch.mdcr_el2 |= MDCR_EL2_TDA;
>
>     /* If KDE or MDE are set, perform a full save/restore cycle. */
>     if (vcpu_read_sys_reg(vcpu, MDSCR_EL1) & (DBG_MDSCR_KDE | DBG_MDSCR_MDE))
>         vcpu->arch.flags |= KVM_ARM64_DEBUG_DIRTY;
>
> I suppose there's something I don't understand yet about how this is
> supposed to work.
The idea (IIRC) is that if either MDSCR_EL1.KDE or MDSCR_EL1.MDE is set,
the guest is currently debugging, so we are better off marking the
debug state as dirty, which forces a save/restore cycle on entry.
You may want to dig into the git history for more accurate
information...
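
As a rough illustration of that rule only, here is a standalone sketch. The
function name and TOY_DEBUG_DIRTY are hypothetical, and the bit positions used
for KDE (13) and MDE (15) are what I believe the architecture defines, so
treat them as assumptions and check them against the register description.

```c
#include <stdint.h>

/* Assumed MDSCR_EL1 bit positions; verify against the architecture manual. */
#define TOY_MDSCR_KDE   (UINT64_C(1) << 13)
#define TOY_MDSCR_MDE   (UINT64_C(1) << 15)

/* Hypothetical stand-in for KVM_ARM64_DEBUG_DIRTY. */
#define TOY_DEBUG_DIRTY (1UL << 0)

/*
 * If the guest has enabled kernel debug (KDE) or monitor debug (MDE),
 * treat its debug state as dirty so that a full save/restore is done.
 * Returns the updated flags; other flag bits are left untouched.
 */
unsigned long mark_dirty_if_guest_debugging(uint64_t mdscr_el1,
					    unsigned long flags)
{
	if (mdscr_el1 & (TOY_MDSCR_KDE | TOY_MDSCR_MDE))
		flags |= TOY_DEBUG_DIRTY;
	return flags;
}
```

Note that this path sets the dirty flag without touching the trap bits, which
is the case the question above is about.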
Hope this helps,
M.
--
Without deviation from the norm, progress is not possible.