[PATCH v6 04/39] KVM: arm64: vgic: Split out mapping IRQs and setting irq_ops

Marc Zyngier maz at kernel.org
Tue Mar 17 09:00:56 PDT 2026


On Tue, 17 Mar 2026 11:40:59 +0000,
Sascha Bischoff <Sascha.Bischoff at arm.com> wrote:
> 
> Prior to this change, the act of mapping a virtual IRQ to a physical
> one also set the irq_ops. Unmapping then reset the irq_ops to NULL. So
> far, this has been fine and hasn't caused any major issues.
> 
> Now, however, as GICv5 support is being added to KVM, it has become
> apparent that conflating mapping/unmapping IRQs and setting/clearing
> irq_ops can cause issues. The reason is that the upcoming GICv5
> support introduces a set of default irq_ops for PPIs, and removing
> this when unmapping will cause things to break rather horribly.
> 
> Split out the mapping/unmapping of IRQs from the setting/clearing of
> irq_ops. The arch timer code is updated to set the irq_ops following a
> successful map. The irq_ops are intentionally not removed again on an
> unmap as the only irq_op introduced by the arch timer only takes
> effect if the hw bit in struct vgic_irq is set. Therefore, it is safe
> to leave this in place, and it avoids additional complexity when GICv5
> support is introduced.
> 
> Signed-off-by: Sascha Bischoff <sascha.bischoff at arm.com>
> ---
>  arch/arm64/kvm/arch_timer.c | 32 ++++++++++++++++++-------------
>  arch/arm64/kvm/vgic/vgic.c  | 38 +++++++++++++++++++++++++++++++------
>  include/kvm/arm_vgic.h      |  5 ++++-
>  3 files changed, 55 insertions(+), 20 deletions(-)
> 
> diff --git a/arch/arm64/kvm/arch_timer.c b/arch/arm64/kvm/arch_timer.c
> index 600f250753b45..1f536dd5978d4 100644
> --- a/arch/arm64/kvm/arch_timer.c
> +++ b/arch/arm64/kvm/arch_timer.c
> @@ -740,14 +740,17 @@ static void kvm_timer_vcpu_load_nested_switch(struct kvm_vcpu *vcpu,
>  
>  		ret = kvm_vgic_map_phys_irq(vcpu,
>  					    map->direct_vtimer->host_timer_irq,
> -					    timer_irq(map->direct_vtimer),
> -					    &arch_timer_irq_ops);
> -		WARN_ON_ONCE(ret);
> +					    timer_irq(map->direct_vtimer));
> +		if (!WARN_ON_ONCE(ret))
> +			kvm_vgic_set_irq_ops(vcpu, timer_irq(map->direct_vtimer),
> +					     &arch_timer_irq_ops);
> +
>  		ret = kvm_vgic_map_phys_irq(vcpu,
>  					    map->direct_ptimer->host_timer_irq,
> -					    timer_irq(map->direct_ptimer),
> -					    &arch_timer_irq_ops);
> -		WARN_ON_ONCE(ret);
> +					    timer_irq(map->direct_ptimer));
> +		if (!WARN_ON_ONCE(ret))
> +			kvm_vgic_set_irq_ops(vcpu, timer_irq(map->direct_ptimer),
> +					     &arch_timer_irq_ops);

Do we really need this eager setting of ops? Given that nothing seems
to clear them, why can't we just set the ops at vcpu init time? Given
that this is a pretty hot path (on each exception/exception return
between L2 and L1), the least we do here, the better.

>  	}
>  }
>  
> @@ -1565,20 +1568,23 @@ int kvm_timer_enable(struct kvm_vcpu *vcpu)
>  
>  	ret = kvm_vgic_map_phys_irq(vcpu,
>  				    map.direct_vtimer->host_timer_irq,
> -				    timer_irq(map.direct_vtimer),
> -				    &arch_timer_irq_ops);
> +				    timer_irq(map.direct_vtimer));
>  	if (ret)
>  		return ret;
>  
> +	kvm_vgic_set_irq_ops(vcpu, timer_irq(map.direct_vtimer),
> +			     &arch_timer_irq_ops);
> +
>  	if (map.direct_ptimer) {
>  		ret = kvm_vgic_map_phys_irq(vcpu,
>  					    map.direct_ptimer->host_timer_irq,
> -					    timer_irq(map.direct_ptimer),
> -					    &arch_timer_irq_ops);
> -	}
> +					    timer_irq(map.direct_ptimer));
> +		if (ret)
> +			return ret;
>  
> -	if (ret)
> -		return ret;
> +		kvm_vgic_set_irq_ops(vcpu, timer_irq(map.direct_ptimer),
> +				     &arch_timer_irq_ops);
> +	}

which would mean moving this to kvm_timer_vcpu_init().

>  
>  no_vgic:
>  	timer->enabled = 1;
> diff --git a/arch/arm64/kvm/vgic/vgic.c b/arch/arm64/kvm/vgic/vgic.c
> index e22b79cfff965..e37c640d74bcf 100644
> --- a/arch/arm64/kvm/vgic/vgic.c
> +++ b/arch/arm64/kvm/vgic/vgic.c
> @@ -553,10 +553,38 @@ int kvm_vgic_inject_irq(struct kvm *kvm, struct kvm_vcpu *vcpu,
>  	return 0;
>  }
>  
> +void kvm_vgic_set_irq_ops(struct kvm_vcpu *vcpu, u32 vintid,
> +			 struct irq_ops *ops)
> +{
> +	struct vgic_irq *irq = vgic_get_vcpu_irq(vcpu, vintid);
> +
> +	BUG_ON(!irq);
> +
> +	scoped_guard(raw_spinlock_irqsave, &irq->irq_lock)
> +	{
> +		irq->ops = ops;
> +	}

nit: opening brace in the wrong spot, and overall not useful. This
could simply be written as:

	scoped_guard(raw_spinlock_irqsave, &irq->irq_lock)
		irq->ops = ops;

> +
> +	vgic_put_irq(vcpu->kvm, irq);
> +}
> +
> +void kvm_vgic_clear_irq_ops(struct kvm_vcpu *vcpu, u32 vintid)
> +{
> +	struct vgic_irq *irq = vgic_get_vcpu_irq(vcpu, vintid);
> +
> +	BUG_ON(!irq);
> +
> +	scoped_guard(raw_spinlock_irqsave, &irq->irq_lock)
> +	{
> +		irq->ops = NULL;
> +	}
> +
> +	vgic_put_irq(vcpu->kvm, irq);
> +}
> +

nit: that could also be written as:

void kvm_vgic_clear_irq_ops(struct kvm_vcpu *vcpu, u32 vintid)
{
	kvm_vgic_set_irq_ops(vcpu, vintid, NULL);
}

I can fix all of it when applying if that works for you.

Thanks,

	M.

-- 
Without deviation from the norm, progress is not possible.



More information about the linux-arm-kernel mailing list