[PATCH v3 03/11] iommu: Add reset_device_done callback for hardware fault recovery

Baolu Lu baolu.lu at linux.intel.com
Thu Apr 23 19:40:11 PDT 2026


On 4/17/26 07:28, Nicolin Chen wrote:
> When IOMMU hardware detects an error due to a faulty device (e.g. an ATS
> invalidation timeout), IOMMU drivers may quarantine the device by disabling
> specific hardware features or dropping translation capabilities.
> 
> To recover from these states, the IOMMU driver needs a reliable signal that
> the underlying physical hardware has been cleanly reset (e.g., via PCIe AER
> or a sysfs Function Level Reset) so as to lift the quarantine.
> 
> Introduce a reset_device_done callback in struct iommu_ops. Trigger it from
> the existing pci_dev_reset_iommu_done() path to notify the underlying IOMMU
> driver that the device's internal state has been sanitized.
> 
> Signed-off-by: Nicolin Chen <nicolinc at nvidia.com>
> ---
>   include/linux/iommu.h |  4 ++++
>   drivers/iommu/iommu.c | 12 ++++++++++++
>   2 files changed, 16 insertions(+)
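
For readers following along, the notification flow in the commit message can
be modeled roughly as below. This is a userspace sketch, not the code from the
patch: only the names reset_device_done, struct iommu_ops and
pci_dev_reset_iommu_done() come from the description above. The callback's
argument list, the cut-down struct device, the quarantined flag, and the
demo_*/model_* helpers are all assumptions made for illustration.

```c
#include <stdbool.h>
#include <stddef.h>

/* Userspace model only; these are NOT the kernel's definitions. */
struct iommu_ops;

struct device {
	const struct iommu_ops *ops;
	bool quarantined;	/* hypothetical per-device driver state */
};

struct iommu_ops {
	/* Optional callback; the argument list here is an assumption. */
	void (*reset_device_done)(struct device *dev);
};

/* Hypothetical driver callback: the device was reset, so lift the quarantine. */
static void demo_reset_device_done(struct device *dev)
{
	dev->quarantined = false;
}

static const struct iommu_ops demo_ops = {
	.reset_device_done = demo_reset_device_done,
};

/*
 * Hypothetical stand-in for the core step reached from
 * pci_dev_reset_iommu_done(): notify the driver only if it
 * implements the optional callback.
 */
static void model_reset_iommu_done(struct device *dev)
{
	if (dev->ops && dev->ops->reset_device_done)
		dev->ops->reset_device_done(dev);
}
```

In this model, a driver that leaves reset_device_done unset is simply not
notified, so a device it quarantined would stay quarantined after the reset.
Whether the actual patch treats the callback as optional in this way is not
stated in the commit message.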

Reviewed-by: Lu Baolu <baolu.lu at linux.intel.com>



More information about the linux-arm-kernel mailing list