[PATCH v3 03/11] iommu: Add reset_device_done callback for hardware fault recovery
Baolu Lu
baolu.lu at linux.intel.com
Thu Apr 23 19:40:11 PDT 2026
On 4/17/26 07:28, Nicolin Chen wrote:
> When IOMMU hardware detects an error due to a faulty device (e.g. an ATS
> invalidation timeout), IOMMU drivers may quarantine the device by disabling
> specific hardware features or dropping translation capabilities.
>
> To recover from these states, the IOMMU driver needs a reliable signal that
> the underlying physical hardware has been cleanly reset (e.g., via PCIe AER
> or a sysfs Function Level Reset) so as to lift the quarantine.
>
> Introduce a reset_device_done callback in struct iommu_ops. Trigger it from
> the existing pci_dev_reset_iommu_done() path to notify the underlying IOMMU
> driver that the device's internal state has been sanitized.
>
> Signed-off-by: Nicolin Chen <nicolinc at nvidia.com>
> ---
>  include/linux/iommu.h |  4 ++++
>  drivers/iommu/iommu.c | 12 ++++++++++++
>  2 files changed, 16 insertions(+)
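
The dispatch described in the commit message can be sketched in plain
userspace C. This is not the code from the patch: the struct layouts, the
callback signature, the "quarantined" flag and both mock_* functions are
assumptions made for illustration only. It shows the two points the
description relies on: the callback is optional, and the driver lifts its
quarantine only when the core reports that the reset has completed.

```c
#include <stdbool.h>
#include <stddef.h>

struct device;

/* Hypothetical stand-in for struct iommu_ops; only the new hook is modeled. */
struct iommu_ops {
	/* Optional. The signature here is an assumption, not taken from the patch. */
	void (*reset_device_done)(struct device *dev);
};

/* Hypothetical stand-in for a device; "quarantined" is made-up driver state. */
struct device {
	const char *name;
	const struct iommu_ops *ops;
	bool quarantined;
};

/* Mock driver callback: the device has been reset, so lift the quarantine. */
static void mock_reset_device_done(struct device *dev)
{
	dev->quarantined = false;
}

/*
 * Mock of the core notification step. The real pci_dev_reset_iommu_done()
 * path does more than this; only the optional-callback dispatch is shown.
 */
static void mock_reset_iommu_done(struct device *dev)
{
	if (dev->ops && dev->ops->reset_device_done)
		dev->ops->reset_device_done(dev);
}
```

A driver that does not set the hook is unaffected: with a NULL
reset_device_done, mock_reset_iommu_done() returns without touching the
device.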
Reviewed-by: Lu Baolu <baolu.lu at linux.intel.com>