RFC on Kdump and PCIe on ARM64
Sinan Kaya
okaya at codeaurora.org
Thu Mar 1 11:19:09 PST 2018
On 3/1/2018 2:05 PM, Bjorn Helgaas wrote:
> [+cc Joerg, David, iommu list]
>
> On Thu, Mar 01, 2018 at 12:44:26PM -0500, Sinan Kaya wrote:
>> Hi,
>>
>> We are seeing IOMMU faults when booting the kdump kernel on ARM64.
>>
>> [ 7.220162] arm-smmu-v3 arm-smmu-v3.0.auto: event 0x02 received:
>> [ 7.226123] arm-smmu-v3 arm-smmu-v3.0.auto: 0x0000010000000002
>> [ 7.232023] arm-smmu-v3 arm-smmu-v3.0.auto: 0x0000000000000000
>> [ 7.237925] arm-smmu-v3 arm-smmu-v3.0.auto: 0x0000000000000000
>> [ 7.243827] arm-smmu-v3 arm-smmu-v3.0.auto: 0x0000000000000000
>>
>> This is Nate's interpretation of the fault:
>>
>> "The PCI device is sending transactions just after the SMMU was
>> reset/reinitialized which is problematic because the device has not
>> yet been added to the SMMU and thus should not be doing *any* DMA.
>> DMA from the PCI devices should be quiesced prior to starting the
>> crashdump kernel or you risk overwriting portions of memory you
>> meant to preserve. In this case the SMMU was actually doing you a
>> favor by blocking these errant DMA operations!!"
>>
>> I think this makes sense especially for the IOMMU enabled case on
>> the host where an IOVA can overlap with the region of memory kdump
>> reserved for itself.
>>
>> Apparently, there has been similar concerns in the past.
>>
>> https://www.fujitsu.com/jp/documents/products/software/os/linux/catalog/LinuxConJapan2013-Indoh.pdf
>>
>> and was not addressed globally due to IOMMU+PCI driver ordering
>> issues and bugs in HW due to hot reset.
>>
>> https://lkml.org/lkml/2012/8/3/160
>>
>> Hot reset as mentioned is destructive and may not be the best
>> implementation choice. However, most of the modern endpoints
>> support PCIE function level reset.
>>
>> One other solution is for SMMUv3 driver to reserve the kdump used
>> IOVA addresses.
>>
>> Another solution is for the SMMUv3 driver to disable PCIe devices
>> behind the SMMU if it see SMMU is already enabled.
>
> What problem are you trying to solve? If the IOMMU is blocking DMA
> after the kdump kernel starts up, that sounds like the desired
> behavior.
>
Three issues:
1. I'm seeing a flood of SMMUv3 faults due to adapter using addresses from the
previous kernel. This might be OK.
2. When the SMMUv3 driver sees that it is enabled, it resets itself and
configures it one more time.
[ 7.018304] arm-smmu-v3 arm-smmu-v3.0.auto: ias 44-bit, oas 44-bit (features 0x00001fef)
[ 7.026379] arm-smmu-v3 arm-smmu-v3.0.auto: SMMU currently enabled! Resetting...
>From the moment IOMMU is disabled to the point where IOMMU get enabled again,
there is a potential for the PCIE device to corrupt the kdump kernel memory as
the bus master and memory enable bits are left enabled.
[ 0.000000] crashkernel reserved: 0x000000007fe00000 - 0x00000000ffe00000 (2048 MB)
This region happens to overlap with the IOVA addresses that SMMUv3 driver on the main
kernel is allocating.
IOVA addresses start from 0xFFFFFFFF and get decremented on each allocation.
3. The last one is adapter gets into fuzzy state due to not coming out of clean state
in the second time init and being rejected by SMMUv3 multiple times.
[ 16.093441] pci 0000:01:00.0: aer_status: 0x00040000, aer_mask: 0x00000000
[ 16.099356] pci 0000:01:00.0: Malformed TLP
[ 16.103522] pci 0000:01:00.0: aer_layer=Transaction Layer, aer_agent=Receiver ID
[ 16.110900] pci 0000:01:00.0: aer_uncor_severity: 0x00062011
[ 16.116543] pci 0000:01:00.0: TLP Header: 0a00a000 00008100 01010100 00000000
--
Sinan Kaya
Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc.
Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.
More information about the kexec
mailing list