[PATCH 0/3] SMMUv3 CMD_SYNC optimisation
Nate Watterson
nwatters at codeaurora.org
Thu Aug 24 21:44:48 PDT 2017
Hi Robin,
On 8/18/2017 1:33 PM, Robin Murphy wrote:
> Hi all,
>
> Waiting for the command queue to drain for CMD_SYNC completion is likely
> a contention hotspot on high-core-count systems. If the SMMU is coherent
> and supports MSIs, though, we can use this cool feature (as suggested by
> the architecture, no less) to make syncs effectively non-blocking for
> anyone other than the caller.
>
> I don't have any hardware that supports MSIs, but this has at least
> passed muster on the Fast Model with cache modelling enabled - I'm hoping
> the Qualcomm machines have the appropriate configuration to actually test
> how well it works in reality. If it is worthwhile, I do have most of a
> plan for how we can do something similar in the non-MSI polling case (it's
> mostly a problem of handling the queue-wrapping edge cases correctly).
I tested this on QDF2400 hardware which supports MSI as a CMD_SYNC
completion signal. As with Thunder's "performance optimization" series,
I evaluated the patches using FIO with 4 NVME drives connected to a
single SMMU. Here's how they compared:
FIO - 512k blocksize / io-depth 32 / 1 thread per drive
Baseline 4.13-rc1 w/SMMU enabled: 25% of SMMU bypass performance
Baseline + Thunder Patch 1 : 28%
Baseline + CMD_SYNC Optimization: 36%
Baseline + Thunder Patches 2-5 : 86%
Baseline + Thunder Patches 1-5 : 100% [!!]
Seems like it would probably be worthwhile to implement this for the
non-MSI case also. Let me know if there are other workloads you're
particularly interested in, and I'll try to get those tested too.
-Nate
>
> Robin.
>
>
> Robin Murphy (3):
> iommu/arm-smmu-v3: Specialise CMD_SYNC handling
> iommu/arm-smmu-v3: Forget about cmdq-sync interrupt
> iommu/arm-smmu-v3: Utilise CMD_SYNC MSI feature
>
> drivers/iommu/arm-smmu-v3.c | 117 +++++++++++++++++++++++++++++---------------
> 1 file changed, 77 insertions(+), 40 deletions(-)
>
--
Qualcomm Datacenter Technologies as an affiliate of Qualcomm Technologies, Inc.
Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.
More information about the linux-arm-kernel
mailing list