[PATCH v2] iommu/arm-smmu: Invalidate TLBs properly
Will Deacon
will.deacon at arm.com
Mon Dec 7 10:28:21 PST 2015
On Mon, Dec 07, 2015 at 06:18:52PM +0000, Robin Murphy wrote:
> When invalidating an IOVA range potentially spanning multiple pages,
> such as when removing an entire intermediate-level table, we currently
> only issue an invalidation for the first IOVA of that range. Since the
> architecture specifies that address-based TLB maintenance operations
> target a single entry, an SMMU could feasibly retain live entries for
> subsequent pages within that unmapped range, which is not good.
>
> Make sure we hit every possible entry by iterating over the whole range
> at the granularity provided by the pagetable implementation.
>
> Signed-off-by: Robin Murphy <robin.murphy at arm.com>
> ---
>
> v2: include SMMUv3 fix, use the same shorter loop construct everywhere.
>
> drivers/iommu/arm-smmu-v3.c | 5 ++++-
> drivers/iommu/arm-smmu.c | 16 +++++++++++++---
> 2 files changed, 17 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/iommu/arm-smmu-v3.c b/drivers/iommu/arm-smmu-v3.c
> index c302b65..8bb5abf 100644
> --- a/drivers/iommu/arm-smmu-v3.c
> +++ b/drivers/iommu/arm-smmu-v3.c
> @@ -1354,7 +1354,10 @@ static void arm_smmu_tlb_inv_range_nosync(unsigned long iova, size_t size,
> cmd.tlbi.vmid = smmu_domain->s2_cfg.vmid;
> }
>
> - arm_smmu_cmdq_issue_cmd(smmu, &cmd);
> + do {
> + arm_smmu_cmdq_issue_cmd(smmu, &cmd);
> + cmd.tlbi.addr += granule;
> + } while (size -= granule);
> }
>
> static struct iommu_gather_ops arm_smmu_gather_ops = {
> diff --git a/drivers/iommu/arm-smmu.c b/drivers/iommu/arm-smmu.c
> index 601e3dd..eb28c3e 100644
> --- a/drivers/iommu/arm-smmu.c
> +++ b/drivers/iommu/arm-smmu.c
> @@ -597,12 +597,18 @@ static void arm_smmu_tlb_inv_range_nosync(unsigned long iova, size_t size,
> if (!IS_ENABLED(CONFIG_64BIT) || smmu->version == ARM_SMMU_V1) {
> iova &= ~12UL;
> iova |= ARM_SMMU_CB_ASID(cfg);
> - writel_relaxed(iova, reg);
> + do {
> + writel_relaxed(iova, reg);
> + iova += granule;
> + } while (size -= granule);
> #ifdef CONFIG_64BIT
> } else {
> iova >>= 12;
> iova |= (u64)ARM_SMMU_CB_ASID(cfg) << 48;
> - writeq_relaxed(iova, reg);
> + do {
> + writeq_relaxed(iova, reg);
> + iova += granule >> 12;
> + } while (size -= granule)
This doesn't compile.
> #endif
> }
> #ifdef CONFIG_64BIT
> @@ -610,7 +616,11 @@ static void arm_smmu_tlb_inv_range_nosync(unsigned long iova, size_t size,
> reg = ARM_SMMU_CB_BASE(smmu) + ARM_SMMU_CB(smmu, cfg->cbndx);
> reg += leaf ? ARM_SMMU_CB_S2_TLBIIPAS2L :
> ARM_SMMU_CB_S2_TLBIIPAS2;
> - writeq_relaxed(iova >> 12, reg);
> + iova >>= 12;
> + do {
> + writeq_relaxed(iova, reg);
> + iova += granule >> 12;
> + } while (size -= granule)
Same here.
Please at least build your patches, and preferably test them too.
Will
More information about the linux-arm-kernel
mailing list