[PATCH] iommu/io-pgtable-arm: Fix iova_to_phys for block entries

Robin Murphy robin.murphy at arm.com
Fri Jun 17 07:07:36 PDT 2016


On 16/06/16 18:44, Will Deacon wrote:
> The implementation of iova_to_phys for the long-descriptor ARM
> io-pgtable code always masks with the granule size when inserting the
> low virtual address bits into the physical address determined from the
> page tables. In cases where the leaf entry is found before the final
> level of table (i.e. due to a block mapping), this results in rounding
> down to the bottom page of the block mapping. Consequently, the physical
> address range batching in vfio_unmap_unpin() is defeated and we end
> up taking the long way home.
>
> This patch fixes the problem by masking the virtual address with the
> appropriate mask for the level at which the leaf descriptor is located.
> The short-descriptor code already gets this right, so no change is
> needed there.

With this, I now see VFIO unmapping at the same granularity as the 
initial mapping. To think of all the cumulative hours we've spent 
watching it split the blocks and go 4K at a time... *sigh*

Tested-by: Robin Murphy <robin.murphy at arm.com>

> Reported-by: Robin Murphy <robin.murphy at arm.com>
> Signed-off-by: Will Deacon <will.deacon at arm.com>
> ---
>   drivers/iommu/io-pgtable-arm.c | 2 +-
>   1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/drivers/iommu/io-pgtable-arm.c b/drivers/iommu/io-pgtable-arm.c
> index a1ed1b73fed4..f5c90e1366ce 100644
> --- a/drivers/iommu/io-pgtable-arm.c
> +++ b/drivers/iommu/io-pgtable-arm.c
> @@ -576,7 +576,7 @@ static phys_addr_t arm_lpae_iova_to_phys(struct io_pgtable_ops *ops,
>   	return 0;
>
>   found_translation:
> -	iova &= (ARM_LPAE_GRANULE(data) - 1);
> +	iova &= (ARM_LPAE_BLOCK_SIZE(lvl, data) - 1);
>   	return ((phys_addr_t)iopte_to_pfn(pte,data) << data->pg_shift) | iova;
>   }
>
>
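
For anyone following along, here is a rough standalone sketch of what the
one-line change does to the returned address. It is not the kernel code:
block_size(), to_phys_old() and to_phys_new() are hypothetical stand-ins,
and it assumes a 4K granule, 8-byte descriptors and table levels 0..3.

```c
#include <stdint.h>

/* Assumed configuration for illustration only: 4K granule, 8-byte PTEs. */
#define PG_SHIFT        12
#define BITS_PER_LEVEL  (PG_SHIFT - 3)  /* 512 entries per table */
#define MAX_LEVELS      4               /* levels 0..3 */

/* Bytes covered by one leaf entry at the given level (hypothetical helper). */
static uint64_t block_size(int lvl)
{
	return 1ULL << (3 + (MAX_LEVELS - lvl) * BITS_PER_LEVEL);
}

/* Old behaviour: always keep only the low granule-sized bits of the IOVA. */
static uint64_t to_phys_old(uint64_t leaf_pa, uint64_t iova, int lvl)
{
	(void)lvl; /* the level of the leaf entry is ignored */
	return leaf_pa | (iova & ((1ULL << PG_SHIFT) - 1));
}

/* New behaviour: keep every IOVA bit below the size of the leaf entry. */
static uint64_t to_phys_new(uint64_t leaf_pa, uint64_t iova, int lvl)
{
	return leaf_pa | (iova & (block_size(lvl) - 1));
}
```

With a 2MB block at level 2 whose output address is 0x80200000, looking up
IOVA 0x40212345 gives 0x80200345 from to_phys_old() and 0x80212345 from
to_phys_new(). At level 3 the two masks are identical, so page mappings
are unaffected.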



