[PATCH v4 2/7] iommu/core: split mapping to page sizes as supported by the hardware
Kai Huang
mail.kai.huang at gmail.com
Thu Nov 10 01:17:24 EST 2011
>
> -int iommu_unmap(struct iommu_domain *domain, unsigned long iova, int gfp_order)
> +size_t iommu_unmap(struct iommu_domain *domain, unsigned long iova, size_t size)
> {
> - size_t size, unmapped;
> + size_t unmapped_page, unmapped = 0;
> + unsigned int min_pagesz;
>
> if (unlikely(domain->ops->unmap == NULL))
> return -ENODEV;
>
> - size = PAGE_SIZE << gfp_order;
> -
> - BUG_ON(!IS_ALIGNED(iova, size));
> -
> - unmapped = domain->ops->unmap(domain, iova, size);
> -
> - return get_order(unmapped);
> + /* find out the minimum page size supported */
> + min_pagesz = 1 << __ffs(domain->ops->pgsize_bitmap);
> +
> + /*
> + * The virtual address, as well as the size of the mapping, must be
> + * aligned (at least) to the size of the smallest page supported
> + * by the hardware
> + */
> + if (!IS_ALIGNED(iova | size, min_pagesz)) {
> + pr_err("unaligned: iova 0x%lx size 0x%lx min_pagesz 0x%x\n",
> + iova, (unsigned long)size, min_pagesz);
> + return -EINVAL;
> + }
> +
> + pr_debug("unmap this: iova 0x%lx size 0x%lx\n", iova,
> + (unsigned long)size);
> +
> + /*
> + * Keep iterating until we either unmap 'size' bytes (or more)
> + * or we hit an area that isn't mapped.
> + */
> + while (unmapped < size) {
> + size_t left = size - unmapped;
> +
> + unmapped_page = domain->ops->unmap(domain, iova, left);
> + if (!unmapped_page)
> + break;
> +
> + pr_debug("unmapped: iova 0x%lx size %lx\n", iova,
> + (unsigned long)unmapped_page);
> +
> + iova += unmapped_page;
> + unmapped += unmapped_page;
> + }
> +
> + return unmapped;
> }
> EXPORT_SYMBOL_GPL(iommu_unmap);
>
Seems the unmap function don't take phys as parameter, does this mean
domain->ops->unmap will walk through the page table to find out the
actual page size?
And another question: have we considered the IOTLB flush operation? I
think we need to implement similar logic when flush the DVMA range.
Intel VT-d's manual says software needs to specify the appropriate
mask value to flush large pages, but it does not say we need to
exactly match the page size as it is mapped. I guess it's not
necessary for Intel IOMMU, but other vendor's IOMMU may have such
limitation (or some other limitations). In my understanding current
implementation does not provide page size information for particular
DVMA ranges that has been mapped, and it's not flexible to implement
IOTLB flush code (ex, we may need to walk through page table to find
out actual page size). Maybe we can also add iommu_ops->flush_iotlb ?
-cody
More information about the linux-arm-kernel
mailing list