[PATCH 2/4] iommu: Implement common IOMMU ops for DMA mapping

Yingjoe Chen yingjoe.chen at mediatek.com
Thu May 28 23:35:01 PDT 2015


Hi Robin,

More info; I hope this makes it clearer. We are calling dma_map_sg_attrs
with the following 2 sg entries. With an IOMMU, we expected them to be
merged into 1 contiguous va range, but instead we get 2 va ranges.

sg0 dma_address 0xfeeddc00 size 0x400, offset 0xc00
sg1 dma_address 0xfeede000 size 0x1000, offset 0x0
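
To make the arithmetic easier to follow, here is a minimal user-space
sketch of the alignment and merge check from the quoted patch below. It
is only a model, not kernel code: struct model_sg, model_align() and
model_map_sg() are hypothetical names, the 4K IOVA granule and the IOVA
base of 0xfeedd000 are assumptions inferred from the addresses above, and
the max segment size and segment boundary checks are left out.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for one scatterlist entry; not the kernel struct. */
struct model_sg {
	uint32_t offset;      /* offset of the data within its page */
	uint32_t length;      /* length of the valid data */
	uint64_t dma_address; /* filled in by the model */
	uint32_t dma_len;     /* filled in by the model */
};

/* Round len up to a multiple of granule, which must be a power of two. */
static uint32_t model_align(uint32_t len, uint32_t granule)
{
	assert(granule && (granule & (granule - 1)) == 0);
	return (len + granule - 1) & ~(granule - 1);
}

/*
 * Model of the address arithmetic in iommu_dma_map_sg() followed by
 * __finalise_sg(), as quoted in this thread. iova_base is the assumed
 * start of the allocated IOVA range. Returns the number of DMA segments.
 */
static int model_map_sg(struct model_sg *sg, int nents,
			uint64_t iova_base, uint32_t granule)
{
	struct model_sg *seg = sg;
	uint64_t dma_addr = iova_base;
	uint64_t seg_dma = 0;
	uint32_t seg_len = 0;
	int i, count = 1;

	for (i = 0; i < nents; i++) {
		/* Offset within the granule, and the granule-aligned length */
		uint32_t s_offset = sg[i].offset & (granule - 1);
		uint32_t s_length = sg[i].length;
		uint32_t s_dma_len = model_align(s_length + s_offset, granule);

		/* Same contiguity test as the patch, minus the size limits */
		if (seg_len && seg_dma + seg_len == dma_addr + s_offset) {
			seg->dma_len += s_dma_len;
		} else {
			if (seg_len) {
				seg++;
				count++;
			}
			seg->dma_len = s_dma_len;
			seg->dma_address = dma_addr + s_offset;

			/* seg_dma already includes s_offset, and so does seg_len */
			seg_len = s_offset;
			seg_dma = dma_addr + s_offset;
		}
		seg_len += s_length;
		dma_addr += s_dma_len;
	}
	return count;
}
```

Under those assumptions, feeding the model { offset 0xc00, length 0x400 }
and { offset 0x0, length 0x1000 } returns 2 segments: on the second entry
the check compares 0xfeeddc00 + 0x1000 = 0xfeedec00 against 0xfeede000, so
the entries are not merged. The model also leaves the first segment with a
DMA length of 0x1000 starting at 0xfeeddc00.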

Joe.C

On Fri, 2015-05-29 at 13:26 +0800, Yong Wu wrote:
> Hi Robin,
>     Thanks.
> 
>     While testing venc in v4l2, we hit a problem:
>     When we enter function[0], it breaks out unexpectedly in
> function[1] when the offset of the sg table is not zero. It is OK if the
> offset is zero. I then added more logging in dma-iommu.c; please help
> check below.
>     All our testing was based on dma v2; we have not tested v3 yet.
> The iommu-map-sg code seems to be the same. If it's fixed in v3, I'm very
> sorry. The map_sg in mtk-iommu uses default_iommu_map_sg.
>     If you have any questions, please tell me. Thanks very much.
> 
> [0]http://lxr.free-electrons.com/source/drivers/media/v4l2-core/videobuf2-dma-contig.c#L564
> [1]http://lxr.free-electrons.com/source/drivers/media/v4l2-core/videobuf2-dma-contig.c#L70
> 
> 
> On Wed, 2015-05-27 at 15:09 +0100, Robin Murphy wrote:
> > Taking inspiration from the existing arch/arm code, break out some
> > generic functions to interface the DMA-API to the IOMMU-API. This will
> > do the bulk of the heavy lifting for IOMMU-backed dma-mapping.
> > 
> > Signed-off-by: Robin Murphy <robin.murphy at arm.com>
> > ---
> >  drivers/iommu/Kconfig     |   7 +
> >  drivers/iommu/Makefile    |   1 +
> >  drivers/iommu/dma-iommu.c | 560 ++++++++++++++++++++++++++++++++++++++++++++++
> >  include/linux/dma-iommu.h |  94 ++++++++
> >  4 files changed, 662 insertions(+)
> >  create mode 100644 drivers/iommu/dma-iommu.c
> >  create mode 100644 include/linux/dma-iommu.h
> > 
> [snip]
> > +static int __finalise_sg(struct device *dev, struct scatterlist *sg, int nents,
> > +		dma_addr_t dma_addr)
> > +{
> > +	struct scatterlist *s, *seg = sg;
> > +	unsigned long seg_mask = dma_get_seg_boundary(dev);
> > +	unsigned int max_len = dma_get_max_seg_size(dev);
> > +	unsigned int seg_len = 0, seg_dma = 0;
> > +	int i, count = 1;
> > +
> > +	for_each_sg(sg, s, nents, i) {
> > +		/* Un-swizzling the fields here, hence the naming mismatch */
> > +		unsigned int s_offset = sg_dma_address(s);
> > +		unsigned int s_length = sg_dma_len(s);
> > +		unsigned int s_dma_len = s->length;
> > +
> > +		s->offset = s_offset;
> > +		s->length = s_length;
> > +		sg_dma_address(s) = DMA_ERROR_CODE;
> > +		sg_dma_len(s) = 0;
> > +
> > +		if (seg_len && (seg_dma + seg_len == dma_addr + s_offset) &&
> > +		    (seg_len + s_dma_len <= max_len) &&
> > +		    ((seg_dma & seg_mask) <= seg_mask - (seg_len + s_length))
> > +		   ) {
> > +			sg_dma_len(seg) += s_dma_len;
> > +		} else {
> > +			if (seg_len) {
> > +				seg = sg_next(seg);
> > +				count++;
> > +			}
> > +			sg_dma_len(seg) = s_dma_len;
> > +			sg_dma_address(seg) = dma_addr + s_offset;
>    Here the value of sg_dma_address has had s_offset added, but
> sg_dma_len(seg) is still s_dma_len.
>    In the first loop, s_dma_len comes from s->length, which is aligned by
> "s_length = iova_align(iovad, s_length + s_offset);" in
> the interface iommu_dma_map_sg.
> > +
> > +			seg_len = s_offset;
> > +			seg_dma = dma_addr + s_offset;
> > +		}
> > +		seg_len += s_length;
> > +		dma_addr += s_dma_len;
> > +	}
> > +	return count;
> > +}
> > +
> > +static void __invalidate_sg(struct scatterlist *sg, int nents)
> > +{
> > +	struct scatterlist *s;
> > +	int i;
> > +
> > +	for_each_sg(sg, s, nents, i) {
> > +		if (sg_dma_address(s) != DMA_ERROR_CODE)
> > +			s->offset = sg_dma_address(s);
> > +		if (sg_dma_len(s))
> > +			s->length = sg_dma_len(s);
> > +		sg_dma_address(s) = DMA_ERROR_CODE;
> > +		sg_dma_len(s) = 0;
> > +	}
> > +}
> > +
> > +int iommu_dma_map_sg(struct device *dev, struct scatterlist *sg,
> > +		int nents, int prot, bool coherent)
> > +{
> > +	struct iommu_dma_domain *dom = arch_get_dma_domain(dev);
> > +	struct iova_domain *iovad = dom->iovad;
> > +	struct iova *iova;
> > +	struct scatterlist *s;
> > +	dma_addr_t dma_addr;
> > +	size_t iova_len = 0;
> > +	int i;
> > +
> > +	/*
> > +	 * Work out how much IOVA space we need, and align the segments to
> > +	 * IOVA granules for the IOMMU driver to handle. With some clever
> > +	 * trickery we can modify the list in a reversible manner.
> > +	 */
> > +	for_each_sg(sg, s, nents, i) {
> > +		size_t s_offset = iova_offset(iovad, s->offset);
> > +		size_t s_length = s->length;
> > +
> > +		sg_dma_address(s) = s->offset;
> > +		sg_dma_len(s) = s_length;
> > +		s->offset -= s_offset;
> > +		s_length = iova_align(iovad, s_length + s_offset);
> > +		s->length = s_length;
> At the beginning, s->length is the length of the valid data, but it is
> aligned here.
> > +
> > +		iova_len += s_length;
> > +	}
> > +
> > +	iova = __alloc_iova(dev, iova_len, coherent);
> > +	if (!iova)
> > +		goto out_restore_sg;
> > +
> > +	/*
> > +	 * We'll leave any physical concatenation to the IOMMU driver's
> > +	 * implementation - it knows better than we do.
> > +	 */
> > +	dma_addr = iova_dma_addr(iovad, iova);
> > +	if (iommu_map_sg(dom->domain, dma_addr, sg, nents, prot) < iova_len)
> > +		goto out_free_iova;
> > +
> > +	return __finalise_sg(dev, sg, nents, dev_dma_addr(dev, dma_addr));
> > +
> > +out_free_iova:
> > +	__free_iova(iovad, iova);
> > +out_restore_sg:
> > +	__invalidate_sg(sg, nents);
> > +	return 0;
> > +}
> > +
> [snip]
> 
> 




