[PATCH] ARM: mm: dma: Update coherent streaming apis with missing memory barrier
Catalin Marinas
catalin.marinas at arm.com
Thu Apr 24 03:13:56 PDT 2014
On Thu, Apr 24, 2014 at 10:16:24AM +0100, Russell King - ARM Linux wrote:
> On Thu, Apr 24, 2014 at 10:09:27AM +0100, Catalin Marinas wrote:
> > If we only do D-cache maintenance by MVA, the ARM ARM (both v7 and v8)
> > claims that these are ordered relative to any explicit load/stores to
> > the same address. So in theory we don't even need a DMB for unmapping
> > with DMA_FROM_DEVICE. But in practice, we may have the outer cache,
> > hence a DSB is required before the outer_sync() (we could move it there
> > though).
>
> The general usecase for outer_sync() is: dsb(); outer_sync(); Why would
> we want to change this to dsb(); dmb(); outer_sync(); (where the dmb is
> in outer_sync itself?)
>
> Seems more sensible for it to stay at the outer_sync() call site where
> it's needed.
You are right, it gets worse for the wmb() case if we change
outer_sync(), I was thinking about cache maintenance.
An optimisation would be for functions like v7_dma_inv_range() to no
longer have the dsb but move it to the __dma_page_cpu_to_dev() before
the outer_*_range() ops. If we assume that a streaming DMA is started by
a writel() access which has a dsb already, in the absence of outer cache
we wouldn't need any dsb at all, hence something like a conditional
sync_for_outer() barrier (dsb if outer cache or no-op otherwise) in
__dma_page_cpu_to_dev().
In the __dma_page_dev_to_cpu() we wouldn't need any dsb at all for cache
maintenance since subsequent accesses to the same address are ordered by
the hardware (and outer cache maintenance is done before the inner
anyway).
(that's from an ARMv7 perspective, we need to check ordering on earlier
architectures)
--
Catalin
More information about the linux-arm-kernel
mailing list