[PATCH v6 4/4] dmaengine: pl330: Don't require irq-safe runtime PM

Ulf Hansson ulf.hansson at linaro.org
Tue Jan 24 07:29:05 PST 2017


On 24 January 2017 at 10:27, Marek Szyprowski <m.szyprowski at samsung.com> wrote:
> This patch replaces irq-safe runtime PM with non-irq-safe version based on
> the new approach. Existing, irq-safe runtime PM implementation for PL330 was
> not bringing much benefits of its own - only clocks were enabled/disabled.
>
> Till now non-irq-safe runtime PM implementation was only possible by calling
> pm_runtime_get/put functions from alloc/free_chan_resources. All other DMA
> engine API functions cannot be called from a context, which permits sleeping.
> Such implementation, in practice would result in keeping DMA controller's
> device active almost all the time, because most of the slave device drivers
> (DMA engine clients) acquire DMA channel in their probe() function and
> released it during driver removal.
>
> This patch provides a new, different approach. It is based on an observation
> that there can be only one slave device using each DMA channel. PL330 hardware
> always has dedicated channels for each peripheral device. Using recently
> introduced device dependencies (links) infrastructure one can ensure proper
> runtime PM state of PL330 DMA controller basing on the runtime PM state of
> the slave device.
>
> In this approach in pl330_alloc_chan_resources() function a new dependency
> is being created between PL330 DMA controller device (as a supplier) and
> given slave device (as a consumer). This way PL330 DMA controller device
> runtime active counter is increased when the slave device is resumed and
> decreased the same time when given slave device is put to suspend. This way
> it has been ensured to keep PL330 DMA controller runtime active if there is
> an active used of any of its DMA channels. Slave device pointer is initially
> stored in per-channel data in of_dma_xlate callback. This is similar to what
> has been already implemented in Exynos IOMMU driver in commit 2f5f44f205cc95
> ("iommu/exynos: Use device dependency links to control runtime pm").
>
> If slave device doesn't implement runtime PM or keeps device runtime active
> all the time, then PL330 DMA controller will be runtime active all the time
> when channel is being allocated. The goal is however to have runtime PM

I would remove everything after "The goal is however.."  from this section.

Instead, what I think is important to state (may be added in the first
section in the change log) is however another limitation with irq-safe
runtime PM. That is, it may prevent the generic PM domain (genpd) from
being powered off, particularly in cases when the genpd doesn't have
the GENPD_FLAG_IRQ_SAFE set.

> added to all devices in the system, because it lets respective power
> domains to be turned off, what gives the best results in terms of power
> saving.
>
> If one requests memory-to-memory channel, runtime active counter is
> increased unconditionally. This might be a drawback of this approach, but
> PL330 is not really used for memory-to-memory operations due to poor
> performance in such operations compared to the CPU.
>
> Introducing non-irq-safe runtime power management finally allows to turn off
> audio power domain on Exynos5 SoCs.
>
> Removal of irq-safe runtime PM is based on the revert of the following
> commits:
> 1. commit 5c9e6c2b2ba3 "dmaengine: pl330: fix runtime pm support"
> 2. commit 81cc6edc0870 "dmaengine: pl330: Fix hang on dmaengine_terminate_all
>    on certain boards"
> 3. commit ae43b3289186 "ARM: 8202/1: dmaengine: pl330: Add runtime Power
>    Management support v12"
>
> Signed-off-by: Marek Szyprowski <m.szyprowski at samsung.com>
> Reviewed-by: Krzysztof Kozlowski <krzk at kernel.org>
> Acked-by: Arnd Bergmann <arnd at arndb.de>

Nice work!

Reviewed-by: Ulf Hansson <ulf.hansson at linaro.org>

Kind regards
Uffe

> ---
>  drivers/dma/pl330.c | 166 +++++++++++++++++++++-------------------------------
>  1 file changed, 66 insertions(+), 100 deletions(-)
>
> diff --git a/drivers/dma/pl330.c b/drivers/dma/pl330.c
> index c77a3494659c..dff7228198f6 100644
> --- a/drivers/dma/pl330.c
> +++ b/drivers/dma/pl330.c
> @@ -268,9 +268,6 @@ enum pl330_byteswap {
>
>  #define NR_DEFAULT_DESC        16
>
> -/* Delay for runtime PM autosuspend, ms */
> -#define PL330_AUTOSUSPEND_DELAY 20
> -
>  /* Populated by the PL330 core driver for DMA API driver's info */
>  struct pl330_config {
>         u32     periph_id;
> @@ -449,8 +446,8 @@ struct dma_pl330_chan {
>         bool cyclic;
>
>         /* for runtime pm tracking */
> -       bool active;
>         struct device *slave;
> +       struct device_link *slave_link;
>  };
>
>  struct pl330_dmac {
> @@ -2008,7 +2005,6 @@ static void pl330_tasklet(unsigned long data)
>         struct dma_pl330_chan *pch = (struct dma_pl330_chan *)data;
>         struct dma_pl330_desc *desc, *_dt;
>         unsigned long flags;
> -       bool power_down = false;
>
>         spin_lock_irqsave(&pch->lock, flags);
>
> @@ -2023,18 +2019,10 @@ static void pl330_tasklet(unsigned long data)
>         /* Try to submit a req imm. next to the last completed cookie */
>         fill_queue(pch);
>
> -       if (list_empty(&pch->work_list)) {
> -               spin_lock(&pch->thread->dmac->lock);
> -               _stop(pch->thread);
> -               spin_unlock(&pch->thread->dmac->lock);
> -               power_down = true;
> -               pch->active = false;
> -       } else {
> -               /* Make sure the PL330 Channel thread is active */
> -               spin_lock(&pch->thread->dmac->lock);
> -               _start(pch->thread);
> -               spin_unlock(&pch->thread->dmac->lock);
> -       }
> +       /* Make sure the PL330 Channel thread is active */
> +       spin_lock(&pch->thread->dmac->lock);
> +       _start(pch->thread);
> +       spin_unlock(&pch->thread->dmac->lock);
>
>         while (!list_empty(&pch->completed_list)) {
>                 struct dmaengine_desc_callback cb;
> @@ -2047,13 +2035,6 @@ static void pl330_tasklet(unsigned long data)
>                 if (pch->cyclic) {
>                         desc->status = PREP;
>                         list_move_tail(&desc->node, &pch->work_list);
> -                       if (power_down) {
> -                               pch->active = true;
> -                               spin_lock(&pch->thread->dmac->lock);
> -                               _start(pch->thread);
> -                               spin_unlock(&pch->thread->dmac->lock);
> -                               power_down = false;
> -                       }
>                 } else {
>                         desc->status = FREE;
>                         list_move_tail(&desc->node, &pch->dmac->desc_pool);
> @@ -2068,12 +2049,6 @@ static void pl330_tasklet(unsigned long data)
>                 }
>         }
>         spin_unlock_irqrestore(&pch->lock, flags);
> -
> -       /* If work list empty, power down */
> -       if (power_down) {
> -               pm_runtime_mark_last_busy(pch->dmac->ddma.dev);
> -               pm_runtime_put_autosuspend(pch->dmac->ddma.dev);
> -       }
>  }
>
>  static struct dma_chan *of_dma_pl330_xlate(struct of_phandle_args *dma_spec,
> @@ -2105,11 +2080,63 @@ static struct dma_chan *of_dma_pl330_xlate(struct of_phandle_args *dma_spec,
>         return dma_get_slave_channel(&pl330->peripherals[chan_id].chan);
>  }
>
> +static int pl330_add_slave_pm_link(struct pl330_dmac *pl330,
> +                                  struct dma_pl330_chan *pch)
> +{
> +       int i;
> +
> +       /* No slave device means memory-to-memory channels */
> +       if (!pch->slave)
> +               return pm_runtime_get_sync(pl330->ddma.dev);
> +
> +       /*
> +        * No additional locking is needed, {alloc,free}_chan_resources
> +        * are called under dma_list_mutex in dmaengine core
> +        */
> +       for (i = 0; i < pl330->num_peripherals; i++) {
> +               if (pl330->peripherals[i].slave == pch->slave &&
> +                   pl330->peripherals[i].slave_link) {
> +                       pch->slave_link = pl330->peripherals[i].slave_link;
> +                       return 0;
> +               }
> +       }
> +
> +       pch->slave_link = device_link_add(pch->slave, pl330->ddma.dev,
> +                                      DL_FLAG_PM_RUNTIME | DL_FLAG_RPM_ACTIVE);
> +       if (!pch->slave_link)
> +               return -ENODEV;
> +
> +       return 0;
> +}
> +
> +static void pl330_del_slave_pm_link(struct pl330_dmac *pl330,
> +                                   struct dma_pl330_chan *pch)
> +{
> +       struct device_link *link = pch->slave_link;
> +       int i, count = 0;
> +
> +       if (!pch->slave)
> +               pm_runtime_put(pl330->ddma.dev);
> +
> +       for (i = 0; i < pl330->num_peripherals; i++)
> +               if (pl330->peripherals[i].slave_link == link)
> +                       count++;
> +
> +       pch->slave_link = NULL;
> +       if (count == 1)
> +               device_link_del(link);
> +}
> +
>  static int pl330_alloc_chan_resources(struct dma_chan *chan)
>  {
>         struct dma_pl330_chan *pch = to_pchan(chan);
>         struct pl330_dmac *pl330 = pch->dmac;
>         unsigned long flags;
> +       int ret;
> +
> +       ret = pl330_add_slave_pm_link(pl330, pch);
> +       if (ret < 0)
> +               return ret;
>
>         spin_lock_irqsave(&pl330->lock, flags);
>
> @@ -2119,6 +2146,7 @@ static int pl330_alloc_chan_resources(struct dma_chan *chan)
>         pch->thread = pl330_request_channel(pl330);
>         if (!pch->thread) {
>                 spin_unlock_irqrestore(&pl330->lock, flags);
> +               pl330_del_slave_pm_link(pl330, pch);
>                 return -ENOMEM;
>         }
>
> @@ -2160,9 +2188,7 @@ static int pl330_terminate_all(struct dma_chan *chan)
>         unsigned long flags;
>         struct pl330_dmac *pl330 = pch->dmac;
>         LIST_HEAD(list);
> -       bool power_down = false;
>
> -       pm_runtime_get_sync(pl330->ddma.dev);
>         spin_lock_irqsave(&pch->lock, flags);
>         spin_lock(&pl330->lock);
>         _stop(pch->thread);
> @@ -2171,8 +2197,6 @@ static int pl330_terminate_all(struct dma_chan *chan)
>         pch->thread->req[0].desc = NULL;
>         pch->thread->req[1].desc = NULL;
>         pch->thread->req_running = -1;
> -       power_down = pch->active;
> -       pch->active = false;
>
>         /* Mark all desc done */
>         list_for_each_entry(desc, &pch->submitted_list, node) {
> @@ -2189,10 +2213,6 @@ static int pl330_terminate_all(struct dma_chan *chan)
>         list_splice_tail_init(&pch->work_list, &pl330->desc_pool);
>         list_splice_tail_init(&pch->completed_list, &pl330->desc_pool);
>         spin_unlock_irqrestore(&pch->lock, flags);
> -       pm_runtime_mark_last_busy(pl330->ddma.dev);
> -       if (power_down)
> -               pm_runtime_put_autosuspend(pl330->ddma.dev);
> -       pm_runtime_put_autosuspend(pl330->ddma.dev);
>
>         return 0;
>  }
> @@ -2210,7 +2230,6 @@ static int pl330_pause(struct dma_chan *chan)
>         struct pl330_dmac *pl330 = pch->dmac;
>         unsigned long flags;
>
> -       pm_runtime_get_sync(pl330->ddma.dev);
>         spin_lock_irqsave(&pch->lock, flags);
>
>         spin_lock(&pl330->lock);
> @@ -2218,8 +2237,6 @@ static int pl330_pause(struct dma_chan *chan)
>         spin_unlock(&pl330->lock);
>
>         spin_unlock_irqrestore(&pch->lock, flags);
> -       pm_runtime_mark_last_busy(pl330->ddma.dev);
> -       pm_runtime_put_autosuspend(pl330->ddma.dev);
>
>         return 0;
>  }
> @@ -2232,7 +2249,6 @@ static void pl330_free_chan_resources(struct dma_chan *chan)
>
>         tasklet_kill(&pch->task);
>
> -       pm_runtime_get_sync(pch->dmac->ddma.dev);
>         spin_lock_irqsave(&pl330->lock, flags);
>
>         pl330_release_channel(pch->thread);
> @@ -2242,19 +2258,17 @@ static void pl330_free_chan_resources(struct dma_chan *chan)
>                 list_splice_tail_init(&pch->work_list, &pch->dmac->desc_pool);
>
>         spin_unlock_irqrestore(&pl330->lock, flags);
> -       pm_runtime_mark_last_busy(pch->dmac->ddma.dev);
> -       pm_runtime_put_autosuspend(pch->dmac->ddma.dev);
> +
> +       pl330_del_slave_pm_link(pl330, pch);
>  }
>
>  static int pl330_get_current_xferred_count(struct dma_pl330_chan *pch,
>                                            struct dma_pl330_desc *desc)
>  {
>         struct pl330_thread *thrd = pch->thread;
> -       struct pl330_dmac *pl330 = pch->dmac;
>         void __iomem *regs = thrd->dmac->base;
>         u32 val, addr;
>
> -       pm_runtime_get_sync(pl330->ddma.dev);
>         val = addr = 0;
>         if (desc->rqcfg.src_inc) {
>                 val = readl(regs + SA(thrd->id));
> @@ -2263,8 +2277,6 @@ static int pl330_get_current_xferred_count(struct dma_pl330_chan *pch,
>                 val = readl(regs + DA(thrd->id));
>                 addr = desc->px.dst_addr;
>         }
> -       pm_runtime_mark_last_busy(pch->dmac->ddma.dev);
> -       pm_runtime_put_autosuspend(pl330->ddma.dev);
>
>         /* If DMAMOV hasn't finished yet, SAR/DAR can be zero */
>         if (!val)
> @@ -2350,16 +2362,6 @@ static void pl330_issue_pending(struct dma_chan *chan)
>         unsigned long flags;
>
>         spin_lock_irqsave(&pch->lock, flags);
> -       if (list_empty(&pch->work_list)) {
> -               /*
> -                * Warn on nothing pending. Empty submitted_list may
> -                * break our pm_runtime usage counter as it is
> -                * updated on work_list emptiness status.
> -                */
> -               WARN_ON(list_empty(&pch->submitted_list));
> -               pch->active = true;
> -               pm_runtime_get_sync(pch->dmac->ddma.dev);
> -       }
>         list_splice_tail_init(&pch->submitted_list, &pch->work_list);
>         spin_unlock_irqrestore(&pch->lock, flags);
>
> @@ -2787,44 +2789,12 @@ static irqreturn_t pl330_irq_handler(int irq, void *data)
>         BIT(DMA_SLAVE_BUSWIDTH_8_BYTES)
>
>  /*
> - * Runtime PM callbacks are provided by amba/bus.c driver.
> - *
> - * It is assumed here that IRQ safe runtime PM is chosen in probe and amba
> - * bus driver will only disable/enable the clock in runtime PM callbacks.
> + * Runtime PM callbacks are provided by amba/bus.c driver, system sleep
> + * suspend/resume is implemented by generic helpers, which use existing
> + * runtime PM callbacks.
>   */
> -static int __maybe_unused pl330_suspend(struct device *dev)
> -{
> -       struct amba_device *pcdev = to_amba_device(dev);
> -
> -       pm_runtime_disable(dev);
> -
> -       if (!pm_runtime_status_suspended(dev)) {
> -               /* amba did not disable the clock */
> -               amba_pclk_disable(pcdev);
> -       }
> -       amba_pclk_unprepare(pcdev);
> -
> -       return 0;
> -}
> -
> -static int __maybe_unused pl330_resume(struct device *dev)
> -{
> -       struct amba_device *pcdev = to_amba_device(dev);
> -       int ret;
> -
> -       ret = amba_pclk_prepare(pcdev);
> -       if (ret)
> -               return ret;
> -
> -       if (!pm_runtime_status_suspended(dev))
> -               ret = amba_pclk_enable(pcdev);
> -
> -       pm_runtime_enable(dev);
> -
> -       return ret;
> -}
> -
> -static SIMPLE_DEV_PM_OPS(pl330_pm, pl330_suspend, pl330_resume);
> +static SIMPLE_DEV_PM_OPS(pl330_pm, pm_runtime_force_suspend,
> +                        pm_runtime_force_resume);
>
>  static int
>  pl330_probe(struct amba_device *adev, const struct amba_id *id)
> @@ -2977,11 +2947,7 @@ static int __maybe_unused pl330_resume(struct device *dev)
>                 pcfg->data_buf_dep, pcfg->data_bus_width / 8, pcfg->num_chan,
>                 pcfg->num_peri, pcfg->num_events);
>
> -       pm_runtime_irq_safe(&adev->dev);
> -       pm_runtime_use_autosuspend(&adev->dev);
> -       pm_runtime_set_autosuspend_delay(&adev->dev, PL330_AUTOSUSPEND_DELAY);
> -       pm_runtime_mark_last_busy(&adev->dev);
> -       pm_runtime_put_autosuspend(&adev->dev);
> +       pm_runtime_put(&adev->dev);
>
>         return 0;
>  probe_err3:
> --
> 1.9.1
>



More information about the linux-arm-kernel mailing list