[PATCH 01/10] dma: sun4i: Add support for the DMA engine on sun[457]i SoCs
Maxime Ripard
maxime.ripard at free-electrons.com
Sun Jun 29 06:23:51 PDT 2014
On Wed, Jun 25, 2014 at 07:46:54PM -0300, Emilio López wrote:
> Hi Maxime,
>
> [I have not replied to every single comment; you can assume I fixed
> all the missing parentheses, spaces and comment style issues you
> pointed out.]
>
> El 25/06/14 15:42, Maxime Ripard escribió:
> >On Mon, Jun 16, 2014 at 12:50:26AM -0300, Emilio López wrote:
> >>This patch adds support for the DMA engine present on Allwinner A10,
> >>A13, A10S and A20 SoCs. This engine has two kinds of channels: normal
> >>and dedicated. The main difference is in the mode of operation;
> >>while a single normal channel may be operating at any given time,
> >>dedicated channels may operate simultaneously provided there is no
> >>overlap of source or destination.
> >>
> >>Hardware documentation can be found on A10 User Manual (section 12), A13
> >>User Manual (section 14) and A20 User Manual (section 1.12)
> >>
> >>Signed-off-by: Emilio López <emilio at elopez.com.ar>
> >>---
> (...)
> >>diff --git a/drivers/dma/Kconfig b/drivers/dma/Kconfig
> >>index ba06d1d..a9ee0c9 100644
> >>--- a/drivers/dma/Kconfig
> >>+++ b/drivers/dma/Kconfig
> >>@@ -361,6 +361,16 @@ config FSL_EDMA
> >> multiplexing capability for DMA request sources(slot).
> >> This module can be found on Freescale Vybrid and LS-1 SoCs.
> >>
> >>+config SUN4I_DMA
> >>+ tristate "Allwinner A10/A10S/A13/A20 DMA support"
> >>+ depends on ARCH_SUNXI
> >
> >MACH_SUN4I || MACH_SUN5I || MACH_SUN7I ?
> >
> >That would probably be a good idea to add COMPILE_TEST to the list
> >too.
>
> Yes, now that they're split I'll change it and add COMPILE_TEST.
If you're using writel_relaxed, then forget about
COMPILE_TEST. *_relaxed accessors are not standard one, and are not
defined on all the architectures.
>
> >>+ select DMA_ENGINE
> >>+ select DMA_OF
> >>+ select DMA_VIRTUAL_CHANNELS
> >>+ help
> >>+ Enable support for the DMA controller present in the sun4i,
> >>+ sun5i and sun7i Allwinner ARM SoCs.
> >>+
> >> config DMA_ENGINE
> >> bool
> >>
> >>diff --git a/drivers/dma/Makefile b/drivers/dma/Makefile
> >>index 5150c82..13a7d5d 100644
> >>--- a/drivers/dma/Makefile
> >>+++ b/drivers/dma/Makefile
> >>@@ -46,3 +46,4 @@ obj-$(CONFIG_K3_DMA) += k3dma.o
> >> obj-$(CONFIG_MOXART_DMA) += moxart-dma.o
> >> obj-$(CONFIG_FSL_EDMA) += fsl-edma.o
> >> obj-$(CONFIG_QCOM_BAM_DMA) += qcom_bam_dma.o
> >>+obj-$(CONFIG_SUN4I_DMA) += sun4i-dma.o
> >>diff --git a/drivers/dma/sun4i-dma.c b/drivers/dma/sun4i-dma.c
> >>new file mode 100644
> >>index 0000000..0b14b3f
> >>--- /dev/null
> >>+++ b/drivers/dma/sun4i-dma.c
> >>@@ -0,0 +1,1065 @@
> >>+/*
> >>+ * Copyright (C) 2014 Emilio López
> >>+ * Emilio López <emilio at elopez.com.ar>
> >>+ *
> >>+ * This program is free software; you can redistribute it and/or modify
> >>+ * it under the terms of the GNU General Public License as published by
> >>+ * the Free Software Foundation; either version 2 of the License, or
> >>+ * (at your option) any later version.
> >>+ */
> >>+
> >>+#include <linux/bitmap.h>
> >>+#include <linux/bitops.h>
> >>+#include <linux/clk.h>
> >>+#include <linux/dmaengine.h>
> >>+#include <linux/dmapool.h>
> >>+#include <linux/interrupt.h>
> >>+#include <linux/module.h>
> >>+#include <linux/of_dma.h>
> >>+#include <linux/platform_device.h>
> >>+#include <linux/slab.h>
> >>+#include <linux/spinlock.h>
> >>+
> >>+#include "virt-dma.h"
> >>+
> >>+/** General DMA register values **/
> >>+
> >>+/* DMA source/destination burst length values */
> >>+#define DMA_BURST_LENGTH_1 0
> >>+#define DMA_BURST_LENGTH_4 1
> >>+#define DMA_BURST_LENGTH_8 2
> >
> >An enum maybe?
> >
> >You're not using this anywhere though.
> >
> >>+/* DMA source/destination data width */
> >>+#define DMA_DATA_WIDTH_8BIT 0
> >>+#define DMA_DATA_WIDTH_16BIT 1
> >>+#define DMA_DATA_WIDTH_32BIT 2
> >
> >And you're not using this either.
>
> As discussed on IRC, I'll drop the unused #defines
>
> (...)
>
> >>+
> >>+/* Dedicated DMA parameter register layout */
> >>+#define DDMA_PARA_DEST_DATA_BLK_SIZE(n) (n-1 << 24)
> >>+#define DDMA_PARA_DEST_WAIT_CYCLES(n) (n-1 << 16)
> >>+#define DDMA_PARA_SRC_DATA_BLK_SIZE(n) (n-1 << 8)
> >>+#define DDMA_PARA_SRC_WAIT_CYCLES(n) (n-1 << 0)
> >
> >Since the minus operations has precedence over the shift, I wonder how
> >this can work.
> >
> >(plus, parenthesis around n, and spaces around the minus)
>
> It works because it's not used :)
>
> (...)
>
> >
> >>+
> >>+/** DMA Driver **/
> >>+
> >>+/* Normal DMA has 8 channels, and Dedicated DMA has another 8, so that's
> >>+ * 16 channels. As for endpoints, there's 29 and 21 respectively. Given
> >>+ * that the Normal DMA endpoints can be used as tx/rx, we need 79 vchans
> >>+ * in total
> >>+ */
> >>+#define NDMA_NR_MAX_CHANNELS 8
> >>+#define DDMA_NR_MAX_CHANNELS 8
> >>+#define DMA_NR_MAX_CHANNELS (NDMA_NR_MAX_CHANNELS + DDMA_NR_MAX_CHANNELS)
> >>+#define NDMA_NR_MAX_VCHANS (29*2)
> >
> >I'm counting 29 + 28
>
> I just counted them again, there's 29 on NDMA, and you may want to
> read from or write to them, so 29*2. I could drop one to compensate
> mem2mem being counted twice though, if we want to be really exact
> with this.
Ok.
>
> >>+#define DDMA_NR_MAX_VCHANS 21
> >>+#define DMA_NR_MAX_VCHANS (NDMA_NR_MAX_VCHANS + DDMA_NR_MAX_VCHANS)
> >>+
> >>+struct sun4i_dma_pchan {
> >>+ /* Register base of channel */
> >>+ void __iomem *base;
> >>+ /* vchan currently being serviced */
> >>+ struct sun4i_dma_vchan *vchan;
> >>+ /* Is this a dedicated pchan? */
> >>+ int is_dedicated;
> >>+};
> >>+
> >>+struct sun4i_dma_vchan {
> >>+ struct virt_dma_chan vc;
> >>+ struct dma_slave_config cfg;
> >>+ struct sun4i_dma_pchan *pchan;
> >>+ struct sun4i_dma_promise *processing;
> >>+ struct sun4i_dma_contract *contract;
> >>+ u8 endpoint;
> >>+ int is_dedicated;
> >>+};
> >>+
> >>+struct sun4i_dma_promise {
> >>+ u32 cfg;
> >>+ u32 para;
> >>+ dma_addr_t src;
> >>+ dma_addr_t dst;
> >>+ size_t len;
> >>+ struct list_head list;
> >>+};
> >>+
> >>+/* A contract is a set of promises */
> >>+struct sun4i_dma_contract {
> >>+ struct virt_dma_desc vd;
> >>+ struct list_head demands;
> >>+ struct list_head completed_demands;
> >>+};
> >>+
> >>+struct sun4i_dma_dev {
> >>+ DECLARE_BITMAP(pchans_used, DDMA_NR_MAX_CHANNELS);
> >>+ struct tasklet_struct tasklet;
> >>+ struct dma_device slave;
> >>+ struct sun4i_dma_pchan *pchans;
> >>+ struct sun4i_dma_vchan *vchans;
> >>+ void __iomem *base;
> >>+ struct clk *clk;
> >>+ int irq;
> >>+ spinlock_t lock;
> >>+};
> >>+
> >>+static struct sun4i_dma_dev *to_sun4i_dma_dev(struct dma_device *dev)
> >>+{
> >>+ return container_of(dev, struct sun4i_dma_dev, slave);
> >>+}
> >>+
> >>+static struct sun4i_dma_vchan *to_sun4i_dma_vchan(struct dma_chan *chan)
> >>+{
> >>+ return container_of(chan, struct sun4i_dma_vchan, vc.chan);
> >>+}
> >>+
> >>+static struct sun4i_dma_contract *to_sun4i_dma_contract(struct virt_dma_desc *vd)
> >>+{
> >>+ return container_of(vd, struct sun4i_dma_contract, vd);
> >>+}
> >>+
> >>+static struct device *chan2dev(struct dma_chan *chan)
> >>+{
> >>+ return &chan->dev->device;
> >>+}
> >>+
> >>+static int convert_burst(u32 maxburst)
> >>+{
> >>+ if (maxburst > 8)
> >>+ maxburst = 8;
> >
> >returning an error would be better here.
>
> Ok, I'll do that.
>
> >>+
> >>+ /* 1 -> 0, 4 -> 1, 8 -> 2 */
> >
> >4 seems to be an invalid value on the A20
>
> They define it on the SDK DMA driver though
>
> https://github.com/linux-sunxi/linux-sunxi/blob/sunxi-3.4/arch/arm/mach-sun7i/include/mach/dma.h#L38
>
> And actually use it on the sound codec driver, among other parts
>
> https://github.com/linux-sunxi/linux-sunxi/blob/sunxi-3.4/sound/soc/sunxi/sunxi-codec.c#L1143
>
> So I would prefer to keep it, unless we hear it's actually not
> supported from Allwinner themselves.
Hmmm, weird. Ok.
>
> >
> >>+ return (maxburst >> 2);
> >>+}
> >>+
> >>+static int convert_buswidth(enum dma_slave_buswidth addr_width)
> >>+{
> >>+ if (addr_width > DMA_SLAVE_BUSWIDTH_4_BYTES)
> >>+ return -EINVAL;
> >
> >especially if you're returning one here.
> >
> >>+
> >>+ /* 8 -> 0, 16 -> 1, 32 -> 2 */
> >
> >16 seems to be an invalid value on the A20
>
> Ditto
>
> >
> >>+ return (addr_width >> 4);
> >>+}
> >>+
> (...)
> >>+static void configure_pchan(struct sun4i_dma_pchan *pchan,
> >>+ struct sun4i_dma_promise *d)
> >>+{
> >>+ if (pchan->is_dedicated) {
> >>+ /* Configure addresses and misc parameters */
> >>+ writel_relaxed(d->src, pchan->base + DDMA_SRC_ADDR_REG);
> >>+ writel_relaxed(d->dst, pchan->base + DDMA_DEST_ADDR_REG);
> >>+ writel_relaxed(d->len, pchan->base + DDMA_BYTE_COUNT_REG);
> >>+ writel_relaxed(d->para, pchan->base + DDMA_PARA_REG);
> >>+
> >>+ /* We use a writel here because CFG_LOADING may be set,
> >>+ * and it requires that the rest of the configuration
> >>+ * takes place before the engine is started */
> >
> >You should be ok here.
> >
> >See http://permalink.gmane.org/gmane.linux.ports.arm.kernel/117640
> >
> >>+ writel(d->cfg, pchan->base + DDMA_CFG_REG);
>
> Ok, I've switched this to writel_relaxed as well after the
> explanation on IRC.
>
> >>+ } else {
> >>+ /* Configure addresses and misc parameters */
> >>+ writel_relaxed(d->src, pchan->base + NDMA_SRC_ADDR_REG);
> >>+ writel_relaxed(d->dst, pchan->base + NDMA_DEST_ADDR_REG);
> >>+ writel_relaxed(d->len, pchan->base + NDMA_BYTE_COUNT_REG);
> (...)
> >>+static struct dma_async_tx_descriptor *
> >>+sun4i_dma_prep_dma_memcpy(struct dma_chan *chan, dma_addr_t dest,
> >>+ dma_addr_t src, size_t len, unsigned long flags)
> >>+{
> >>+ struct sun4i_dma_vchan *vchan = to_sun4i_dma_vchan(chan);
> >>+ struct dma_slave_config *sconfig = &vchan->cfg;
> >>+ struct sun4i_dma_promise *promise;
> >>+ struct sun4i_dma_contract *contract;
> >>+
> >>+ contract = generate_dma_contract();
> >>+ if (!contract)
> >>+ return NULL;
> >>+
> >>+ if (vchan->is_dedicated)
> >>+ promise = generate_ddma_promise(chan, src, dest, len, sconfig);
> >>+ else
> >>+ promise = generate_ndma_promise(chan, src, dest, len, sconfig);
> >>+
> >>+ if (!promise) {
> >>+ kfree(contract);
> >>+ return NULL;
> >>+ }
> >>+
> >>+ /* Configure memcpy mode */
> >>+ if (vchan->is_dedicated) {
> >>+ promise->cfg |= DDMA_CFG_SRC_DRQ_TYPE(DDMA_DRQ_TYPE_SDRAM) |
> >>+ DDMA_CFG_SRC_NON_SECURE |
> >>+ DDMA_CFG_DEST_DRQ_TYPE(DDMA_DRQ_TYPE_SDRAM) |
> >>+ DDMA_CFG_DEST_NON_SECURE;
> >>+ } else {
> >>+ promise->cfg |= NDMA_CFG_SRC_DRQ_TYPE(NDMA_DRQ_TYPE_SDRAM) |
> >>+ NDMA_CFG_SRC_NON_SECURE |
> >>+ NDMA_CFG_DEST_DRQ_TYPE(NDMA_DRQ_TYPE_SDRAM) |
> >>+ NDMA_CFG_DEST_NON_SECURE;
> >
> >Hmm, are you sure about that non-secure? Depending on the mode the
> >kernel execute in, wouldn't that change?
>
> dmatest seems to be happy either way on my A20. It's not clear to me
> from the documentation what this flag does, so I suppose I can just
> drop it for now and we can worry about it in the future if it turns
> out we need it for something.
Even when you're starting the kernel itself in secure and !secure?
>
> >>+static enum dma_status sun4i_dma_tx_status(struct dma_chan *chan,
> >>+ dma_cookie_t cookie,
> >>+ struct dma_tx_state *state)
> >>+{
> >>+ struct sun4i_dma_vchan *vchan = to_sun4i_dma_vchan(chan);
> >>+ struct sun4i_dma_pchan *pchan = vchan->pchan;
> >>+ struct sun4i_dma_contract *contract;
> >>+ struct sun4i_dma_promise *promise = NULL;
> >>+ struct virt_dma_desc *vd;
> >>+ unsigned long flags;
> >>+ enum dma_status ret;
> >>+ size_t bytes = 0;
> >>+
> >>+ ret = dma_cookie_status(chan, cookie, state);
> >>+ if (ret == DMA_COMPLETE)
> >>+ return ret;
> >>+
> >>+ spin_lock_irqsave(&vchan->vc.lock, flags);
> >>+ vd = vchan_find_desc(&vchan->vc, cookie);
> >>+ if (!vd) /* TODO */
> >
> >TODO?
>
> I don't actually recall what was left to do here, I should've
> written a better comment :|
>
> (...)
> >>+static int sun4i_dma_remove(struct platform_device *pdev)
> >>+{
> >>+ struct sun4i_dma_dev *priv = platform_get_drvdata(pdev);
> >>+
> >>+ /* Disable IRQ so the tasklet doesn't schedule any longer, then
> >>+ * kill it */
> >>+ disable_irq(priv->irq);
> >>+ tasklet_kill(&priv->tasklet);
> >
> >You might still have your tasklet pending to be scheduled. This is not
> >the proper way to bail out from a tasklet.
> >
> >See https://lwn.net/Articles/588457/
>
> As we talked on IRC, the tasklet does not reschedule itself, and
> after disabling the interrupt, there should be no way for it to get
> rescheduled, so I think calling task_kill should be ok.
>
> >>+ of_dma_controller_free(pdev->dev.of_node);
> >>+ dma_async_device_unregister(&priv->slave);
> >>+
> >>+ clk_disable_unprepare(priv->clk);
> >>+
> >>+ return 0;
> >>+}
> >>+
> >>+static struct of_device_id sun4i_dma_match[] = {
> >>+ { .compatible = "allwinner,sun4i-a10-dma" }
> >
> >The two IPs seem to differ from A10 to A20. Maybe it would be great to
> >introduce several compatibles here?
>
> I'm ok with introducing several compatibles, but as far as I can
> tell the IP is the same - I have not needed to add any conditionals
> depending on the SoC or anything.
Ok.
Maxime
--
Maxime Ripard, Free Electrons
Embedded Linux, Kernel and Android engineering
http://free-electrons.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: Digital signature
URL: <http://lists.infradead.org/pipermail/linux-arm-kernel/attachments/20140629/881ed126/attachment-0001.sig>
More information about the linux-arm-kernel
mailing list