[PATCH v5 2/3] mtd: nand: Qualcomm NAND controller driver

Boris Brezillon boris.brezillon at free-electrons.com
Fri Jan 8 02:31:26 PST 2016


On Fri, 8 Jan 2016 15:53:43 +0530
Archit Taneja <architt at codeaurora.org> wrote:

> 
> 
> On 1/8/2016 1:31 PM, Boris Brezillon wrote:
> > On Fri, 8 Jan 2016 12:03:25 +0530
> > Archit Taneja <architt at codeaurora.org> wrote:
> >
> >> Hi,
> >>
> >> On 1/6/2016 10:35 PM, Boris Brezillon wrote:
> >>> Hi Archit,
> >>>
> >>> On Tue,  5 Jan 2016 10:55:00 +0530
> >>> Archit Taneja <architt at codeaurora.org> wrote:
> >>>
> >>>> The Qualcomm NAND controller is found in SoCs like IPQ806x, MSM7xx,
> >>>> MDM9x15 series.
> >>>>
> >>>> It exists as a sub block inside the IPs EBI2 (External Bus Interface 2)
> >>>> and QPIC (Qualcomm Parallel Interface Controller). These IPs provide a
> >>>> broader interface for external slow peripheral devices such as LCD and
> >>>> NAND/NOR flash memory or SRAM like interfaces.
> >>>>
> >>>> We add support for the NAND controller found within EBI2. For the SoCs
> >>>> of our interest, we only use the NAND controller within EBI2. Therefore,
> >>>> it's safe for us to assume that the NAND controller is a standalone block
> >>>> within the SoC.
> >>>>
> >>>> The controller supports 512B, 2kB, 4kB and 8kB page 8-bit and 16-bit NAND
> >>>> flash devices. It contains a HW ECC block that supports BCH ECC (4, 8 and
> >>>> 16 bit correction/step) and RS ECC(4 bit correction/step) that covers main
> >>>> and spare data. The controller contains an internal 512 byte page buffer
> >>>> to which we read/write via DMA. The EBI2 type NAND controller uses ADM DMA
> >>>> for register read/write and data transfers. The controller performs page
> >>>> reads and writes at a codeword/step level of 512 bytes. It can support up
> >>>> to 2 external chips of different configurations.
> >>>>
> >>>> The driver prepares register read and write configuration descriptors for
> >>>> each codeword, followed by data descriptors to read or write data from the
> >>>> controller's internal buffer. It uses a single ADM DMA channel that we get
> >>>> via dmaengine API. The controller requires 2 ADM CRCIs for command and
> >>>> data flow control. These are passed via DT.
> >>>>
> >>>> The ecc layout used by the controller is syndrome like, but we can't use
> >>>> the standard syndrome ecc ops because of several reasons. First, the amount
> >>>> of data bytes covered by ecc isn't same in each step. Second, writing to
> >>>> free oob space requires us writing to the entire step in which the oob
> >>>> lies. This forces us to create our own ecc ops.
> >>>>
> >>>> One more difference is how the controller accesses the bad block marker.
> >>>> The controller ignores reading the marker when ECC is enabled. ECC needs
> >>>> to be explicity disabled to read or write to the bad block marker. The
> >>>> nand_bbt helpers library hence can't access BBMs for the controller.
> >>>> For now, we skip the creation of BBT and populate chip->block_bad and
> >>>> chip->block_markbad helpers instead.
> >>>>
> >>>> Reviewed-by: Andy Gross <agross at codeaurora.org>
> >>>> Reviewed-by: Stephen Boyd <sboyd at codeaurora.org>
> >>>> Signed-off-by: Stephen Boyd <sboyd at codeaurora.org>
> >>>> Signed-off-by: Archit Taneja <architt at codeaurora.org>
> >>>> ---
> >>>> v5:
> >>>>     - split chip/controller structs
> >>>>     - simplify layout by considering reserved bytes as part of ECC
> >>>>     - create ecc layouts automatically
> >>>>     - implement block_bad and block_markbad chip ops instead of
> >>>>     - read_oob_raw/write_oob_raw ecc ops to access BBMs.
> >>>>     - Add NAND_SKIP_BBTSCAN flag until we get badblockbits support.
> >>>>     - misc clean ups
> >>>>
> >>>> v4:
> >>>>     - Shrink submit_descs
> >>>>     - add desc list node at the end of dma_prep_desc
> >>>>     - Endianness and warning fixes
> >>>>     - Add Stephen's Signed-off since he provided a patch to fix
> >>>>       endianness problems
> >>>>
> >>>> v3:
> >>>>     - Refactor dma functions for maximum reuse
> >>>>     - Use dma_slave_confing on stack
> >>>>     - optimize and clean upempty_page_fixup using memchr_inv
> >>>>     - ensure portability with dma register reads using le32_* funcs
> >>>>     - use NAND_USE_BOUNCE_BUFFER instead of doing it ourselves
> >>>>     - fix handling of return values of dmaengine funcs
> >>>>     - constify wherever possible
> >>>>     - Remove dependency on ADM DMA in Kconfig
> >>>>     - Misc fixes and clean ups
> >>>>
> >>>> v2:
> >>>>     - Use new BBT flag that allows us to read BBM in raw mode
> >>>>     - reduce memcpy-s in the driver
> >>>>     - some refactor and clean ups because of above changes
> >>>>
> >>>>    drivers/mtd/nand/Kconfig      |    7 +
> >>>>    drivers/mtd/nand/Makefile     |    1 +
> >>>>    drivers/mtd/nand/qcom_nandc.c | 1981 +++++++++++++++++++++++++++++++++++++++++
> >>>>    3 files changed, 1989 insertions(+)
> >>>>    create mode 100644 drivers/mtd/nand/qcom_nandc.c
> >>>>
> >>>> diff --git a/drivers/mtd/nand/Kconfig b/drivers/mtd/nand/Kconfig
> >>>> index 95b8d2b..2fccdfb 100644
> >>>> --- a/drivers/mtd/nand/Kconfig
> >>>> +++ b/drivers/mtd/nand/Kconfig
> >>>> @@ -546,4 +546,11 @@ config MTD_NAND_HISI504
> >>>>    	help
> >>>>    	  Enables support for NAND controller on Hisilicon SoC Hip04.
> >>>>
> >>>> +config MTD_NAND_QCOM
> >>>> +	tristate "Support for NAND on QCOM SoCs"
> >>>> +	depends on ARCH_QCOM
> >>>> +	help
> >>>> +	  Enables support for NAND flash chips on SoCs containing the EBI2 NAND
> >>>> +	  controller. This controller is found on IPQ806x SoC.
> >>>> +
> >>>>    endif # MTD_NAND
> >>>> diff --git a/drivers/mtd/nand/Makefile b/drivers/mtd/nand/Makefile
> >>>> index 2c7f014..9450cdc 100644
> >>>> --- a/drivers/mtd/nand/Makefile
> >>>> +++ b/drivers/mtd/nand/Makefile
> >>>> @@ -55,5 +55,6 @@ obj-$(CONFIG_MTD_NAND_BCM47XXNFLASH)	+= bcm47xxnflash/
> >>>>    obj-$(CONFIG_MTD_NAND_SUNXI)		+= sunxi_nand.o
> >>>>    obj-$(CONFIG_MTD_NAND_HISI504)	        += hisi504_nand.o
> >>>>    obj-$(CONFIG_MTD_NAND_BRCMNAND)		+= brcmnand/
> >>>> +obj-$(CONFIG_MTD_NAND_QCOM)		+= qcom_nandc.o
> >>>>
> >>>>    nand-objs := nand_base.o nand_bbt.o nand_timings.o
> >>>> diff --git a/drivers/mtd/nand/qcom_nandc.c b/drivers/mtd/nand/qcom_nandc.c
> >>>> new file mode 100644
> >>>> index 0000000..a4dd922
> >>>> --- /dev/null
> >>>> +++ b/drivers/mtd/nand/qcom_nandc.c
> >
> > [...]
> >
> >>>
> >>>> +
> >>>> +/*
> >>>> + * when using RS ECC, the NAND controller flags an error when reading an
> >>>> + * erased page. however, there are special characters at certain offsets when
> >>>> + * we read the erased page. we check here if the page is really empty. if so,
> >>>> + * we replace the magic characters with 0xffs
> >>>> + */
> >>>> +static bool empty_page_fixup(struct qcom_nand_host *host, u8 *data_buf)
> >>>> +{
> >>>> +	struct qcom_nand_controller *nandc = host->nandc;
> >>>> +	struct nand_chip *chip = &host->chip;
> >>>> +	struct mtd_info *mtd = nand_to_mtd(chip);
> >>>> +	int cwperpage = chip->ecc.steps;
> >>>> +	u8 orig1[MAX_NUM_STEPS], orig2[MAX_NUM_STEPS];
> >>>> +	int i, j;
> >>>> +
> >>>> +	/* if BCH is enabled, HW will take care of detecting erased pages */
> >>>> +	if (host->bch_enabled || !host->use_ecc)
> >>>> +		return false;
> >>>> +
> >>>> +	for (i = 0; i < cwperpage; i++) {
> >>>> +		u8 *empty1, *empty2;
> >>>> +		u32 flash_status = le32_to_cpu(nandc->reg_read_buf[3 * i]);
> >>>> +
> >>>> +		/*
> >>>> +		 * an erased page flags an error in NAND_FLASH_STATUS, check if
> >>>> +		 * the page is erased by looking for 0x54s at offsets 3 and 175
> >>>> +		 * from the beginning of each codeword
> >>>> +		 */
> >>>> +		if (!(flash_status & FS_OP_ERR))
> >>>> +			break;
> >>>> +
> >>>> +		empty1 = &data_buf[3 + i * host->cw_data];
> >>>> +		empty2 = &data_buf[175 + i * host->cw_data];
> >>>> +
> >>>> +		/*
> >>>> +		 * if the error wasn't because of an erased page, bail out and
> >>>> +		 * and let someone else do the error checking
> >>>> +		 */
> >>>> +		if ((*empty1 == 0x54 && *empty2 == 0xff) ||
> >>>> +				(*empty1 == 0xff && *empty2 == 0x54)) {
> >>>> +			orig1[i] = *empty1;
> >>>> +			orig2[i] = *empty2;
> >>>> +
> >>>> +			*empty1 = 0xff;
> >>>> +			*empty2 = 0xff;
> >>>> +		} else {
> >>>> +			break;
> >>>> +		}
> >>>> +	}
> >>>> +
> >>>> +	if (i < cwperpage || memchr_inv(data_buf, 0xff, mtd->writesize))
> >>>> +		goto not_empty;
> >>>> +
> >>>> +	/*
> >>>> +	 * tell the caller that the page was empty and is fixed up, so that
> >>>> +	 * parse_read_errors() doesn't think it's an error
> >>>> +	 */
> >>>> +	return true;
> >>>> +
> >>>> +not_empty:
> >>>> +	/* restore original values if not empty*/
> >>>> +	for (j = 0; j < i; j++) {
> >>>> +		data_buf[3 + j * host->cw_data] = orig1[j];
> >>>> +		data_buf[175 + j * host->cw_data] = orig2[j];
> >>>> +	}
> >>>> +
> >>>> +	return false;
> >>>> +}
> >>>
> >>> This empty page detection seems a bit complicated to me. Could you
> >>> consider using nand_check_erased_ecc_chunk() to check is the chunk is
> >>> containing 0xff data instead of implementing your own logic?
> >>
> >> We wouldn't be able to use nand_check_erased_ecc_chunk directly because
> >> there are certain bytes in each chunk that are intentionally not 0xffs.
> >>
> >> But I could make it a two step process, where I first override those
> >> magic bytes with 0xffs, and then use nand_check_erased_ecc_chunk. That
> >> may not reduce tons of code, but would atleast consider possibility
> >> of bitflips in erased pages, which the driver currently doesn't do.
> >
> > Too bad your ECC engines decides to fix some bits even when it cannot
> > fix all of them. Are you sure there's no flag to disable this behavior?
> > Usually ECC engines just flag the data chunk as uncorrectable and leave
> > its data untouched.
> 
> I don't think the engine is actually writing to the chunk at any
> point when reading an erased chunk.
> 
> This whole method adopted by the engine is just a way of flagging
> to us that it read an erased chunk. Once it reads an erased chunk, it
> flags an error in one of the controller registers, and replaces
> offsets at 3 and 175 in its internal buffer.
> 
> I am not sure why the HW didn't just provide another bit field which
> notifies that it it read an erased page. This is what is done in the
> future version of this controller, and we don't need to any of this
> stuff.

Ok, if that's really the case, and those pattern are just a way to
notify the software that it has detected an empty page, then I'm fine
with this implementation.
Just add an extra nand_check_erased_ecc_chunk() call (as you suggested)
if this function returns false, so that you can handle bitflips in
erased pages.

Best Regards,

Boris


-- 
Boris Brezillon, Free Electrons
Embedded Linux and Kernel engineering
http://free-electrons.com



More information about the linux-mtd mailing list