[PATCH] mtd: nand: omap: Fix BCH bit correction
Sascha Hauer
s.hauer at pengutronix.de
Tue Jun 6 23:53:36 PDT 2017
On Wed, Jun 07, 2017 at 08:49:09AM +0200, Sascha Hauer wrote:
> On Wed, Jun 07, 2017 at 08:45:08AM +0200, Sascha Hauer wrote:
> > +Cc Matt Reimer <mreimer at sdgsystems.com>
> >
> > On Tue, Jun 06, 2017 at 06:10:25PM +0200, Daniel Schultz wrote:
> > > After commit dec7b4d2bf9 was applied our barebox only corrected the
> > > first 512 Bytes of NAND pages.
> > >
> > > This patch separates between Hamming and BCH when finding out the
> > > eccsteps, because BCH always works with 2kB pages.
> > >
> > > Before this patch:
> > >
> > > barebox at Phytec phyCORE AM335x:/ nand_bitflip -r -n 5 /dev/nand0.barebox
> > > nand0.barebox: Flipping bit 5 @ 1796
> > > nand0.barebox: Flipping bit 6 @ 1258
> > > nand0.barebox: Flipping bit 5 @ 1062
> > > nand0.barebox: Flipping bit 2 @ 1399
> > > nand0.barebox: Flipping bit 6 @ 1243
> > > No bitflips found on block 0, offset 0x00000000
> > > barebox at Phytec phyCORE AM335x:/ nand_bitflip -r -n 5 /dev/nand0.barebox
> > > nand0.barebox: Flipping bit 2 @ 872
> > > nand0.barebox: Flipping bit 4 @ 252
> > > nand0.barebox: Flipping bit 3 @ 568
> > > nand0.barebox: Flipping bit 2 @ 247
> > > nand0.barebox: Flipping bit 5 @ 401
> > > page at block 0, offset 0x00000000 has 3 bitflips
> > >
> > > After this patch:
> > >
> > > barebox at Phytec phyCORE AM335x:/ nand_bitflip -r -n 5 /dev/nand0.barebox
> > > nand0.barebox: Flipping bit 2 @ 1962
> > > nand0.barebox: Flipping bit 0 @ 1563
> > > nand0.barebox: Flipping bit 0 @ 1808
> > > nand0.barebox: Flipping bit 6 @ 1460
> > > nand0.barebox: Flipping bit 7 @ 2034
> > > page at block 0, offset 0x00000000 has 5 bitflips
> > > barebox at Phytec phyCORE AM335x:/ nand_bitflip -r -n 5 /dev/nand0.barebox
> > > nand0.barebox: Flipping bit 1 @ 1352
> > > nand0.barebox: Flipping bit 7 @ 1542
> > > nand0.barebox: Flipping bit 2 @ 1021
> > > nand0.barebox: Flipping bit 7 @ 691
> > > nand0.barebox: Flipping bit 6 @ 1196
> > > page at block 0, offset 0x00000000 has 10 bitflips, needs cleanup
> > >
> > > Signed-off-by: Daniel Schultz <d.schultz at phytec.de>
> > > ---
> > > drivers/mtd/nand/nand_omap_gpmc.c | 11 +++++++++--
> > > 1 file changed, 9 insertions(+), 2 deletions(-)
> > >
> > > diff --git a/drivers/mtd/nand/nand_omap_gpmc.c b/drivers/mtd/nand/nand_omap_gpmc.c
> > > index 05c8486..61220da 100644
> > > --- a/drivers/mtd/nand/nand_omap_gpmc.c
> > > +++ b/drivers/mtd/nand/nand_omap_gpmc.c
> > > @@ -302,10 +302,17 @@ static int omap_correct_bch(struct mtd_info *mtd, uint8_t *dat,
> > > unsigned int err_loc[8];
> > > int bitflip_count;
> > > int bch_max_err;
> > > + int eccsteps;
> > >
> > > - int eccsteps = (nand->ecc.mode == NAND_ECC_HW) &&
> > > - (nand->ecc.size == 2048) ? 4 : 1;
> > > int eccsize = oinfo->nand.ecc.bytes;
> > > + if (oinfo->ecc_mode == OMAP_ECC_HAMMING_CODE_HW_ROMCODE)
> >
> > This is wrong. When in Hamming ECC mode you shouldn't get into this
> > function. The test should always fail.
> >
> > > + if ((nand->ecc.mode == NAND_ECC_HW) &&
> > > + (nand->ecc.size == 2048))
> > > + eccsteps = 4;
> > > + else
> > > + eccsteps = 1;
> >
> > The question is why ecc.size is set to the wrong value in the first
> > place:
> >
> > case OMAP_ECC_BCH8_CODE_HW:
> > ...
> > oinfo->nand.ecc.size = 512 * 4;
> >
> > This seems to be wrong. The BCH controller works in 512 Byte chunks, so
> > ecc.size should be 512. This would make the special cases in
> > omap_correct_bch() unnecessary.
> >
> > In dec7b4d2bf9 Matt said:
> >
> > | The fix is to pull over a bit of code from the kernel's
> > | omap_correct_data() that sets eccsteps = 4 when the page size is 2048
> > | bytes and hardware ECC is being used.
> >
> > In fact, this piece is in the kernel code:
> >
> > /* Ex NAND_ECC_HW12_2048 */
> > if ((info->nand.ecc.mode == NAND_ECC_HW) &&
> > (info->nand.ecc.size == 2048))
> > blockCnt = 4;
> > else
> > blockCnt = 1;
> >
> > I just suspect this is never used, because ecc.size is correctly set to 512 in
> > all cases. Then ecc.steps results in 4 for 2k page sizes and the framework correctly
> > iterates over the ecc steps.
> >
> > Please give the attached test a try. It's completely untested.
>
> And can not work. Additionally eccsteps must be set to 1 in
> omap_correct_bch(). This effectively makes the loop in this function
> unnecessary which can then removed.
Which then means omap_gpmc_read_page_bch_rom_mode() has to iterate over
ecc.steps itself, just like the other read_page implementations in the
framework do.
So long, enough of self-replying for now ;)
Sascha
--
Pengutronix e.K. | |
Industrial Linux Solutions | http://www.pengutronix.de/ |
Peiner Str. 6-8, 31137 Hildesheim, Germany | Phone: +49-5121-206917-0 |
Amtsgericht Hildesheim, HRA 2686 | Fax: +49-5121-206917-5555 |
More information about the barebox
mailing list