Few problems in mtd system

Maxim Levitsky maximlevitsky at gmail.com
Sat Dec 19 12:17:40 EST 2009


On Fri, 2009-12-18 at 00:08 +0200, Maxim Levitsky wrote: 
> Hi,
> 
> I am writing a driver for an xD controller found in my notebook.
> xD card is 99.9% nand chip, and the controller fits into the nand
> subsystem nicely.
> 
> However I face few problems, out of which about half is due to that
> '99.9' and rest I believe  are bugs in the nand/mtd system.
> 
> First of all I design a two layer driver.
> First layer is mtd driver which plugs into nand system.
> Second layer is xD specific FTL that already exists (and works) in R/O
> form in mtd/ssfdc.c
> (Of course you are free to use another FTL or UBI on top of mtd driver,
> but you won't be able to read the card in xd readers)
> 
> I am using 2.6.32 vanilla, so maybe something was already fixed.
> 
> First, the biggest problem is that del_mtd_device fails if the mtd
> device is open. 
> 
> However the user can pull the card out of the slot anytime, so this has
> to be handled gracefully.
> Currently I can't even test for that condition, because nand_release
> discards the return value.
> 
> 
> Secondary, the hardware I deal with supports hardware ecc, and will
> return error bit location if correctable error found.
> Thus I use the 'syndrome' set of ecc functions.
> 
> However, the nand_write_oob_syndrome is broken. It uses NAND_CMD_READ0
> with ecc step size offset (this is 512 or minimum 256 in common cases),
> but this offset can't even be sent over 8 bit bus.
> 
> 
> 
> Another problem I discovered recently is that bad block handling other
> that using bad block table is broken.
> Maybe I should use it too, but scan of whole device takes some
> significant time (~1 sec) and I don't like that.
> 
> However, when I attempt to implement custom .block_bad/.block_markbad, I
> run into trouble about how to read oob from hardware.
> Default functions use lots of static helpers, and only way to to that in
> standard way is to use mtd->read_oob, but that leads to deadlock when
> doing the erase.
> nand_erase_nand does nand_get_device and it tests for bad block which
> end in mtd->read_oob (implemented as nand_read_oob and it does
> nand_get_device too...)
> 
> Or I can use very low level functions, but this skips taking locks, and
> could lead to races.
> 
> Note that although maybe I could avoid the deadlock by using the bad
> block table, I still need these functions to determine/write the bad
> status because there is no bad block table on the flash.
> 
> 
> 
> Yet another problem I found is inside add_mtd_blktrans_dev.
> This function takes mtd_table_mutex but never releases it.
> This sounds fishy, and deadlocks when it calls 'add_disk(gd);'
> This function (I think in case partitions are detected on device) opens
> block device, and this takes that lock too
> (blktrans_open->get_mtd_device)
> 
> 
> 
> And lastly, here are problems due to slight incompatibilities:
> 
> First, its possible to use small page nand (256 bytes/page), but then
> unfortunately oob contents of every odd and even page are 'related'
> (odd page contains ecc of both pages).
> This is very hard to support, and since it only present in SmartMedia,
> and to read such old card you need an adapter (which I not sure is
> possible to buy), I'll skip it.
> 
> 
> Another problem is that chip does report correct ID, but doesn't report
> valid settings in extended ID, thus chips > 128 MB aren't detected.
> Also there is whole class of mask rom devices, and they have non nand
> IDs.
> Not sure its valuable to support mask roms (it is also smartmedia only
> feature too), but erasesize/pagesize have to be hardcoded. 
> I think that best solution it to allow to pass custom list of device IDs
> to nand_scan.
> 
> 
> Last problem I discovered recently is that the chip always reports via
> status command that it is write protected. This is maybe a card bug.
> Small workaround for that in the nand system isn't a problem.
> I will test if that report is really bogus though.
> The hardware has seperate register to check write-protect status, and
> card I suspect has a write-protect seal.
> 
> 
> Currently, with workarounds to above problems, my driver does read the
> xD card (using ssfdc.c)
> 
> Best regards,
> Maxim Levitsky


One small and a bit disappointing note.
I found out that ether controller or the card (I strongly suspect the
later) pokes at OOB contents that are written.

For example writing invalid sector number, results in no write, and the
sector reads random data until card is reinserted, at which point read
returns same contents as before.

Also setting 'reserved' field to anything but 0xFFFFFFFF results in it
being written as 0xFFFFFFFF.

I strongly suspect that xD card I have (1 GB) is not 'real' nand card.
Probably inside the card there is same embedded cpu, just as inside all
other cards. 
It just emulates the nand command set, and thus pokes at oob contents
(and doesn't store it fully ether....)

Will be glad to hear from you about rest of the questions I asked.


Best regards,
Maxim Levitsky





More information about the linux-mtd mailing list