UBIFS failure & stable page writes

Mats Kärrman Mats.Karrman at tritech.se
Thu Jun 13 06:54:15 EDT 2013


Hi!

Joining in from thread "UBIFS Orphaned inode 1" since it's the same problem.
This is an orphans block from one of my crashed units:

=== Orphan nodes (LEB 9) ===

$ dd if=/dev/ubi0_0 bs=1 count=130944 skip=1178496 |hd
00000000  31 18 10 06 a9 ec d2 ed  c8 f6 6a 01 00 00 00 00  |1.........j.....|
00000010  30 00 00 00 0b 00 00 00  a4 06 00 00 00 00 00 80  |0...............|
00000020  14 72 12 00 00 00 00 00  01 00 00 00 00 00 00 00  |.r..............|
00000030  ff ff ff ff ff ff ff ff  ff ff ff ff ff ff ff ff  |................|
*
0001ff80

Decoded:
31 18 10 06 - UBIFS_NODE_MAGIC
a9 ec d2 ed - CRC
c8 f6 6a 01 00 00 00 00 - Sequence number
30 00 00 00 - node length = 48 bytes
0b - node type = Orphan node
00 - group type = Not part of group
00 00 - padding
a4 06 00 00 00 00 00 80 - commit number = 0x6a4, last node flag = set
14 72 12 00 00 00 00 00 - inode num = 0x127214
01 00 00 00 00 00 00 00 - inode num = 1 !!!

I have v2.6.35 with a lot of patches from Artem's maintenance tree and some more from the later developments.
I diffed my fs/ubifs to the files from Antion Prins and they are almost identical, mostly debug print changes and some small things that I think are unrelated.

I have not been able to reproduce the error but haven't got the resources to set up 100 test units. So far a hand-full of units has crashed in the field.

BR // Mats


________________________________________
From: linux-mtd [linux-mtd-bounces at lists.infradead.org] on behalf of Prins Anton (ST-CO/ENG1.1) [Anton.Prins at nl.bosch.com]
Sent: Wednesday, June 12, 2013 3:57 PM
To: Adrian Hunter
Cc: linux-mtd at lists.infradead.org; dedekind1 at gmail.com
Subject: RE: UBIFS failure & stable page writes

Ofcourse

Met vriendelijke groeten | Best Regards,
Anton Prins

Bosch Security Systems BV,
Conference Systems (ST-CO/ENG1.1)
Torenallee 49
5617 BA  Eindhoven
The Netherlands
www.boschsecurity.com
T. +31 (0)40 2577077
anton.prins at nl.bosch.com


-----Original Message-----
From: Adrian Hunter [mailto:adrian.hunter at intel.com]
Sent: woensdag 12 juni 2013 15:10
To: Prins Anton (ST-CO/ENG1.1)
Cc: dedekind1 at gmail.com; linux-mtd at lists.infradead.org
Subject: Re: UBIFS failure & stable page writes

Can you tar and send fs/ubifs directory?

On 12/06/13 14:57, Prins Anton (ST-CO/ENG1.1) wrote:
> Ok clear!
>
> Regarding mount debug: I suppose I only will see this on the 'first' boot after the problem is in the persistence storage?
> (Up to now I'm not able to signal the point-of-failure... I'm only looking to an (possibly) after recovery situation).
>
> Or makes this sense to do this mount debug on the existing failing device?
>
> FYI:
> - with 100 devices power cycling every 5 minutes for a weekend we DON'T see a problem.
> - with 100 devices powered on for a weekend and after that give a single power-cycle shows maybe 1 or 2 failing devices (but sometimes '0').
>
> So a main problem in analyzing is the 'moment of failure' and 'reproduction (rate)'.
>
> Met vriendelijke groeten | Best Regards,
> Anton Prins
>
> Met vriendelijke groeten | Best Regards,
> Anton Prins
>
>
> -----Original Message-----
> From: Adrian Hunter [mailto:adrian.hunter at intel.com]
> Sent: woensdag 12 juni 2013 14:00
> To: Prins Anton (ST-CO/ENG1.1)
> Cc: dedekind1 at gmail.com; linux-mtd at lists.infradead.org
> Subject: Re: UBIFS failure & stable page writes
>
> On 12/06/13 14:13, Prins Anton (ST-CO/ENG1.1) wrote:
>> Does it make sense to use chk_orphans?
> That checks for missing orphans but the problem is the opposite: the
> presence of orphans that should not be there.
>
>> I suppose I have to do: $ echo "1" > chk_orphans
>>
>> Or am I wrong?
>>
>> Met vriendelijke groeten | Best Regards,
>> Anton Prins
>>




More information about the linux-mtd mailing list