[PATCH RFC] ubifs: Fix use-after-free in ubifs_tnc_end_commit

Zhihao Cheng chengzhihao1 at huawei.com
Wed Nov 6 23:14:56 PST 2024


在 2024/11/7 0:36, Waqar Hameed 写道:
> Sorry for the late response Zhihao! I've been quite busy these days...
> 
> On Fri, Oct 18, 2024 at 09:40 +0800 Zhihao Cheng <chengzhihao1 at huawei.com> wrote:
> 
>> 在 2024/10/18 2:36, Waqar Hameed 写道:
>>> On Wed, Oct 16, 2024 at 10:11 +0800 Zhihao Cheng <chengzhihao1 at huawei.com> wrote:
>>> [...]
>>>
>>>> BTW, what is the configuration of your flash?(eg. erase size, page size)?
>>> $ mtdinfo /dev/mtd2
>>>     mtd2
>>>     Name:                           firmware
>>>     Type:                           nand
>>>     Eraseblock size:                131072 bytes, 128.0 KiB
>>>     Amount of eraseblocks:          1832 (240123904 bytes, 229.0 MiB)
>>>     Minimum input/output unit size: 2048 bytes
>>>     Sub-page size:                  2048 bytes
>>>     OOB size:                       64 bytes
>>>     Character device major/minor:   90:4
>>>     Bad blocks are allowed:         true
>>>     Device is writable:             true
>>> $ ubinfo /dev/ubi0_0
>>>     Volume ID:   0 (on ubi0)
>>>     Type:        dynamic
>>>     Alignment:   1
>>>     Size:        661 LEBs (83931136 bytes, 80.0 MiB)
>>>     State:       OK
>>>     Name:        test-vol
>>>     Character device major/minor: 244:1
>>> [...]
>>
>> Thanks, I will change my nandsim configurations to generate a mtd device the
>> same model.
> 
> Did you manage to reproduce the issue with this?

I tried, but I still cannot reproduce it on my local machine.
> 
>>>
>>>> Well, let's do a preliminary analysis.
>>>> The znode->cparent[znode->ciip] is a freed address in write_index(), which
>>>> means:
>>>> 1. 'znode->ciip' is valid, znode->cparent is freed by tnc_delete, however znode
>>>> cannot be freed if znode->cnext is not NULL, which means:
>>>>     a) 'znode->cparent' is not dirty, we should add an assertion like
>>>>     ubifs_assert(c, ubifs_zn_dirty(znode->cparent)) in get_znodes_to_commit().
>>>>     Note, please check that 'znode->cparent' is not NULL before the assertion.
>>>>     b) 'znode->cparent' is dirty, but it is not added into list 'c->cnext', we
>>>>     should traverse the entire TNC in get_znodes_to_commit() to make sure that all
>>>>     dirty znodes are collected into list 'c->cnext', so another assertion is
>>>>    needed.
> 
> I'm a little worried that traversing the whole TNC could change the
> timing behavior, and thus might not trigger the race. Let's do that in
> steps? Start with the other asserts (see diff below) and later just do
> this assert. Does that sound reasonable?

Fine. I add one comment below.
> 
> I could modify `dbg_check_tnc()` so that it also checks that each dirty
> `znode` is present in `c->cnext` list. We then call this at the end of
> `get_znodes_to_commit()`.
> 

Sounds good to me, please remove other non-related checks in 
dbg_check_tnc().
>>>> 2. 'znode->ciip' is invalid, and the value beyonds the memory area of
>>>> znode->cparent. All znodes are allocated with size of 'c->max_znode_sz', which
>>>> means that 'znode->ciip' exceeds the 'c->fantout', so we can add an assertion
>>>> like ubifs_assert(c, znode->ciip < c->fantout) in get_znodes_to_commit().
>>>>
>>>> That's what I can think of, are there any other possibilities?
>>> I looked a little more at `get_znodes_to_commit()` when adding the
>>> asserts you suggest, and I have a question: what happens when
>>> `find_next_dirty()` returns `NULL`? In that case
>>> ```
>>> znode->cnext = c->cnext;
>>> ```
>>> but `znode->cparent` and `znode->ciip` are not updated. Shouldn't they?
>>
>> Good thinking.
>> According to the implementation of find_next_dirty(), the order of dirty znodes
>> collection is bottom-up, which means that the last dirty znode is the root
>> znode, so it doesn't have a parent. You can verify that by adding assertion to
>> check whether the last dirty znode is the root.
> 
> [...]
> 
> To summarize, I'll start a run with the following asserts:
> 
> diff --git a/fs/ubifs/tnc_commit.c b/fs/ubifs/tnc_commit.c
> index a55e04822d16..4eef82e02afe 100644
> --- a/fs/ubifs/tnc_commit.c
> +++ b/fs/ubifs/tnc_commit.c
> @@ -652,11 +652,17 @@ static int get_znodes_to_commit(struct ubifs_info *c)
>   	}
>   	cnt += 1;
>   	while (1) {

Please move the check after the assignment of 'znode->cparent', because 
'znode->parent' could be switched by tnc_insert().
> +		ubifs_assert(c, znode->ciip < c->fantout);
> +		if (znode->cparent) {
> +			ubifs_assert(c, ubifs_zn_dirty(znode->cparent));
> +		}
> +
>   		ubifs_assert(c, !ubifs_zn_cow(znode));
>   		__set_bit(COW_ZNODE, &znode->flags);
>   		znode->alt = 0;
>   		cnext = find_next_dirty(znode);
>   		if (!cnext) {
> +			ubifs_assert(c, znode == c->zroot.znode);
>   			znode->cnext = c->cnext;
>   			break;
>   		}
> 

@@ -662,6 +662,10 @@ static int get_znodes_to_commit(struct ubifs_info *c)
                 }
                 znode->cparent = znode->parent;
                 znode->ciip = znode->iip;
+               if (znode->cparent) {
+                       ubifs_assert(c, ubifs_zn_dirty(znode->cparent));
+               }
+               ubifs_assert(c, znode->ciip < c->fantout);
                 znode->cnext = cnext;
                 znode = cnext;
                 cnt += 1;

> Then later, another run with a modified `dbg_check_tnc()` to check that
> all dirty `znode`s are indeed present in the list `c->cnext`.
> .
> 




More information about the linux-mtd mailing list