Fix potential data loss and corruption due to Incorrect BIO Chain Handling
Stephen Zhang
starzhangzsd at gmail.com
Thu Nov 27 17:29:52 PST 2025
Christoph Hellwig <hch at infradead.org> 于2025年11月27日周四 15:17写道:
>
> On Thu, Nov 27, 2025 at 03:05:29PM +0800, Stephen Zhang wrote:
> > No, they are not using bcache.
>
> Then please figure out how bio_chain_endio even gets called in this
> setup. I think for mainline the approach should be to fix bcache
> and eorfs to not call into ->bi_end_io and add a BUG_ON() to
> bio_chain_endio to ensure no new callers appear. I
>
Okay, thanks for the suggestion.
> > If there are no further objections or other insights regarding this issue,
> > I will proceed with creating a v2 of this series.
>
> Not sure how that is helpful. You have a problem on a kernel from stone
> age, can't explain what actually happens and propose something that is
> mostly a no-op in mainline, with the callers that could even reach the
> area being clear API misuse.
>
Analysis of the 4.19 kernel bug confirmed it was not caused by the
->bi_end_io call. Instead, this investigation led us to discover a different bug
in the upstream kernel. The v2 patch series is dedicated to fixing this newly
found upstream issue.
Thanks,
shida
>
More information about the Linux-nvme
mailing list