nvme nvme0: I/O 0 (I/O Cmd) QID 1 timeout, aborting, source drive corruption observed

Keith Busch kbusch at kernel.org
Fri Dec 16 11:08:49 PST 2022


On Fri, Dec 16, 2022 at 07:39:53AM +0100, Christoph Hellwig wrote:
> On Fri, Dec 16, 2022 at 07:30:55AM +0900, J. Hart wrote:
> > I've tried the obvious ones and that didn't help either.  I guess I'll have 
> > to give up on it and return it as defective.  I'll go back to normal 
> > operation and to try and find a controller/device combination that works 
> > with the linux driver if there are any.
> 
> So on the one hand I agree with Keith that the device seems really broken.
> On the other hand the fact that the source file system on another device
> sees corruption even with the iommu enabled is something that looks
> scary.  Even if ultimately caused by the device somehow, that seems
> like the kernel is part of the corruption.  And I have absolutely no
> idea how.  A KASAN run on the device might be helpful, but I'm also
> reluctant to ask a reporter to run more reproducers of something that
> corrupts his data.

Oh, I assumed the source was a different partition on the same flakey
looking device. If not, yeah, that's pretty concerning.

How do you know enabling Intel IOMMU in the kernel config does anything
here? I didn't see anything confirming the kernel was actually using it.
I know this CPU model has VT-d capabilities, but I believe the platform
may disable it in BIOS.
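
One way to check this from userspace is sketched below. It is a minimal
sketch, not a definitive test: it assumes the usual sysfs layout
(/sys/kernel/iommu_groups, /sys/class/iommu, and an iommu_group link under
the controller's PCI device), and the function name iommu_status and its
parameters are made up for illustration. Note that with CONFIG_INTEL_IOMMU
set but CONFIG_INTEL_IOMMU_DEFAULT_ON unset, the kernel also needs
intel_iommu=on on the command line, so the script reports those flags too.

```python
import os


def iommu_status(sysfs_root="/sys", proc_root="/proc", nvme_ctrl="nvme0"):
    """Collect hints about whether the running kernel is using an IOMMU.

    Hypothetical helper: the root paths are parameters only so the
    function can be pointed at a fake tree for testing.
    """
    groups_dir = os.path.join(sysfs_root, "kernel", "iommu_groups")
    units_dir = os.path.join(sysfs_root, "class", "iommu")
    # For a PCIe controller, /sys/class/nvme/<ctrl>/device links to the
    # PCI device, which has an iommu_group link when it sits behind an
    # active IOMMU.
    dev_group = os.path.join(
        sysfs_root, "class", "nvme", nvme_ctrl, "device", "iommu_group"
    )

    def entries(path):
        # A missing directory is treated the same as an empty one.
        try:
            return sorted(os.listdir(path))
        except OSError:
            return []

    try:
        with open(os.path.join(proc_root, "cmdline")) as f:
            cmdline = f.read().split()
    except OSError:
        cmdline = []

    return {
        "iommu_groups": len(entries(groups_dir)),
        "iommu_units": entries(units_dir),
        "device_in_group": os.path.exists(dev_group),
        "cmdline_flags": [
            arg for arg in cmdline
            if arg.startswith(("intel_iommu=", "iommu="))
        ],
    }


if __name__ == "__main__":
    for key, value in iommu_status().items():
        print(f"{key}: {value}")
```

If iommu_groups is 0 and device_in_group is False, the kernel is most
likely not translating DMA for the controller, whatever the config says.
The DMAR lines in dmesg would be the next thing to look at.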


