blktests failures with v6.4

Shinichiro Kawasaki shinichiro.kawasaki at wdc.com
Fri Jul 7 00:27:37 PDT 2023


Hi all,

I ran the latest blktests (git hash: 30bdcb72ad99) with v6.4 kernel and observed
test case failures listed below. I call for support to fix them. The list of
failures are same as the report for v6.3 kernel [1].

  [1] https://lore.kernel.org/linux-block/rsmmxrchy6voi5qhl4irss5sprna3f5owkqtvybxglcv2pnylm@xmrnpfu3tfpe/

List of failures
================
#1: block/011
#2: block/024
#3: nvme/003 (fabrics transport)
#4: nvme/030 or nvme/031 (rdma transport with siw)
#5: nvme/* (fc transport)

Failure description
===================

#1: block/011

   This test case shows two failure symptoms.

   Symptom A:

     The test case fails with NVME devices due to lockdep WARNING "possible
     circular locking dependency detected". Reported in Sep/2022 [2] and
     solution was discussed. Waiting a fix.

     [2] https://lore.kernel.org/linux-block/20220930001943.zdbvolc3gkekfmcv@shindev/

   Symptom B:

     The test case occasionally fail with fio assert messages. Fio bug is
     suspected. I'm preparing a fix patch [3].

     block/011 => nvme0n1 (disable PCI device while doing I/O)    [failed]
        runtime  32.250s  ...  1684.324s
        --- tests/block/011.out     2023-04-06 10:11:07.920670529 +0900
        +++ /home/shin/Blktests/blktests/results/nvme0n1/block/011.out.bad  2023-07-07 14:34:26.123295632 +0900
        @@ -1,2 +1,11 @@
         Running block/011
        +fio: ioengines.c:335: td_io_queue: Assertion `(io_u->flags & IO_U_F_FLIGHT) == 0' failed.
        +fio: ioengines.c:335: td_io_queue: Assertion `(io_u->flags & IO_U_F_FLIGHT) == 0' failed.
        +fio: ioengines.c:335: td_io_queue: Assertion `(io_u->flags & IO_U_F_FLIGHT) == 0' failed.
        +fio: ioengines.c:335: td_io_queue: Assertion `(io_u->flags & IO_U_F_FLIGHT) == 0' failed.
        +fio: pid=1492, got signal=6
        +fio: pid=1493, got signal=6
        ...
        (Run 'diff -u tests/block/011.out /home/shin/Blktests/blktests/results/nvme0n1/block/011.out.bad' to see the entire diff)

     [3] https://github.com/kawasaki/fio/commit/f4b68b90c6780a73d4eb017449400a7891443e3b

#2: block/024

   Fails on slow machines. Reported in Dec/2022. Test case side issue is
   suspected. Still needs further investigation.

   block/024 (do I/O faster than a jiffy and check iostats times) [failed]
    runtime    ...  4.347s
    --- tests/block/024.out     2022-12-06 20:51:41.525066605 +0900
    +++ /home/shin/kts/kernel-test-suite/sets/blktests/log/runlog/nodev/block/024.out.bad       2022-12-07 12:51:03.610924521 +0900
    @@ -6,5 +6,5 @@
     read 1 s
     write 1 s
     read 2 s
    -write 3 s
    +write 4 s
     Test complete

#3: nvme/003 (fabrics transport)

   When nvme test group is run with trtype=rdma or tcp, the test case fails
   due to lockdep WARNING "possible circular locking dependency detected".
   Reported in May/2023. Bart suggested a fix for trytpe=rdma [4] but it
   needs more discussion.

   [4] https://lore.kernel.org/linux-nvme/20230511150321.103172-1-bvanassche@acm.org/

#4: nvme/030 or nvme/031 (rdma transport with siw)

   When nvme test group is run with trtype=rdma and use_siw=1 configurations,
   nvme/030 or nvme/031 fail occasionally due to "BUG: KASAN: slab-use-after-
   free in __mutex_lock". Reported to linux-rdma in May/2023. A fix was
   suggested but it did not fix the root cause in rdma iwarp cm [5]. Waiting
   for a good fix.

   [5] https://lore.kernel.org/linux-rdma/20230612054237.1855292-1-shinichiro.kawasaki@wdc.com/

#5: nvme/* (fc transport)

   With trtype=fc configuration, test run on nvme test group hangs. Daniel is
   driving fix work [6].

   [6] https://lore.kernel.org/linux-nvme/20230620133711.22840-1-dwagner@suse.de/



More information about the Linux-nvme mailing list