[PATCH 14/14] nvme: use blk_mq_[un]quiesce_tagset
Shakeel Butt
shakeelb at google.com
Mon Nov 21 12:44:50 PST 2022
On Tue, Nov 01, 2022 at 04:00:50PM +0100, Christoph Hellwig wrote:
> From: Chao Leng <lengchao at huawei.com>
>
> All controller namespaces share the same tagset, so we can use this
> interface which does the optimal operation for parallel quiesce based on
> the tagset type(e.g. blocking tagsets and non-blocking tagsets).
>
> nvme connect_q should not be quiesced when quiesce tagset, so set the
> QUEUE_FLAG_SKIP_TAGSET_QUIESCE to skip it when init connect_q.
>
> Currently we use NVME_NS_STOPPED to ensure pairing quiescing and
> unquiescing. If use blk_mq_[un]quiesce_tagset, NVME_NS_STOPPED will be
> invalided, so introduce NVME_CTRL_STOPPED to replace NVME_NS_STOPPED.
> In addition, we never really quiesce a single namespace. It is a better
> choice to move the flag from ns to ctrl.
>
> Signed-off-by: Chao Leng <lengchao at huawei.com>
> [hch: rebased on top of prep patches]
> Signed-off-by: Christoph Hellwig <hch at lst.de>
> Reviewed-by: Keith Busch <kbusch at kernel.org>
> Reviewed-by: Chao Leng <lengchao at huawei.com>
> Reviewed-by: Hannes Reinecke <hare at suse.de>
> Reviewed-by: Sagi Grimberg <sagi at grimberg.me>
This patch is causing the following crash at the boot and reverting it
fixes the issue. This is next-20221121 kernel.
[ 19.781883] BUG: kernel NULL pointer dereference, address: 0000000000000078
[ 19.788761] #PF: supervisor write access in kernel mode
[ 19.793924] #PF: error_code(0x0002) - not-present page
[ 19.799006] PGD 0 P4D 0
[ 19.801510] Oops: 0002 [#1] SMP PTI
[ 19.804955] CPU: 54 PID: 110 Comm: kworker/u146:0 Tainted: G I 6.1.0-next-20221121-smp-DEV #1
[ 19.814753] Hardware name: ...
[ 19.822662] Workqueue: nvme-reset-wq nvme_reset_work
[ 19.827571] RIP: 0010:mutex_lock+0x13/0x30
[ 19.831617] Code: 07 00 00 f7 c1 00 01 00 00 75 02 31 c0 5b 5d c3 00 00 cc cc 00 00 cc 0f 1f 44 00 00 55 48 89 e5 65 48 8b 0d 7f 9a c2 52 31 c0 <f0> 48 0f b1 0f 74 05 e8 11 00 00 00 5d c3 66 66 66 66 66 66 2e 0f
[ 19.850172] RSP: 0000:ffff96624651bcb0 EFLAGS: 00010246
[ 19.855339] RAX: 0000000000000000 RBX: 0000000000000003 RCX: ffff966246530000
[ 19.862388] RDX: 0000000000000000 RSI: 0000000000000286 RDI: 0000000000000078
[ 19.869442] RBP: ffff96624651bcb0 R08: 0000000000000004 R09: ffff96624651bc74
[ 19.876495] R10: 0000000000000000 R11: 0000000000000000 R12: ffff966246570000
[ 19.883547] R13: ffff966246570000 R14: ffff966249f330b0 R15: 0000000000000000
[ 19.890599] FS: 0000000000000000(0000) GS:ffff9681bf880000(0000) knlGS:0000000000000000
[ 19.898594] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 19.904273] CR2: 0000000000000078 CR3: 0000003c40e0a001 CR4: 00000000001706e0
[ 19.911323] Call Trace:
[ 19.913743] <TASK>
[ 19.915818] blk_mq_quiesce_tagset+0x23/0xc0
[ 19.920039] nvme_stop_queues+0x1e/0x30
[ 19.923829] nvme_dev_disable+0xe1/0x3a0
[ 19.927702] nvme_reset_work+0x14cb/0x1600
[ 19.931751] ? ttwu_queue_wakelist+0xc4/0xd0
[ 19.935970] ? try_to_wake_up+0x1b4/0x2d0
[ 19.939933] process_one_work+0x1a4/0x320
[ 19.943895] worker_thread+0x241/0x3f0
[ 19.947597] ? worker_clr_flags+0x50/0x50
[ 19.951560] kthread+0x10d/0x120
[ 19.954748] ? kthread_blkcg+0x30/0x30
[ 19.958453] ret_from_fork+0x1f/0x30
[ 19.961987] </TASK>
[ 19.964145] Modules linked in:
[ 19.967655] gsmi: Log Shutdown Reason 0x03
[ 19.971700] CR2: 0000000000000078
[ 19.974978] ---[ end trace 0000000000000000 ]---
[ 20.513708] RIP: 0010:mutex_lock+0x13/0x30
[ 20.517757] Code: 07 00 00 f7 c1 00 01 00 00 75 02 31 c0 5b 5d c3 00 00 cc cc 00 00 cc 0f 1f 44 00 00 55 48 89 e5 65 48 8b 0d 7f 9a c2 52 31 c0 <f0> 48 0f b1 0f 74 05 e8 11 00 00 00 5d c3 66 66 66 66 66 66 2e 0f
[ 20.536308] RSP: 0000:ffff96624651bcb0 EFLAGS: 00010246
[ 20.541469] RAX: 0000000000000000 RBX: 0000000000000003 RCX: ffff966246530000
[ 20.548519] RDX: 0000000000000000 RSI: 0000000000000286 RDI: 0000000000000078
[ 20.555568] RBP: ffff96624651bcb0 R08: 0000000000000004 R09: ffff96624651bc74
[ 20.562614] R10: 0000000000000000 R11: 0000000000000000 R12: ffff966246570000
[ 20.569663] R13: ffff966246570000 R14: ffff966249f330b0 R15: 0000000000000000
[ 20.576713] FS: 0000000000000000(0000) GS:ffff9681bf880000(0000) knlGS:0000000000000000
[ 20.584704] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 20.590380] CR2: 0000000000000078 CR3: 0000003c40e0a001 CR4: 00000000001706e0
[ 20.597432] Kernel panic - not syncing: Fatal exception
[ 20.602654] Kernel Offset: 0x2b800000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[ 20.613747] gsmi: Log Shutdown Reason 0x02
More information about the Linux-nvme
mailing list