[PATCH] nvme-fabrics: add-remove ctrl repeat fix

J Freyensee james_p_freyensee at linux.intel.com
Tue Jul 12 16:15:06 PDT 2016


On Fri, 2016-07-01 at 12:13 -0700, Jay Freyensee wrote:

I haven't seen this patch get folded in on the list yet, though
Christoph OK'ed the fix, minus the 'Fix by:' I used instead of 'From:'.

Has this been folded into 4.8?  Should I go ahead and re-spin the patch
and make the tweak change?  Be nice to eliminate this kernel crash with
fabrics being merged into 4.8 kernel.

Thanks,
Jay

> Fix by: "Ming Lin" <mlin at kernel.org>
> 
> Repeatedly adding then removing the same NVMe-over-Fabrics controller
> over and over again (shown below) can cause a kernel crash (also
> shown
> below).  This patch fixes that.
> 
> [nvmf]# ./setup_nvme_connections.sh
> traddr=192.168.1.100,transport=rdma,trsvcid=4420,nqn=darkside
> -nqn,hostnqn=evil-wins-nqn,nr_io_queues=16 > /dev/nvme-fabrics
> traddr=192.168.1.100,transport=rdma,trsvcid=4420,nqn=lightside
> -nqn,hostnqn=good-wins-nqn > /dev/nvme-fabrics
> [nvmf]# ./remove_nvme_connections.sh 2
> echo 1 > /sys/class/nvme/nvme0/delete_controller
> echo 1 > /sys/class/nvme/nvme1/delete_controller
> [nvmf]# ./setup_nvme_connections.sh
> traddr=192.168.1.100,transport=rdma,trsvcid=4420,nqn=darkside
> -nqn,hostnqn=evil-wins-nqn,nr_io_queues=16 > /dev/nvme-fabrics
> Killed
> 
> [nvmf]# dmesg
> [  313.416908] nvme nvme0: creating 16 I/O queues.
> [  313.523908] nvme nvme0: new ctrl: NQN "darkside-nqn", addr
> 192.168.1.100:4420
> [  313.524857] BUG: unable to handle kernel NULL pointer dereference
> at
> 0000000000000010
> [  313.525262] IP: [<ffffffff8136c60e>] strcmp+0xe/0x30
> [  313.525490] PGD 0
> [  313.525726] Oops: 0000 [#1] SMP
> [  313.525900] Modules linked in: nvme_rdma nvme_fabrics nvme_core
> ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm
> mlx4_en
> mlx4_ib ib_core mlx4_core
> [  313.527085] CPU: 15 PID: 5856 Comm: setup_nvme_conn Not tainted
> 4.7.0-rc2+ #2
> [  313.527259] Hardware name: Supermicro X9DRT-F/IBQF/IBFF/X9DRT
> -F/IBQF/IBFF, BIOS 1.0a 10/09/2012
> [  313.527551] task: ffff88027646cd40 ti: ffff88025b980000 task.ti:
> ffff88025b980000
> [  313.527879] RIP: 0010:[<ffffffff8136c60e>]  [<ffffffff8136c60e>]
> strcmp+0xe/0x30
> [  313.528232] RSP: 0018:ffff88025b983db0  EFLAGS: 00010206
> [  313.528403] RAX: 0000000000000000 RBX: ffff880471879880 RCX:
> fffffffffffffff1
> [  313.528594] RDX: 0000000000000000 RSI: ffff880474afa860 RDI:
> 0000000000000011
> [  313.528778] RBP: ffff88025b983db0 R08: ffff880474afa860 R09:
> ffff880471879058
> [  313.528956] R10: 000000000000002c R11: ffff88047f415000 R12:
> ffff880471879800
> [  313.529129] R13: ffff880471879000 R14: ffff880474afa860 R15:
> fffffffffffffff8
> [  313.529303] FS:  00007f778f510700(0000) GS:ffff88047fbc0000(0000)
> knlGS:0000000000000000
> [  313.529629] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  313.529817] CR2: 0000000000000010 CR3: 0000000274174000 CR4:
> 00000000000406e0
> [  313.529989] Stack:
> [  313.530154]  ffff88025b983e48 ffffffffa0171c74 0000000000000001
> 0000000000000059
> [  313.530621]  ffff880476f32400 ffff88047e8add80 0000010074b33aa0
> ffff880471879059
> [  313.531162]  ffff88047187904b ffff880471879058 0000000000000000
> ffff88047736e000
> [  313.531629] Call Trace:
> [  313.531797]  [<ffffffffa0171c74>] nvmf_dev_write+0x674/0x840
> [nvme_fabrics]
> [  313.531974]  [<ffffffff81180b53>] __vfs_write+0x23/0x120
> [  313.532146]  [<ffffffff8119daff>] ? __fd_install+0x1f/0xc0
> [  313.532316]  [<ffffffff8119d97a>] ? __alloc_fd+0x3a/0x170
> [  313.532487]  [<ffffffff811811f3>] vfs_write+0xb3/0x1b0
> [  313.532658]  [<ffffffff8117e321>] ? filp_close+0x51/0x70
> [  313.532845]  [<ffffffff811824e1>] SyS_write+0x41/0xa0
> [  313.533016]  [<ffffffff8183055b>]
> entry_SYSCALL_64_fastpath+0x13/0x8f
> [  313.533188] Code: 80 3a 00 75 f7 48 83 c6 01 0f b6 4e ff 48 83 c2
> 01
> 84 c9 88 4a ff 75 ed 5d c3 0f 1f 00 55 48 89 e5 eb 04 84 c0 74 18 48
> 83
> c7 01 <0f> b6 47 ff 48 83 c6 01 3a 46 ff 74 eb 19 c0 83 c8 01 5d c3
> 31
> [  313.536563] RIP  [<ffffffff8136c60e>] strcmp+0xe/0x30
> [  313.536815]  RSP <ffff88025b983db0>
> [  313.536981] CR2: 0000000000000010
> [  313.537151] ---[ end trace 3d952e590e7bc2d5 ]---
> 
> Reported-and-tested-by: Jay Freyensee <james.p.freyensee at intel.com>
> Signed-off-by: Ming Lin <mlin at kernel.org>
> Signed-off-by: Jay Freyensee <james.p.freyensee at intel.com>
> ---
>  drivers/nvme/host/fabrics.c | 4 ++++
>  1 file changed, 4 insertions(+)
> 
> diff --git a/drivers/nvme/host/fabrics.c
> b/drivers/nvme/host/fabrics.c
> index 918310c..f8045e7 100644
> --- a/drivers/nvme/host/fabrics.c
> +++ b/drivers/nvme/host/fabrics.c
> @@ -88,6 +88,10 @@ static void nvmf_host_destroy(struct kref *ref)
>  {
>  	struct nvmf_host *host = container_of(ref, struct nvmf_host,
> ref);
>  
> +	mutex_lock(&nvmf_hosts_mutex);
> +	list_del(&host->list);
> +	mutex_unlock(&nvmf_hosts_mutex);
> +
>  	kfree(host);
>  }
>  



More information about the Linux-nvme mailing list