Unable to reconnect namespace via NVMe/TCP

Anton Gavriliuk antosha20xx at gmail.com
Tue Aug 19 07:29:07 PDT 2025


Hi Yi Zhang

> Please try to add the uuid in the device field, which should fix your issue.

On the NVMe/TCP target for the given device (/dev/nvme0n1) nguid and
uuid are exactly the same

[root at memverge4 ~]# cat /sys/class/block/nvme0n1/uuid
01000000-0000-0000-8ce3-8ee3064aa4f2
[root at memverge4 ~]# cat /sys/class/block/nvme0n1/nguid
01000000-0000-0000-8ce3-8ee3064aa4f2

So I added uuid

      "namespaces": [
        {
          "device": {
            "nguid": "01000000-0000-0000-8ce3-8ee3064aa4f2",
            "uuid": "01000000-0000-0000-8ce3-8ee3064aa4f2",
            "path": "/dev/nvme0n1"
          },
          "enable": 1,
           "nsid": 1
        }
      ],

Yes, this fixed my issue - automatically reconnect namespace after
NVMe/TCP target reboot.

Anton

вт, 19 авг. 2025 г. в 16:43, Yi Zhang <yi.zhang at redhat.com>:
>
> Hi Anton
>
> Please try to add the uuid in the device field, which should fix your issue.
>
> On Tue, Aug 19, 2025 at 6:55 PM Yi Zhang <yi.zhang at redhat.com> wrote:
> >
> > Hi Hannes
> >
> > I tried with the upstream kernel v6.17-rc2, and it can still be reproduced.
> >
> > # dmesg | tail -30
> > [  219.560691] nvme nvme0: Failed reconnect attempt 3/-1
> > [  219.565784] nvme nvme0: Reconnecting in 10 seconds...
> > [  229.795215] nvme nvme0: failed to connect socket: -111
> > [  229.800369] nvme nvme0: Failed reconnect attempt 4/-1
> > [  229.805450] nvme nvme0: Reconnecting in 10 seconds...
> > [  240.034918] nvme nvme0: failed to connect socket: -111
> > [  240.040093] nvme nvme0: Failed reconnect attempt 5/-1
> > [  240.045165] nvme nvme0: Reconnecting in 10 seconds...
> > [  250.274619] nvme nvme0: failed to connect socket: -111
> > [  250.279776] nvme nvme0: Failed reconnect attempt 6/-1
> > [  250.284855] nvme nvme0: Reconnecting in 10 seconds...
> > [  260.514102] nvme nvme0: failed to connect socket: -111
> > [  260.519261] nvme nvme0: Failed reconnect attempt 7/-1
> > [  260.524340] nvme nvme0: Reconnecting in 10 seconds...
> > [  270.754031] nvme nvme0: failed to connect socket: -111
> > [  270.759184] nvme nvme0: Failed reconnect attempt 8/-1
> > [  270.764263] nvme nvme0: Reconnecting in 10 seconds...
> > [  280.993410] nvme nvme0: failed to connect socket: -111
> > [  280.998591] nvme nvme0: Failed reconnect attempt 9/-1
> > [  281.003653] nvme nvme0: Reconnecting in 10 seconds...
> > [  291.249090] nvme nvme0: creating 4 I/O queues.
> > [  291.264959] nvme nvme0: mapped 4/0/0 default/read/poll queues.
> > [  291.271975] nvme nvme0: Successfully reconnected (attempt 10/-1)
> > [  291.273897] nvme nvme0: identifiers changed for nsid 2
> > [  291.283631] block nvme0n1: no available path - failing I/O
> > [  291.289139] block nvme0n1: no available path - failing I/O
> > [  291.294649] block nvme0n1: no available path - failing I/O
> > [  291.300159] block nvme0n1: no available path - failing I/O
> > [  291.305665] block nvme0n1: no available path - failing I/O
> > [  291.311197] block nvme0n1: no available path - failing I/O
> >
> > On Tue, Aug 19, 2025 at 2:11 PM Hannes Reinecke <hare at suse.de> wrote:
> > >
> > > On 8/12/25 17:48, Anton Gavriliuk wrote:
> > > > Hi
> > > >
> > > > There are NVMe/TCP target and initiator servers, both running on
> > > > RHEL10 (6.12.0-55.25.1.el10_0.x86_64)
> > > >
> > > > NVMe/TCP target exports single NVMe SSD
> > > >
> > > >        "namespaces": [
> > > >          {
> > > >            "device": {
> > > >              "nguid": "01000000-0000-0000-8ce3-8ee3064aa4f2",
> > > >              "path": "/dev/nvme0n1"
> > > >            },
> > > >            "enable": 1,
> > > >            "nsid": 1
> > > >          }
> > > >        ],
> > > >
> > > > If NVMe/TCP target is not available, initiator tries to reconnect
> > > > every 10 seconds
> > > >
> > > > [ 2586.071048] nvme nvme9: failed to connect socket: -111
> > > > [ 2586.071403] nvme nvme9: Failed reconnect attempt 16/-1
> > > > [ 2586.071565] nvme nvme9: Reconnecting in 10 seconds...
> > > > [ 2596.310921] nvme nvme9: failed to connect socket: -111
> > > > [ 2596.311186] nvme nvme9: Failed reconnect attempt 17/-1
> > > > [ 2596.311349] nvme nvme9: Reconnecting in 10 seconds...
> > > > [ 2606.550772] nvme nvme9: failed to connect socket: -111
> > > > [ 2606.551252] nvme nvme9: Failed reconnect attempt 18/-1
> > > > [ 2606.551592] nvme nvme9: Reconnecting in 10 seconds...
> > > >
> > > > when NVMe/TCP target become available, initiator failed reconnect the namespace
> > > >
> > > > [ 2606.551592] nvme nvme9: Reconnecting in 10 seconds...
> > > > [ 2616.793080] nvme nvme9: creating 16 I/O queues.
> > > > [ 2616.829881] nvme nvme9: mapped 16/0/0 default/read/poll queues.
> > > > [ 2616.833685] nvme nvme9: Successfully reconnected (attempt 19/-1)
> > > > [ 2616.834446] nvme nvme9: identifiers changed for nsid 1
> > > > [ 2616.835618] block nvme9n1: no usable path - requeuing I/O
> > > > [ 2616.856602] block nvme9n1: no available path - failing I/O
> > > > [ 2616.856811] block nvme9n1: no available path - failing I/O
> > > >
> > > > and there is no nvme9n1 namespace in the "nvme list" output.
> > > >
> > > This looks like the missed re-scan issue I found recently.
> > > Should be fixed with
> > > 9546ad1a9bda ("nvme: requeue namespace scan on missed AENs")
> > >
> > > (And you are running RHEL. Please open a bugzilla with RH.)
> > > (And why am I even answering that?)
> > >
> > > Cheers,
> > >
> > > Hannes
> > > --
> > > Dr. Hannes Reinecke                  Kernel Storage Architect
> > > hare at suse.de                                +49 911 74053 688
> > > SUSE Software Solutions GmbH, Frankenstr. 146, 90461 Nürnberg
> > > HRB 36809 (AG Nürnberg), GF: I. Totev, A. McDonald, W. Knoblich
> > >
> >
> >
> > --
> > Best Regards,
> >   Yi Zhang
>
>
>
> --
> Best Regards,
>   Yi Zhang
>



More information about the Linux-nvme mailing list