[PATCH] nvme-fabrics: handle zero MAXCMD without closing the connection

Laurence Oberman loberman at redhat.com
Fri Nov 29 10:39:15 PST 2024


On Fri, 2024-11-29 at 15:17 +0100, Maurizio Lombardi wrote:
> The NVMe specification states that MAXCMD is mandatory
> for NVMe-over-Fabrics implementations. However, some NVMe/TCP
> and NVMe/FC arrays from major vendors have buggy firmware
> that reports MAXCMD as zero in the Identify Controller data
> structure.
> 
> Currently, the implementation closes the connection in such cases,
> completely preventing the host from connecting to the target.
> 
> Fix the issue by printing a clear error message about the firmware
> bug
> and allowing the connection to proceed. It assumes that the
> target supports a MAXCMD value of SQSIZE + 1. If any issues arise,
> the user can manually adjust SQSIZE to mitigate them.
> 
> Fixes: 4999568184e5 ("nvme-fabrics: check max outstanding commands")
> Signed-off-by: Maurizio Lombardi <mlombard at redhat.com>
> ---
>  drivers/nvme/host/core.c | 5 +++--
>  1 file changed, 3 insertions(+), 2 deletions(-)
> 
> diff --git a/drivers/nvme/host/core.c b/drivers/nvme/host/core.c
> index 1a8d32a4a5c3..91ee2c3aa95e 100644
> --- a/drivers/nvme/host/core.c
> +++ b/drivers/nvme/host/core.c
> @@ -3257,8 +3257,9 @@ static int nvme_check_ctrl_fabric_info(struct
> nvme_ctrl *ctrl, struct nvme_id_ct
>         }
>  
>         if (!ctrl->maxcmd) {
> -               dev_err(ctrl->device, "Maximum outstanding commands
> is 0\n");
> -               return -EINVAL;
> +               dev_err(ctrl->device,
> +                       "Firmware bug: maximum outstanding commands
> is 0\n");
> +               ctrl->maxcmd = ctrl->sqsize + 1;
>         }
>  
>         return 0;

A similar fix was tested by a customer seeing this issue and they can
now connect to their array again.
Looks good.

Reviewed-by: Laurence Oberman <loberman at redhat.com>




More information about the Linux-nvme mailing list