[PATCH v3 9/9] nvme-rdma: fix reset hang if controller died in the middle of a reset
Christoph Hellwig
hch at lst.de
Thu Aug 20 02:10:56 EDT 2020
On Wed, Aug 19, 2020 at 10:36:51PM -0700, Sagi Grimberg wrote:
> If the controller becomes unresponsive in the middle of a reset, we
> will hang because we are waiting for the freeze to complete, but that
> cannot happen since we have commands that are inflight holding the
> q_usage_counter, and we can't blindly fail requests that times out.
>
> So give a timeout and if we cannot wait for queue freeze before
> unfreezing, fail and have the error handling take care how to
> proceed (either schedule a reconnect of remove the controller).
>
> Signed-off-by: Sagi Grimberg <sagi at grimberg.me>
Looks good,
Reviewed-by: Christoph Hellwig <hch at lst.de>
More information about the Linux-nvme
mailing list