[PATCH 5/7] nvme: Fix controller creation races with teardown flow
James Smart
james.smart at broadcom.com
Wed Aug 19 19:16:26 EDT 2020
On 3/24/2020 8:29 AM, Israel Rukshin wrote:
> Calling nvme_sysfs_delete() when the controller is in the middle of
> creation may cause several bugs. If the controller is in NEW state we
> remove delete_controller file and don't delete the controller. The user
> will not be able to use nvme disconnect command on that controller again,
> although the controller may be active. Other bugs may happen if the
> controller is in the middle of create_ctrl callback and
> nvme_do_delete_ctrl() starts. For example, freeing I/O tagset at
> nvme_do_delete_ctrl() before it was allocated at create_ctrl callback.
>
> To fix all those races don't allow the user to delete the controller
> before it was fully created.
>
> Signed-off-by: Israel Rukshin <israelr at mellanox.com>
> Reviewed-by: Max Gurtovoy <maxg at mellanox.com>
> Reviewed-by: Christoph Hellwig <hch at lst.de>
> ---
> drivers/nvme/host/core.c | 5 +++++
> drivers/nvme/host/nvme.h | 1 +
> 2 files changed, 6 insertions(+)
>
> diff --git a/drivers/nvme/host/core.c b/drivers/nvme/host/core.c
> index ba064fd..9961d0e 100644
> --- a/drivers/nvme/host/core.c
> +++ b/drivers/nvme/host/core.c
> @@ -3228,6 +3228,10 @@ static ssize_t nvme_sysfs_delete(struct device *dev,
> {
> struct nvme_ctrl *ctrl = dev_get_drvdata(dev);
>
> + /* Can't delete non-created controllers */
> + if (!ctrl->created)
> + return -EBUSY;
> +
> if (device_remove_file_self(dev, attr))
> nvme_delete_ctrl_sync(ctrl);
> return count;
> @@ -4039,6 +4043,7 @@ void nvme_start_ctrl(struct nvme_ctrl *ctrl)
> nvme_queue_scan(ctrl);
> nvme_start_queues(ctrl);
> }
> + ctrl->created = true;
> }
> EXPORT_SYMBOL_GPL(nvme_start_ctrl);
>
FYI - I've hit a scenario with this patch, where if the device starts
rejecting the initial connections or they continuously hit a failure -
we're forced to wait ctrl_loss_tmo before it goes away. We can't
forcibly delete the controller via sysfs. This shouldn't be possible.
I understand the race conditions with delete and am looking at the same
thing on FC. Looking at what was trying to be achieved, it seems to
overlap with some of the teardown that Sagi is synchronizing with the
teardown_lock.
We may want to revisit this change.
-- james
More information about the Linux-nvme
mailing list