[PATCH AUTOSEL 6.18-6.6] nvmet-tcp: fixup hang in nvmet_tcp_listen_data_ready()

Tue Jan 20 11:34:50 PST 2026

From: Hannes Reinecke <hare at kernel.org>

[ Upstream commit 2fa8961d3a6a1c2395d8d560ffed2c782681bade ]

When the socket is closed while in TCP_LISTEN a callback is run to
flush all outstanding packets, which in turns calls
nvmet_tcp_listen_data_ready() with the sk_callback_lock held.
So we need to check if we are in TCP_LISTEN before attempting
to get the sk_callback_lock() to avoid a deadlock.

Link: https://lore.kernel.org/linux-nvme/CAHj4cs-zu7eVB78yUpFjVe2UqMWFkLk8p+DaS3qj+uiGCXBAoA@mail.gmail.com/
Tested-by:  Yi Zhang <yi.zhang at redhat.com>
Reviewed-by: Sagi Grimberg <sagi at grimberg.me>
Signed-off-by: Hannes Reinecke <hare at kernel.org>
Signed-off-by: Keith Busch <kbusch at kernel.org>
Signed-off-by: Sasha Levin <sashal at kernel.org>
---

LLM Generated explanations, may be completely bogus:

Let me understand the context of the commit better - it might be that
the commit requires a minor adjustment for backport since it uses
`nvmet_wq` but in the current tree (and stable trees), the workqueue is
called `nvmet_tcp_wq`. Looking at the diff more carefully:

The diff shows:
```c
+       if (port)
                queue_work(nvmet_wq, &port->accept_work);
```

Wait, the space before `queue_work` in the diff context shows this is
the EXISTING code (no + sign). But wait, looking at the current code:
```c
schedule_work(&port->accept_work);
```

So there's a discrepancy. The commit I'm analyzing might be from a tree
where `schedule_work` was changed to `queue_work(nvmet_wq, ...)` in
another commit.

For the purpose of backporting analysis, the KEY FIX is:
1. **Check `sk->sk_state != TCP_LISTEN` BEFORE acquiring the lock** -
   this is the essential fix for the deadlock
2. The change from `schedule_work` to `queue_work(nvmet_wq, ...)` is
   secondary

For stable backports, minor adjustments may be needed (using
`schedule_work` instead of `queue_work(nvmet_wq, ...)`), but the core
fix (early state check before lock acquisition) is applicable.

## SUMMARY

**What the commit fixes**: A deadlock that occurs when
`nvmet_tcp_listen_data_ready()` is called during socket cleanup with
`sk_callback_lock` already held. The fix checks the socket state before
acquiring the lock.

**Why it matters for stable users**: This is a deadlock/hang that can
occur during NVMe-over-TCP disconnect scenarios. Systems using NVMe-
over-TCP will hang, requiring a hard reset.

**Meets stable kernel rules**:
- Obviously correct: Yes - simple early return check before lock
  acquisition
- Fixes a real bug: Yes - deadlock reported and reproduced via blktests
- Small scope: Yes - 10 lines changed in one function
- No new features: Correct - pure bug fix
- Tested: Yes - "Tested-by" tag from Red Hat QE

**Risk assessment**: Very low. The fix simplifies the logic and cannot
cause regression.

**Dependencies/Backport notes**: The commit uses `queue_work(nvmet_wq,
...)` but older stable trees use `schedule_work(&port->accept_work)`. A
minor adjustment may be needed for clean backport, but the essential fix
(early state check) applies cleanly.

**YES** - This is a clear deadlock fix that should be backported to
stable kernels. The commit:
1. Fixes a real, reproducible deadlock
2. Is minimal and obviously correct
3. Has been tested and reviewed
4. Affects users of NVMe-over-TCP (a production storage protocol)
5. Has very low regression risk

**YES**

 drivers/nvme/target/tcp.c | 9 ++++-----
 1 file changed, 4 insertions(+), 5 deletions(-)

diff --git a/drivers/nvme/target/tcp.c b/drivers/nvme/target/tcp.c
index 470bf37e5a637..2e9a3e698b700 100644
--- a/drivers/nvme/target/tcp.c
+++ b/drivers/nvme/target/tcp.c
@@ -1992,14 +1992,13 @@ static void nvmet_tcp_listen_data_ready(struct sock *sk)
 
 	trace_sk_data_ready(sk);
 
+	if (sk->sk_state != TCP_LISTEN)
+		return;
+
 	read_lock_bh(&sk->sk_callback_lock);
 	port = sk->sk_user_data;
-	if (!port)
-		goto out;
-
-	if (sk->sk_state == TCP_LISTEN)
+	if (port)
 		queue_work(nvmet_wq, &port->accept_work);
-out:
 	read_unlock_bh(&sk->sk_callback_lock);
 }
 
-- 
2.51.0