[PATCH] nvme-rdma: fix crash for no IO queues

Chao Leng lengchao at huawei.com
Wed Feb 24 00:52:20 EST 2021



On 2021/2/24 6:03, Chaitanya Kulkarni wrote:
> On 2/22/21 23:30, Chao Leng wrote:
>> A crash happens when set feature(NVME_FEAT_NUM_QUEUES) timeout in nvme
>> over rdma(roce) reconnection, the reason is use the queue which is not
>> alloced.
>>
>> If it is not discovery and no io queues, the connection should fail.
>>
>> Signed-off-by: Chao Leng <lengchao at huawei.com>
> 
> Can you please share more information about
> 
> "when set feature(NVME_FEAT_NUM_QUEUES) timeout" scenario?
We intermittently inject a large number of bit errors. This causes
requests to time out, and error recovery then triggers a reconnection.
Requests issued during the reconnection may also time out. If the set
feature command (NVME_FEAT_NUM_QUEUES, issued by nvme_set_queue_count)
times out, the reconnection may still succeed even though no I/O queues
were allocated. The block layer then continues to send requests, and the
kernel crashes because it uses an unallocated queue.
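
To make the sequence concrete, here is a minimal user-space sketch of the
check described in the patch. It is not the kernel code: struct model_ctrl,
model_set_queue_count and model_setup_io_queues are hypothetical names, and
the sketch assumes that a timed-out set feature leaves the granted queue
count at 0 while the call itself still reports success. The -ENOMEM return
value is likewise only an illustrative choice.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Hypothetical stand-in for a controller; not the kernel's structure. */
struct model_ctrl {
	bool is_discovery;       /* discovery controllers need no I/O queues */
	unsigned int io_queues;  /* number of I/O queues considered allocated */
};

/*
 * Assumption: when the set feature command times out, the granted count
 * is reported as 0 but the call itself does not return an error.
 */
static int model_set_queue_count(bool timed_out, unsigned int *count)
{
	if (timed_out)
		*count = 0;
	return 0;
}

/*
 * Fail the (re)connection when a non-discovery controller ends up with
 * no I/O queues, instead of letting it succeed with nothing allocated.
 */
static int model_setup_io_queues(struct model_ctrl *ctrl, bool timed_out,
				 unsigned int wanted)
{
	unsigned int count = wanted;
	int ret;

	assert(ctrl);

	ret = model_set_queue_count(timed_out, &count);
	if (ret)
		return ret;

	if (count == 0 && !ctrl->is_discovery)
		return -ENOMEM;  /* illustrative error code only */

	ctrl->io_queues = count;
	return 0;
}
```

With this check, a non-discovery controller whose queue count comes back
as 0 fails to reconnect, so the block layer never gets to send requests
to a queue that does not exist. A discovery controller is still allowed
to connect with zero I/O queues.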



