nvmf/rdma host crash during heavy load and keep alive recovery

Sagi Grimberg sagi at grimberg.me
Sun Sep 18 10:02:16 PDT 2016


>> This stack is creating hctx queues for the namespace created for this target
>> device.
>>
>> Sagi,
>>
>> Should nvme_rdma_error_recovery_work() be stopping the hctx queues for
>> ctrl->ctrl.connect_q too?
>
> Oh.  Actually we'll probably need to take care of the connect_q just
> about anywhere we do anything to the other queues..

Why should we?

We control the IOs on the connect_q (we only submit connect to it) and
we only submit to it if our queue is established.

I still don't see how this explains why Steves is seeing bogus
queue/hctx mappings...



More information about the Linux-nvme mailing list