cqe dump errors on target while running nvme-of large block read IO

Majd Dibbiny majd at mellanox.com
Wed Apr 26 21:29:27 PDT 2017


> On Apr 27, 2017, at 2:57 AM, Sagi Grimberg <sagi at grimberg.me> wrote:
> 
> 
>> Folks, I have to apologize here.  Flow control was at one point configured correctly, but we had a power outage in our lab two weeks ago.
>> We thought the switch had come back up OK, but late last week we were double-checking all the ports in use and we found one port had lost some
>> settings and no longer had flow control enabled (it had also reverted from 100Gb to 50Gb for some reason).  After fixing the port settings
>> we ran IO through the weekend and the early part of this week on a variety of workloads.  We don't seem to be able to reproduce the failure
>> after fixing the port settings.  It looks like this one may have been caused by the lost flow control setting on the switch.  Sorry for the confusion!
> 
> Yep, don't expect RoCE to work without flow-control.

Sagi and all,

Starting with the next release, PFC isn't required on ConnectX-4 and above as long as ECN is enabled.

With the current release, we also expect good results with ECN and no PFC.

Joseph - is ECN enabled on your system?

Thanks