[PATCH v2 0/3] nvmet-rdma: SRQ per completion vector
Max Gurtovoy
maxg at mellanox.com
Fri Nov 17 11:32:42 PST 2017
On 11/16/2017 8:36 PM, Sagi Grimberg wrote:
>
>> Since there is an active discussion regarding the CQ pool
>> architecture, I decided to push
>> this feature (maybe it can be pushed before CQ pool).
>>
>> This is a new feature for NVMEoF RDMA target,
>
> Any chance having this for the rest? isert, srpt, svcrdma?
>
We can implement it for isert, but I think it's better to see how the CQ
pool will be defined first.
It can bring a big benefit and improvement for ib_srpt (similar to
NVMEoF target) but I'm not sure if I can commit for that one soon..
>> results:
>>
>> BS 1.read (target CPU) 2.read (target CPU) 3.read (target
>> CPU) 4.read (target CPU)
>> --- --------------------- --------------------- ---------------------
>> ----------------------
>> 1k 5.88M (80%) 5.45M (72%) 6.77M
>> (91%) 2.2M (72%)
>>
>> 2k 3.56M (65%) 3.45M (59%) 3.72M
>> (64%) 2.12M (59%)
>>
>> 4k 1.8M (33%) 1.87M (32%) 1.88M
>> (32%) 1.59M (34%)
>
> I'm blown by the fact that there is such a small difference for 4k
> reads, how many cpu-cores did you have on the target-system? single
> numa-node? Maybe if that is the case we can use less srqs than
> per-completion-vector...
These results were taken without the performance improvments we sent few
weeks ago and on different and weaker servers.
Are you suggesting another module param for srq_count per device ? I
tried to avoid that.
More information about the Linux-nvme
mailing list