Do you have any performance numbers for this change? Also we shouldn't need the hack for the flush special case in nvme_rdma_post_send once we stop embedding the MR in struct request.