[PATCHv10 0/9] write hints with nvme fdp, scsi streams
Christoph Hellwig
hch at lst.de
Tue Nov 5 07:50:14 PST 2024
I've pushed my branch that tries to make this work with the XFS
data separation here:
http://git.infradead.org/?p=users/hch/xfs.git;a=shortlog;h=refs/heads/xfs-zoned-streams
This is basically my current WIP xfs zoned (aka always write out place)
work optimistically destined for 6.14 + the patch set in this thread +
a little fix to make it work for nvme-multipath plus the tiny patch to
wire it up.
The good news is that the API from Keith mostly works. I don't really
know how to cope with the streams per partition bitmap, and I suspect
this will need to be dealt with a bit better. One option might be
to always have a bitmap, which would also support discontiguous
write stream numbers as actually supported by the underlying NVMe
implementation, another option would be to always map to consecutive
numbers.
The bad news is that for file systems or applications to make full use
of the API we also really need an API to expose how much space is left
in a write stream, as otherwise they can easily get out of sync on
a power fail. I've left that code in as a TODO, it should not affect
basic testing.
We get the same kind of performance numbers as the ZNS support on
comparable hardware platforms, which is expected. Testing on an
actual state of the art non-prototype hardware will take more time
as the capacities are big enough that getting serious numbers will
take a lot more time.
More information about the Linux-nvme
mailing list