[PATCH] nvme: set physical block size to value discovered in Identify Namespace
Keith Busch
keith.busch at intel.com
Wed Sep 20 12:07:52 PDT 2017
On Wed, Sep 20, 2017 at 10:48:14AM -0700, Christoph Hellwig wrote:
> On Wed, Sep 20, 2017 at 01:48:42PM -0400, Keith Busch wrote:
> > I think what you're wanting to say is:
> >
> > The physical block size will default to the logical block size unless
> > otherwise specified. While NVMe doesn't provide a way to discover the
> > physical block size, the format with best relative performance is a
> > good indicator as to the underlying block size.
>
> I don't like this at all. It's a really nasty guesswork. If you need
> this to get reasonable performance out of a specific device please
> quirk it.
I don't think it's about "reasonable" performance; it's about getting
extra relative performance. What else can the best performing LBAF
indicate other than the device's preferred access alignment/granularity?
The spec provides this hint, so it's not really a guess, but maybe
there's a better way to make use of it instead of considering it to be
the physical block size? io_opt?
On a slightly related topic, I think we should fix the consistency
in what's reported in the queue's attributes after reformatting the
namespace. Check out the following for what happens today:
Start with a 512b format:
# cat /sys/block/nvme0n1/queue/{minimum_io_size,logical_block_size,physical_block_size}
512
512
512
Format it to 4k:
# cat /sys/block/nvme0n1/queue/{minimum_io_size,logical_block_size,physical_block_size}
4096
4096
4096
Format it back to 512b:
# cat /sys/block/nvme0n1/queue/{minimum_io_size,logical_block_size,physical_block_size}
4096
512
4096
The first and last are the exact same NVMe format, but they're reported
differently.
More information about the Linux-nvme
mailing list