[PATCHv3 0/7] nvme: export additional diagnostic counters via sysfs
Chaitanya Kulkarni
chaitanyak at nvidia.com
Wed Feb 25 21:37:47 PST 2026
On 2/20/26 09:48, Nilay Shroff wrote:
> Hi,
>
> The NVMe driver encounters various events and conditions during normal
> operation that are either not tracked today or not exposed to userspace
> via sysfs. Lack of visibility into these events can make it difficult to
> diagnose subtle issues related to controller behavior, multipath
> stability, and I/O reliability.
>
> This patchset adds several diagnostic counters that provide improved
> observability into NVMe behavior. These counters are intended to help
> users understand events such as transient path unavailability,
> controller retries/reconnect/reset, failovers, and I/O failures. They
> can also be consumed by monitoring tools such as nvme-top.
>
> Specifically, this series proposes to export the following counters via
> sysfs:
> - Command retry count
> - Multipath failover count
> - Command error count
> - I/O requeue count
> - I/O failure count
> - Controller reset event counts
> - Controller reconnect counts
>
> The patchset consists of seven patches:
> Patch 1: Export command retry count
> Patch 2: Export multipath failover count
> Patch 3: Export command error count
> Patch 4: Export I/O requeue count
> Patch 5: Export I/O failure count
> Patch 6: Export controller reset event counts
> Patch 7: Export controller reconnect event count
The cover letter already lists the patches and their respective authors
automatically, so you can leave this list out next time.
For the whole series, looks good.
Reviewed-by: Chaitanya Kulkarni <kch at nvidia.com>
-ck