[PATCH V7 0/1] nvme: add verbose error logging

Chaitanya Kulkarni chaitanyak at nvidia.com
Thu Feb 3 00:11:52 PST 2022


From: Chaitanya Kulkarni <kch at nvidia.com>

Hi,

I spent sometime on reviwing this patch, thought it will be
helpeful if I fix things while reviwing it along with
Christoph's comments.

Follwing is the original cover-letter:-

This patch improves logging for NVMe errors. Currently, we only get
a vague idea as to why commands fail since only the block layer status
is captured on error. This patch allows us to see why a command was
failed by the controller. This is very useful when debugging problems
in the field.

An example of an improved logged error:

[  183.333734] nvme0n1: Read(0x2) @ LBA 0, 1 blocks, Unrecovered Read Error (sct 0x2 / sc 0x81) DNR

[  227.767945] nvme0: Activate Firmware(0x10), Invalid Field in Command (sct 0x0 / sc 0x2) DNR

-ck

V7:
- Remove local variables from helpers nvme_get_error_status_str()
  nvme_get_opcode_str(), nvme_get_admin_opcode_str().
- Fix commit message of patch and remove the example since 
  cover-letter already has it.
- Un-export helper functions.
- Remove forward declaration of nvme_log_error().
- Move nvme_log_error() before its caller.
- Remove local variables initialization admin_op_str and op_str
  as they are getting overwritten by respective functions in
  nvme_log_error().
- Rename the error.c file to constants.c
- Fix the return type of the stubs
  nvme_get_error_status_str(u16 status),
  nvme_get_opcode_str(u8 opcode), and
  nvme_get_admin_opcode_str(u8 opcode) to const unsigned char *
  to match the what have in the constants.c.
- Add tabs before the comments for pr_err_ratelimited() functions
  arguments so that it will be consistent for nvme and admin commands.

V6:
- Create helpers that are stubbed out if
  CONFIG_NVME_VERBOSE_ERRORS is not set.
- Couple more nits.

V5:
- Change nvme_ops[] and nvme_admin_ops[] to a sparse array where the
  opcode is used as an index into the array.
- Various other nits.

V4:
- Adding logging for admin commands.
- Display NVMe Command Opcode value rather than generic block opcode.
- Various other nits.

V3:
- Don't populate nvme_errors[] array with NULL strings.

V2:
- Change nvme_errors[] to a sparse array where the status is used as
  an index into the array.

- Change pr_err() to pr_err_ratelimited().

- By enabling CONFIG_NVME_VERBOSE_ERRORS, the verbose status error is
  displayed.  If it is not enabled, then a message will still be
  displayed, but without the verbose status.

- Remove call to nvme_error_status() when determining whether to call
  nvme_error_log(). Speeds up the fast path just a bit.

Chaitanya Kulkarni (1):
  nvme-core: Add verbose error logging

 drivers/nvme/host/Kconfig     |   8 ++
 drivers/nvme/host/Makefile    |   2 +-
 drivers/nvme/host/constants.c | 185 ++++++++++++++++++++++++++++++++++
 drivers/nvme/host/core.c      |  34 +++++++
 drivers/nvme/host/nvme.h      |  19 ++++
 include/linux/nvme.h          |   1 +
 6 files changed, 248 insertions(+), 1 deletion(-)
 create mode 100644 drivers/nvme/host/constants.c

-- 
2.29.0




More information about the Linux-nvme mailing list