[PATCH 3/3] nvme: add test for controller rescan under I/O load

Nilay Shroff nilay at linux.ibm.com
Fri Aug 23 03:18:32 PDT 2024



On 8/23/24 01:08, Martin Wilck wrote:
> Add a test that repeatedly rescans nvme controllers while doing IO
> on an nvme namespace connected to these controllers. The purpose
> of the test is to make sure that no I/O errors or data corruption
> occurs because of the rescan operations.
> 
> Signed-off-by: Martin Wilck <mwilck at suse.com>
> ---
>  tests/nvme/053 | 56 ++++++++++++++++++++++++++++++++++++++++++++++++++
>  tests/nvme/rc  | 18 ++++++++++++++++
>  2 files changed, 74 insertions(+)
>  create mode 100755 tests/nvme/053
> 
> diff --git a/tests/nvme/053 b/tests/nvme/053
> new file mode 100755
> index 0000000..41dc8f2
> --- /dev/null
> +++ b/tests/nvme/053
> @@ -0,0 +1,56 @@
> +#!/bin/bash
> +# SPDX-License-Identifier: GPL-3.0+
> +# Copyright (C) 2024 Martin Wilck, SUSE LLC
> +
> +. tests/nvme/rc
> +
> +DESCRIPTION="test controller rescan under I/O load"
> +TIMED=1
> +: "${TIMEOUT:=60}"
> +
> +rescan_controller() {
> +	local finish
> +
> +	[[ -f "$1/rescan_controller" ]] || {
> +		echo "cannot rescan $1"
> +		return 1
> +	}
> +
> +	finish=$(($(date +%s) + TIMEOUT))
> +	while [[ $(date +%s) -le $finish ]]; do
> +		# sleep interval between 0.1 and 5s
> +		usleep "$(((RANDOM%50 + 1)*100000))"
> +		echo 1 >"$1/rescan_controller"
> +	done
> +}
I think here usleep may not be available by default on all systems.
For instance, on fedora/rhel I don't have usleep installed in the 
defualt configuration and so I have to first install it. So you may
want to add "usleep" as per-requisite for this test. Moreover, after 
I installed usleep on fedora and ran the above test I see this warning:

warning: usleep is deprecated, and will be removed in near future!

Due to above warning the test fails. So is it possible to replace 
usleep with sleep?
 
> +
> +test_device() {
> +	local -a ctrls
> +	local c
> +
> +	echo "Running ${TEST_NAME}"
> +	ctrls=($(_nvme_get_ctrl_list))
> +
> +	_run_fio_verify_io --filename="$TEST_DEV" --time_based &> "$FULL" &
> +
> +	for c in "${ctrls[@]}"; do
> +		rescan_controller "$c" &
> +	done
> +
> +	while true; do
> +		wait -n &>/dev/null
> +		st=$?
> +		case $st in
> +			127)
> +				break
> +				;;
> +			0)
> +				;;
> +			*)
> +				echo "child process exited with $st!"
> +				;;
> +		esac
> +	done
> +
> +	echo "Test complete"
> +}
> diff --git a/tests/nvme/rc b/tests/nvme/rc
> index e7d2ab1..93b0571 100644
> --- a/tests/nvme/rc
> +++ b/tests/nvme/rc
> @@ -192,6 +192,24 @@ _test_dev_nvme_nsid() {
>  	cat "${TEST_DEV_SYSFS}/nsid"
>  }
>  
> +_nvme_get_ctrl_list() {
> +	local subsys
> +	local c
> +
> +	subsys=$(readlink  "${TEST_DEV_SYSFS}/device/subsystem")
> +	case $subsys in
> +		*/nvme)
> +			readlink -f "${TEST_DEV_SYSFS}/device"
> +			;;
> +		*/nvme-subsystem)
> +			for c in "${TEST_DEV_SYSFS}"/device/nvme*; do
> +				[[ -L "$c" ]] || continue
> +				[[ -f "$c/dev" ]] && readlink -f "$c"
> +			done
> +			;;
> +	esac
> +}
> +
I don't know if I am missing anything here but just curious to know 
for which case $subsys would point to link ending in */nvme?
I think that for all cases $subsys shall point to link which ends 
in */nvme-subsystem, isn't it? I assume here that $TEST_DEV_SYSFS would 
always resolve to a nvme block device.

And the last point: I don't see 053.out file in your patchset. Did you forget
to add this file?

Thanks,
--Nilay







More information about the Linux-nvme mailing list