Sequential read from NVMe/XFS twice slower on Fedora 42 than on Rocky 9.5
Dave Chinner
david at fromorbit.com
Sat May 3 15:16:18 PDT 2025
On Sun, May 04, 2025 at 12:04:16AM +0300, Anton Gavriliuk wrote:
> There are 12 Kioxia CM-7 NVMe SSDs configured in mdadm/raid0 and
> mounted to /mnt.
>
> Exactly the same fio command running under Fedora 42
> (6.14.5-300.fc42.x86_64) and then under Rocky 9.5
> (5.14.0-503.40.1.el9_5.x86_64) shows twice the performance difference.
>
> /mnt/testfile size 1TB
> server's total dram 192GB
>
> Fedora 42
>
> [root at localhost ~]# fio --name=test --rw=read --bs=256k
> --filename=/mnt/testfile --direct=1 --numjobs=1 --iodepth=64 --exitall
> --group_reporting --ioengine=libaio --runtime=30 --time_based
> test: (g=0): rw=read, bs=(R) 256KiB-256KiB, (W) 256KiB-256KiB, (T)
> 256KiB-256KiB, ioengine=libaio, iodepth=64
> fio-3.39-44-g19d9
> Starting 1 process
> Jobs: 1 (f=1): [R(1)][100.0%][r=49.6GiB/s][r=203k IOPS][eta 00m:00s]
> test: (groupid=0, jobs=1): err= 0: pid=2465: Sat May 3 17:51:24 2025
> read: IOPS=203k, BW=49.6GiB/s (53.2GB/s)(1487GiB/30001msec)
> slat (usec): min=3, max=1053, avg= 4.60, stdev= 1.76
> clat (usec): min=104, max=4776, avg=310.53, stdev=29.49
> lat (usec): min=110, max=4850, avg=315.13, stdev=29.82
> Rocky 9.5
>
> [root at localhost ~]# fio --name=test --rw=read --bs=256k
> --filename=/mnt/testfile --direct=1 --numjobs=1 --iodepth=64 --exitall
> --group_reporting --ioengine=libaio --runtime=30 --time_based
> test: (g=0): rw=read, bs=(R) 256KiB-256KiB, (W) 256KiB-256KiB, (T)
> 256KiB-256KiB, ioengine=libaio, iodepth=64
> fio-3.39-44-g19d9
> Starting 1 process
> Jobs: 1 (f=1): [R(1)][100.0%][r=96.0GiB/s][r=393k IOPS][eta 00m:00s]
> test: (groupid=0, jobs=1): err= 0: pid=15467: Sun May 4 00:00:39 2025
> read: IOPS=390k, BW=95.3GiB/s (102GB/s)(2860GiB/30001msec)
> slat (nsec): min=1111, max=183816, avg=2117.94, stdev=1412.34
> clat (usec): min=81, max=1086, avg=161.60, stdev=19.67
> lat (usec): min=82, max=1240, avg=163.72, stdev=19.73
>
Completely latency has doubled on the fc42 kernel. For a read, there
isn't much in terms of filesystem work to be done on direct IO
completion, so I'm not sure this is a filesystem issue...
What's the comparitive performance of an identical read profile
directly on the raw MD raid0 device?
-Dave.
--
Dave Chinner
david at fromorbit.com
More information about the Linux-nvme
mailing list