[axboe-block:for-next] [block] 1122c0c1cc: aim7.jobs-per-min 22.6% improvement

kernel test robot oliver.sang at intel.com
Mon Jun 24 19:28:54 PDT 2024



Hello,

kernel test robot noticed a 22.6% improvement of aim7.jobs-per-min on:


commit: 1122c0c1cc71f740fa4d5f14f239194e06a1d5e7 ("block: move cache control settings out of queue->flags")
https://git.kernel.org/cgit/linux/kernel/git/axboe/linux-block.git for-next

testcase: aim7
test machine: 96 threads 2 sockets Intel(R) Xeon(R) Platinum 8260L CPU @ 2.40GHz (Cascade Lake) with 128G memory
parameters:

	disk: 4BRD_12G
	md: RAID0
	fs: xfs
	test: sync_disk_rw
	load: 300
	cpufreq_governor: performance






Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240625/202406250948.e0044f1d-oliver.sang@intel.com

=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/load/md/rootfs/tbox_group/test/testcase:
  gcc-13/performance/4BRD_12G/xfs/x86_64-rhel-8.3/300/RAID0/debian-12-x86_64-20240206.cgz/lkp-csl-2sp3/sync_disk_rw/aim7

commit: 
  70905f8706 ("block: remove blk_flush_policy")
  1122c0c1cc ("block: move cache control settings out of queue->flags")

70905f8706b62113 1122c0c1cc71f740fa4d5f14f23 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    153.19           -13.3%     132.81        uptime.boot
   2.8e+09           -11.9%  2.466e+09        cpuidle..time
  21945319 ±  2%     -40.4%   13076160        cpuidle..usage
     29.31            +7.8%      31.58 ±  2%  iostat.cpu.idle
     69.87            -3.6%      67.35        iostat.cpu.system
      0.04 ±  4%      +0.0        0.08 ±  5%  mpstat.cpu.all.iowait%
      0.78 ±  2%      +0.2        0.99 ±  2%  mpstat.cpu.all.usr%
     52860 ± 49%     -78.2%      11536 ± 78%  numa-numastat.node0.other_node
     46804 ± 56%     +88.4%      88190 ± 10%  numa-numastat.node1.other_node
    955871 ± 10%     -43.3%     542216 ± 14%  numa-meminfo.node1.Active
    955871 ± 10%     -43.3%     542216 ± 14%  numa-meminfo.node1.Active(anon)
   1015354 ± 10%     -34.7%     662696 ± 13%  numa-meminfo.node1.Shmem
      6008           -14.3%       5146 ±  2%  perf-c2c.DRAM.remote
      7889           -12.4%       6908 ±  2%  perf-c2c.HITM.local
      3839           -16.5%       3203 ±  2%  perf-c2c.HITM.remote
     11728           -13.8%      10112 ±  2%  perf-c2c.HITM.total
    695109           +20.5%     837625        vmstat.io.bo
    105.99 ±  7%     -23.7%      80.83 ± 11%  vmstat.procs.r
    803244           -30.9%     555360        vmstat.system.cs
    209736           -12.9%     182626        vmstat.system.in
      1448 ± 89%    +207.9%       4459 ±  6%  numa-vmstat.node0.nr_page_table_pages
     52860 ± 49%     -78.2%      11536 ± 78%  numa-vmstat.node0.numa_other
    239214 ± 10%     -43.6%     134883 ± 13%  numa-vmstat.node1.nr_active_anon
    254124 ± 10%     -34.9%     165421 ± 13%  numa-vmstat.node1.nr_shmem
    239214 ± 10%     -43.6%     134883 ± 13%  numa-vmstat.node1.nr_zone_active_anon
     46805 ± 56%     +88.4%      88190 ± 10%  numa-vmstat.node1.numa_other
     17374           +22.6%      21299        aim7.jobs-per-min
    103.64           -18.4%      84.58        aim7.time.elapsed_time
    103.64           -18.4%      84.58        aim7.time.elapsed_time.max
   4641240           -83.4%     770073        aim7.time.involuntary_context_switches
     32705            -4.3%      31289 ±  2%  aim7.time.minor_page_faults
      6562            -3.1%       6359        aim7.time.percent_of_cpu_this_job_got
      6775           -21.0%       5351 ±  2%  aim7.time.system_time
  49095202           -38.3%   30299361        aim7.time.voluntary_context_switches
   1297567           -37.0%     817692        meminfo.Active
   1297567           -37.0%     817692        meminfo.Active(anon)
     97760 ±  5%     -23.4%      74859 ± 20%  meminfo.AnonHugePages
   2390317           -15.3%    2024905        meminfo.Committed_AS
    884407           +11.9%     989723        meminfo.Inactive
    743152 ±  2%     +14.8%     853331        meminfo.Inactive(anon)
    159265 ±  8%     +38.6%     220668 ±  3%  meminfo.Mapped
   1382079           -27.1%    1007445        meminfo.Shmem
    324534           -37.2%     203663 ±  2%  proc-vmstat.nr_active_anon
   1165686            -8.2%    1070277        proc-vmstat.nr_file_pages
    185928 ±  2%     +14.9%     213697        proc-vmstat.nr_inactive_anon
     35436            -2.9%      34420        proc-vmstat.nr_inactive_file
     40463 ±  8%     +38.2%      55918 ±  3%  proc-vmstat.nr_mapped
    345824           -27.3%     251424        proc-vmstat.nr_shmem
     28871            -1.4%      28477        proc-vmstat.nr_slab_reclaimable
    324534           -37.2%     203663 ±  2%  proc-vmstat.nr_zone_active_anon
    185928 ±  2%     +14.9%     213697        proc-vmstat.nr_zone_inactive_anon
     35436            -2.9%      34420        proc-vmstat.nr_zone_inactive_file
   5120744            -2.4%    4996195        proc-vmstat.numa_hit
   5020486            -2.5%    4896473        proc-vmstat.numa_local
    207026 ± 10%     +50.2%     310941        proc-vmstat.pgactivate
   5196440            -2.7%    5057618        proc-vmstat.pgalloc_normal
    763396 ±  6%     -11.8%     673464        proc-vmstat.pgfault
  74254490            -1.3%   73292473        proc-vmstat.pgpgout
     11.25 ± 24%     -60.0%       4.50 ± 29%  sched_debug.cfs_rq:/.h_nr_running.max
      1.59 ± 20%     -42.7%       0.91 ± 13%  sched_debug.cfs_rq:/.h_nr_running.stddev
    968.29 ±  5%     -13.2%     840.04 ±  5%  sched_debug.cfs_rq:/.runnable_avg.avg
      5533 ± 21%     -47.1%       2925 ± 21%  sched_debug.cfs_rq:/.runnable_avg.max
    798.88 ± 13%     -38.3%     492.63 ±  9%  sched_debug.cfs_rq:/.runnable_avg.stddev
    578.50 ±  5%      -9.9%     521.30 ±  4%  sched_debug.cfs_rq:/.util_avg.avg
      3120 ± 20%     -40.3%       1862 ± 19%  sched_debug.cfs_rq:/.util_avg.max
    479.36 ± 12%     -30.4%     333.40 ±  8%  sched_debug.cfs_rq:/.util_avg.stddev
      4592 ± 24%     -51.8%       2215 ± 31%  sched_debug.cfs_rq:/.util_est.max
    615.47 ± 21%     -35.7%     395.64 ± 15%  sched_debug.cfs_rq:/.util_est.stddev
     11.33 ± 24%     -58.8%       4.67 ± 26%  sched_debug.cpu.nr_running.max
      1.62 ± 20%     -42.6%       0.93 ± 11%  sched_debug.cpu.nr_running.stddev
    224323           -28.2%     161088        sched_debug.cpu.nr_switches.avg
    242363 ±  2%     -27.9%     174695 ±  2%  sched_debug.cpu.nr_switches.max
    197870 ±  2%     -27.6%     143186        sched_debug.cpu.nr_switches.min
      7911 ± 19%     -33.1%       5295 ± 10%  sched_debug.cpu.nr_switches.stddev
      1.23            -4.8%       1.17        perf-stat.i.MPKI
 1.105e+10            +5.6%  1.167e+10        perf-stat.i.branch-instructions
      1.20 ±  2%      +0.1        1.29 ±  2%  perf-stat.i.branch-miss-rate%
    820863           -30.7%     569230        perf-stat.i.context-switches
      3.79           -10.2%       3.41        perf-stat.i.cpi
 2.176e+11            -3.2%  2.106e+11        perf-stat.i.cpu-cycles
    212040           -27.8%     153137        perf-stat.i.cpu-migrations
 5.416e+10            +6.8%  5.785e+10        perf-stat.i.instructions
      0.32           +11.8%       0.36        perf-stat.i.ipc
      0.05 ± 77%    +233.9%       0.17 ± 50%  perf-stat.i.major-faults
     10.74           -30.2%       7.50        perf-stat.i.metric.K/sec
      1.28            -4.3%       1.22        perf-stat.overall.MPKI
      4.02            -9.4%       3.64        perf-stat.overall.cpi
      3145            -5.3%       2979        perf-stat.overall.cycles-between-cache-misses
      0.25           +10.3%       0.27        perf-stat.overall.ipc
 1.094e+10            +5.4%  1.153e+10        perf-stat.ps.branch-instructions
    812563           -30.8%     562343        perf-stat.ps.context-switches
 2.156e+11            -3.4%  2.082e+11        perf-stat.ps.cpu-cycles
    209965           -28.0%     151248        perf-stat.ps.cpu-migrations
 5.365e+10            +6.6%  5.717e+10        perf-stat.ps.instructions
 5.641e+12           -13.1%  4.905e+12 ±  2%  perf-stat.total.instructions
     14.88 ±  5%     -14.9        0.00        perf-profile.calltrace.cycles-pp.blkdev_issue_flush.xfs_file_fsync.xfs_file_buffered_write.vfs_write.ksys_write
     14.86 ±  5%     -14.9        0.00        perf-profile.calltrace.cycles-pp.submit_bio_wait.blkdev_issue_flush.xfs_file_fsync.xfs_file_buffered_write.vfs_write
     14.77 ±  5%     -14.8        0.00        perf-profile.calltrace.cycles-pp.__submit_bio_noacct.submit_bio_wait.blkdev_issue_flush.xfs_file_fsync.xfs_file_buffered_write
     14.76 ±  5%     -14.8        0.00        perf-profile.calltrace.cycles-pp.__submit_bio.__submit_bio_noacct.submit_bio_wait.blkdev_issue_flush.xfs_file_fsync
     14.74 ±  5%     -14.7        0.00        perf-profile.calltrace.cycles-pp.md_handle_request.__submit_bio.__submit_bio_noacct.submit_bio_wait.blkdev_issue_flush
     14.72 ±  5%     -14.7        0.00        perf-profile.calltrace.cycles-pp.raid0_make_request.md_handle_request.__submit_bio.__submit_bio_noacct.submit_bio_wait
     14.71 ±  5%     -14.7        0.00        perf-profile.calltrace.cycles-pp.md_flush_request.raid0_make_request.md_handle_request.__submit_bio.__submit_bio_noacct
     13.32 ±  5%     -13.3        0.00        perf-profile.calltrace.cycles-pp._raw_spin_lock_irq.md_flush_request.raid0_make_request.md_handle_request.__submit_bio
     13.25 ±  5%     -13.3        0.00        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irq.md_flush_request.raid0_make_request.md_handle_request
      9.70 ±  3%      -1.1        8.61 ±  3%  perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64
      9.70 ±  3%      -1.1        8.61 ±  3%  perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64
      9.70 ±  3%      -1.1        8.61 ±  3%  perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64
      9.80 ±  3%      -1.1        8.71 ±  3%  perf-profile.calltrace.cycles-pp.common_startup_64
      9.12 ±  3%      -1.0        8.15 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64
      8.95 ±  3%      -0.9        8.01 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
      8.95 ±  3%      -0.9        8.02 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
      2.21            -0.4        1.78 ±  2%  perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      2.22            -0.4        1.79 ±  2%  perf-profile.calltrace.cycles-pp.kthread.ret_from_fork.ret_from_fork_asm
      2.22            -0.4        1.79 ±  2%  perf-profile.calltrace.cycles-pp.ret_from_fork.ret_from_fork_asm
      2.22            -0.4        1.79 ±  2%  perf-profile.calltrace.cycles-pp.ret_from_fork_asm
      2.08            -0.4        1.68 ±  2%  perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      3.09            -0.2        2.86 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.remove_wait_queue.xlog_wait_on_iclog.xfs_log_force_seq
      3.10            -0.2        2.87 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.remove_wait_queue.xlog_wait_on_iclog.xfs_log_force_seq.xfs_file_fsync
      3.10            -0.2        2.87 ±  2%  perf-profile.calltrace.cycles-pp.remove_wait_queue.xlog_wait_on_iclog.xfs_log_force_seq.xfs_file_fsync.xfs_file_buffered_write
      3.44            -0.2        3.23 ±  4%  perf-profile.calltrace.cycles-pp.xlog_wait_on_iclog.xfs_log_force_seq.xfs_file_fsync.xfs_file_buffered_write.vfs_write
      0.95            +0.1        1.04        perf-profile.calltrace.cycles-pp.mutex_spin_on_owner.__mutex_lock.__flush_workqueue.xlog_cil_push_now.xlog_cil_force_seq
      0.57            +0.1        0.71 ±  2%  perf-profile.calltrace.cycles-pp.iomap_file_buffered_write.xfs_file_buffered_write.vfs_write.ksys_write.do_syscall_64
      0.58 ±  2%      +0.3        0.84 ±  3%  perf-profile.calltrace.cycles-pp.xfs_end_ioend.xfs_end_io.process_one_work.worker_thread.kthread
      0.59 ±  2%      +0.3        0.85 ±  2%  perf-profile.calltrace.cycles-pp.xfs_end_io.process_one_work.worker_thread.kthread.ret_from_fork
      0.90 ±  2%      +0.4        1.27 ±  3%  perf-profile.calltrace.cycles-pp.__submit_bio_noacct.iomap_submit_ioend.iomap_writepages.xfs_vm_writepages.do_writepages
      0.88 ±  2%      +0.4        1.26 ±  3%  perf-profile.calltrace.cycles-pp.__submit_bio.__submit_bio_noacct.iomap_submit_ioend.iomap_writepages.xfs_vm_writepages
      0.92 ±  3%      +0.4        1.30 ±  3%  perf-profile.calltrace.cycles-pp.iomap_submit_ioend.iomap_writepages.xfs_vm_writepages.do_writepages.filemap_fdatawrite_wbc
      0.57 ±  3%      +0.4        0.95 ±  6%  perf-profile.calltrace.cycles-pp.xlog_cil_commit.__xfs_trans_commit.xfs_vn_update_time.kiocb_modified.xfs_file_write_checks
      0.64 ±  3%      +0.4        1.03 ±  6%  perf-profile.calltrace.cycles-pp.__xfs_trans_commit.xfs_vn_update_time.kiocb_modified.xfs_file_write_checks.xfs_file_buffered_write
      6.90 ±  2%      +0.5        7.40 ±  3%  perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
      0.92 ±  4%      +0.5        1.43 ±  6%  perf-profile.calltrace.cycles-pp.xfs_vn_update_time.kiocb_modified.xfs_file_write_checks.xfs_file_buffered_write.vfs_write
      0.00            +0.5        0.52        perf-profile.calltrace.cycles-pp.complete.__flush_workqueue.xlog_cil_push_now.xlog_cil_force_seq.xfs_log_force_seq
      0.94 ±  4%      +0.5        1.46 ±  6%  perf-profile.calltrace.cycles-pp.kiocb_modified.xfs_file_write_checks.xfs_file_buffered_write.vfs_write.ksys_write
      0.96 ±  4%      +0.5        1.48 ±  6%  perf-profile.calltrace.cycles-pp.xfs_file_write_checks.xfs_file_buffered_write.vfs_write.ksys_write.do_syscall_64
      0.00            +0.5        0.54 ±  2%  perf-profile.calltrace.cycles-pp.xfs_iomap_write_unwritten.xfs_end_ioend.xfs_end_io.process_one_work.worker_thread
      0.00            +0.5        0.55 ±  2%  perf-profile.calltrace.cycles-pp.iomap_write_iter.iomap_file_buffered_write.xfs_file_buffered_write.vfs_write.ksys_write
      0.00            +0.6        0.56 ± 10%  perf-profile.calltrace.cycles-pp.__folio_start_writeback.iomap_writepage_map.iomap_writepages.xfs_vm_writepages.do_writepages
      0.00            +0.6        0.57 ±  6%  perf-profile.calltrace.cycles-pp.__folio_end_writeback.folio_end_writeback.iomap_finish_ioend.md_end_clone_io.__submit_bio
      0.00            +0.6        0.58 ±  7%  perf-profile.calltrace.cycles-pp.folio_end_writeback.iomap_finish_ioend.md_end_clone_io.__submit_bio.__submit_bio_noacct
      0.00            +0.6        0.60 ±  6%  perf-profile.calltrace.cycles-pp.iomap_finish_ioend.md_end_clone_io.__submit_bio.__submit_bio_noacct.iomap_submit_ioend
      0.08 ±223%      +0.6        0.72 ±  5%  perf-profile.calltrace.cycles-pp.md_end_clone_io.__submit_bio.__submit_bio_noacct.iomap_submit_ioend.iomap_writepages
      1.45 ±  4%      +0.7        2.15 ±  4%  perf-profile.calltrace.cycles-pp.iomap_writepages.xfs_vm_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range
      1.46 ±  4%      +0.7        2.16 ±  4%  perf-profile.calltrace.cycles-pp.xfs_vm_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range
      1.48 ±  4%      +0.7        2.18 ±  4%  perf-profile.calltrace.cycles-pp.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.xfs_file_fsync
      1.51 ±  4%      +0.7        2.22 ±  4%  perf-profile.calltrace.cycles-pp.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.xfs_file_fsync.xfs_file_buffered_write
      1.51 ±  3%      +0.7        2.23 ±  4%  perf-profile.calltrace.cycles-pp.__filemap_fdatawrite_range.file_write_and_wait_range.xfs_file_fsync.xfs_file_buffered_write.vfs_write
      0.00            +0.7        0.72 ±  7%  perf-profile.calltrace.cycles-pp.iomap_writepage_map.iomap_writepages.xfs_vm_writepages.do_writepages.filemap_fdatawrite_wbc
      1.60 ±  3%      +0.8        2.36 ±  4%  perf-profile.calltrace.cycles-pp.file_write_and_wait_range.xfs_file_fsync.xfs_file_buffered_write.vfs_write.ksys_write
     85.48            +0.8       86.24        perf-profile.calltrace.cycles-pp.xfs_file_fsync.xfs_file_buffered_write.vfs_write.ksys_write.do_syscall_64
     87.06            +1.4       88.49        perf-profile.calltrace.cycles-pp.xfs_file_buffered_write.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
     87.18            +1.5       88.64        perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
     87.36            +1.5       88.82        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
     87.19            +1.5       88.65        perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
     87.36            +1.5       88.82        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write
     87.62            +1.5       89.10        perf-profile.calltrace.cycles-pp.write
     56.74           +13.7       70.42        perf-profile.calltrace.cycles-pp.osq_lock.__mutex_lock.__flush_workqueue.xlog_cil_push_now.xlog_cil_force_seq
     57.89           +13.8       71.74        perf-profile.calltrace.cycles-pp.__mutex_lock.__flush_workqueue.xlog_cil_push_now.xlog_cil_force_seq.xfs_log_force_seq
     60.36           +14.6       74.96        perf-profile.calltrace.cycles-pp.__flush_workqueue.xlog_cil_push_now.xlog_cil_force_seq.xfs_log_force_seq.xfs_file_fsync
     61.48           +14.6       76.09        perf-profile.calltrace.cycles-pp.xlog_cil_push_now.xlog_cil_force_seq.xfs_log_force_seq.xfs_file_fsync.xfs_file_buffered_write
     68.74           +14.8       83.60        perf-profile.calltrace.cycles-pp.xfs_log_force_seq.xfs_file_fsync.xfs_file_buffered_write.vfs_write.ksys_write
     64.97           +15.1       80.03        perf-profile.calltrace.cycles-pp.xlog_cil_force_seq.xfs_log_force_seq.xfs_file_fsync.xfs_file_buffered_write.vfs_write
     14.86 ±  5%     -14.9        0.00        perf-profile.children.cycles-pp.submit_bio_wait
     14.96 ±  5%     -14.8        0.12 ±  4%  perf-profile.children.cycles-pp.md_handle_request
     14.94 ±  5%     -14.8        0.11 ±  3%  perf-profile.children.cycles-pp.raid0_make_request
     14.83 ±  5%     -14.8        0.00        perf-profile.children.cycles-pp.md_flush_request
     14.88 ±  5%     -14.8        0.06 ±  6%  perf-profile.children.cycles-pp.blkdev_issue_flush
     15.82 ±  5%     -14.5        1.32 ±  3%  perf-profile.children.cycles-pp.__submit_bio_noacct
     15.81 ±  5%     -14.5        1.31 ±  3%  perf-profile.children.cycles-pp.__submit_bio
     13.86 ±  5%     -13.6        0.29 ±  3%  perf-profile.children.cycles-pp._raw_spin_lock_irq
     22.32 ±  3%     -13.1        9.23 ±  4%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      1.96 ±  9%      -1.5        0.49 ±  4%  perf-profile.children.cycles-pp.intel_idle_irq
      9.70 ±  3%      -1.1        8.61 ±  3%  perf-profile.children.cycles-pp.start_secondary
      9.80 ±  3%      -1.1        8.71 ±  3%  perf-profile.children.cycles-pp.common_startup_64
      9.80 ±  3%      -1.1        8.71 ±  3%  perf-profile.children.cycles-pp.cpu_startup_entry
      9.79 ±  3%      -1.1        8.71 ±  3%  perf-profile.children.cycles-pp.do_idle
      9.20 ±  3%      -1.0        8.25 ±  3%  perf-profile.children.cycles-pp.cpuidle_idle_call
      9.04 ±  3%      -0.9        8.11 ±  3%  perf-profile.children.cycles-pp.cpuidle_enter
      9.04 ±  3%      -0.9        8.11 ±  3%  perf-profile.children.cycles-pp.cpuidle_enter_state
      2.21            -0.4        1.78 ±  2%  perf-profile.children.cycles-pp.worker_thread
      2.22            -0.4        1.79 ±  2%  perf-profile.children.cycles-pp.kthread
      2.22            -0.4        1.79 ±  2%  perf-profile.children.cycles-pp.ret_from_fork
      2.22            -0.4        1.79 ±  2%  perf-profile.children.cycles-pp.ret_from_fork_asm
      2.08            -0.4        1.68 ±  2%  perf-profile.children.cycles-pp.process_one_work
      0.57            -0.3        0.24        perf-profile.children.cycles-pp.__wake_up
      0.63            -0.3        0.32 ±  2%  perf-profile.children.cycles-pp.__wake_up_common
      1.26            -0.3        0.99        perf-profile.children.cycles-pp.try_to_wake_up
      3.56 ±  2%      -0.2        3.34 ±  4%  perf-profile.children.cycles-pp.xlog_wait_on_iclog
      0.46 ±  2%      -0.1        0.36 ±  2%  perf-profile.children.cycles-pp.select_task_rq
      0.86 ±  3%      -0.1        0.75 ±  2%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      0.43 ±  2%      -0.1        0.33 ±  2%  perf-profile.children.cycles-pp.select_task_rq_fair
      0.64            -0.1        0.55 ±  2%  perf-profile.children.cycles-pp.ttwu_do_activate
      0.71 ±  3%      -0.1        0.62 ±  3%  perf-profile.children.cycles-pp.activate_task
      0.57            -0.1        0.48        perf-profile.children.cycles-pp.__flush_smp_call_function_queue
      0.17 ±  2%      -0.1        0.08        perf-profile.children.cycles-pp.xlog_state_release_iclog
      0.48            -0.1        0.41 ±  2%  perf-profile.children.cycles-pp.sched_ttwu_pending
      0.61 ±  3%      -0.1        0.54 ±  3%  perf-profile.children.cycles-pp.enqueue_task_fair
      0.28 ±  3%      -0.1        0.21 ±  3%  perf-profile.children.cycles-pp.select_idle_sibling
      0.19            -0.1        0.13 ±  2%  perf-profile.children.cycles-pp.schedule_idle
      0.22 ±  3%      -0.1        0.16 ±  4%  perf-profile.children.cycles-pp.select_idle_cpu
      0.47 ±  4%      -0.1        0.41 ±  5%  perf-profile.children.cycles-pp.update_load_avg
      0.35 ±  2%      -0.1        0.29 ±  2%  perf-profile.children.cycles-pp.flush_smp_call_function_queue
      0.42 ±  3%      -0.1        0.37 ±  2%  perf-profile.children.cycles-pp.enqueue_entity
      0.11 ±  6%      -0.1        0.06 ±  8%  perf-profile.children.cycles-pp.finish_task_switch
      0.18 ±  5%      -0.0        0.13 ±  5%  perf-profile.children.cycles-pp.available_idle_cpu
      0.33            -0.0        0.28        perf-profile.children.cycles-pp.xlog_write
      0.12 ±  3%      -0.0        0.07 ±  5%  perf-profile.children.cycles-pp.xlog_write_partial
      0.30 ±  3%      -0.0        0.25 ±  3%  perf-profile.children.cycles-pp.asm_sysvec_call_function_single
      0.12 ±  4%      -0.0        0.07 ±  5%  perf-profile.children.cycles-pp.xlog_write_get_more_iclog_space
      0.37 ±  5%      -0.0        0.32 ±  8%  perf-profile.children.cycles-pp.dequeue_entity
      0.08            -0.0        0.03 ± 70%  perf-profile.children.cycles-pp.__cond_resched
      0.46            -0.0        0.41        perf-profile.children.cycles-pp.xlog_cil_push_work
      0.27 ±  3%      -0.0        0.23 ±  3%  perf-profile.children.cycles-pp.sysvec_call_function_single
      0.08 ±  6%      -0.0        0.04 ± 44%  perf-profile.children.cycles-pp.select_idle_core
      0.26 ±  2%      -0.0        0.22 ±  3%  perf-profile.children.cycles-pp.__sysvec_call_function_single
      0.12 ±  3%      -0.0        0.09 ±  5%  perf-profile.children.cycles-pp.queue_work_on
      0.14 ±  3%      -0.0        0.12 ±  6%  perf-profile.children.cycles-pp.prepare_task_switch
      0.12 ±  3%      -0.0        0.09        perf-profile.children.cycles-pp.ttwu_queue_wakelist
      0.26 ±  5%      -0.0        0.23 ±  6%  perf-profile.children.cycles-pp.update_curr
      0.12            -0.0        0.10 ±  5%  perf-profile.children.cycles-pp.perf_trace_sched_wakeup_template
      0.13 ±  3%      -0.0        0.11        perf-profile.children.cycles-pp.wake_affine
      0.08 ±  4%      -0.0        0.06 ±  8%  perf-profile.children.cycles-pp.set_next_entity
      0.10 ±  5%      -0.0        0.07 ±  6%  perf-profile.children.cycles-pp.kick_pool
      0.11 ±  4%      -0.0        0.09 ±  4%  perf-profile.children.cycles-pp.__queue_work
      0.10 ±  3%      -0.0        0.08 ±  4%  perf-profile.children.cycles-pp.__switch_to_asm
      0.10 ±  4%      -0.0        0.08 ±  6%  perf-profile.children.cycles-pp.switch_mm_irqs_off
      0.07            -0.0        0.05        perf-profile.children.cycles-pp.__smp_call_single_queue
      0.11            -0.0        0.09        perf-profile.children.cycles-pp.xlog_cil_set_ctx_write_state
      0.10            -0.0        0.08 ±  4%  perf-profile.children.cycles-pp.task_h_load
      0.08 ±  4%      -0.0        0.06        perf-profile.children.cycles-pp.sched_mm_cid_migrate_to
      0.08 ±  4%      -0.0        0.06        perf-profile.children.cycles-pp.set_task_cpu
      0.07 ±  5%      -0.0        0.05        perf-profile.children.cycles-pp.__switch_to
      0.13 ±  4%      -0.0        0.11 ±  3%  perf-profile.children.cycles-pp.menu_select
      0.13 ±  6%      -0.0        0.11 ±  5%  perf-profile.children.cycles-pp.reweight_entity
      0.11            -0.0        0.09 ±  4%  perf-profile.children.cycles-pp.xlog_cil_write_commit_record
      0.06 ±  6%      -0.0        0.05        perf-profile.children.cycles-pp.___perf_sw_event
      0.08 ±  5%      -0.0        0.07 ±  6%  perf-profile.children.cycles-pp.avg_vruntime
      0.06            -0.0        0.05        perf-profile.children.cycles-pp.perf_tp_event
      0.06            -0.0        0.05        perf-profile.children.cycles-pp.place_entity
      0.06            -0.0        0.05        perf-profile.children.cycles-pp.sched_clock
      0.05            +0.0        0.06        perf-profile.children.cycles-pp.rep_movs_alternative
      0.05            +0.0        0.06 ±  6%  perf-profile.children.cycles-pp.kfree
      0.06            +0.0        0.07 ±  5%  perf-profile.children.cycles-pp.copy_page_from_iter_atomic
      0.10 ±  3%      +0.0        0.12 ±  4%  perf-profile.children.cycles-pp.xfs_inode_item_format_data_fork
      0.05            +0.0        0.06 ±  7%  perf-profile.children.cycles-pp.xfs_trans_read_buf_map
      0.06            +0.0        0.07 ±  6%  perf-profile.children.cycles-pp.xfs_btree_lookup_get_block
      0.07 ±  5%      +0.0        0.08 ±  5%  perf-profile.children.cycles-pp.filemap_get_entry
      0.09 ±  5%      +0.0        0.10 ±  3%  perf-profile.children.cycles-pp.memcpy_orig
      0.12 ±  3%      +0.0        0.14 ±  3%  perf-profile.children.cycles-pp.xlog_state_clean_iclog
      0.07 ±  5%      +0.0        0.08 ±  5%  perf-profile.children.cycles-pp.filemap_dirty_folio
      0.07            +0.0        0.09 ±  5%  perf-profile.children.cycles-pp.iomap_set_range_uptodate
      0.07 ±  5%      +0.0        0.08 ±  5%  perf-profile.children.cycles-pp.writeback_get_folio
      0.07            +0.0        0.09 ±  5%  perf-profile.children.cycles-pp.xfs_end_bio
      0.06 ±  9%      +0.0        0.07 ±  5%  perf-profile.children.cycles-pp.io_schedule
      0.10            +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.xfs_buffered_write_iomap_begin
      0.09            +0.0        0.11 ±  6%  perf-profile.children.cycles-pp.xfs_btree_lookup
      0.10 ±  3%      +0.0        0.12 ±  5%  perf-profile.children.cycles-pp.writeback_iter
      0.09            +0.0        0.11        perf-profile.children.cycles-pp.xfs_trans_committed_bulk
      0.26            +0.0        0.28        perf-profile.children.cycles-pp.flush_workqueue_prep_pwqs
      0.10            +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.__filemap_get_folio
      0.07 ±  7%      +0.0        0.09 ±  4%  perf-profile.children.cycles-pp.folio_wait_bit_common
      0.16 ±  3%      +0.0        0.19 ±  3%  perf-profile.children.cycles-pp.xfs_inode_item_format
      0.08 ±  5%      +0.0        0.11        perf-profile.children.cycles-pp.__filemap_fdatawait_range
      0.07 ±  5%      +0.0        0.09 ±  5%  perf-profile.children.cycles-pp.wake_page_function
      0.07 ±  7%      +0.0        0.09 ±  4%  perf-profile.children.cycles-pp.folio_wait_writeback
      0.12 ±  4%      +0.0        0.14 ±  2%  perf-profile.children.cycles-pp.iomap_writepage_map_blocks
      0.07 ±  6%      +0.0        0.10 ±  5%  perf-profile.children.cycles-pp.folio_wake_bit
      0.13 ±  2%      +0.0        0.16 ±  2%  perf-profile.children.cycles-pp.llseek
      0.03 ± 70%      +0.0        0.06        perf-profile.children.cycles-pp.get_jiffies_update
      0.12 ±  3%      +0.0        0.15 ±  2%  perf-profile.children.cycles-pp.iomap_iter
      0.14 ±  5%      +0.0        0.16 ±  3%  perf-profile.children.cycles-pp.__mutex_unlock_slowpath
      0.03 ± 70%      +0.0        0.06 ±  6%  perf-profile.children.cycles-pp.tmigr_requires_handle_remote
      0.04 ± 44%      +0.0        0.07        perf-profile.children.cycles-pp.__lruvec_stat_mod_folio
      0.14 ±  2%      +0.0        0.17 ±  4%  perf-profile.children.cycles-pp.iomap_write_end
      0.04 ± 45%      +0.0        0.07 ±  6%  perf-profile.children.cycles-pp.xfs_trans_alloc_inode
      0.03 ± 70%      +0.0        0.06 ±  7%  perf-profile.children.cycles-pp.xfs_map_blocks
      0.15 ±  3%      +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.iomap_write_begin
      0.11 ±  5%      +0.0        0.14 ±  3%  perf-profile.children.cycles-pp.wake_up_q
      0.14 ±  3%      +0.0        0.17 ±  3%  perf-profile.children.cycles-pp.xlog_cil_committed
      0.14 ±  3%      +0.0        0.17 ±  2%  perf-profile.children.cycles-pp.xlog_cil_process_committed
      0.03 ± 70%      +0.0        0.07 ±  8%  perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags
      0.22            +0.0        0.26 ±  2%  perf-profile.children.cycles-pp.xlog_cil_insert_format_items
      0.15 ±  2%      +0.0        0.19 ±  5%  perf-profile.children.cycles-pp.xfs_bmap_add_extent_unwritten_real
      0.16 ±  2%      +0.0        0.20 ±  5%  perf-profile.children.cycles-pp.xfs_bmapi_convert_unwritten
      0.02 ±141%      +0.0        0.06 ± 13%  perf-profile.children.cycles-pp.xlog_grant_push_threshold
      0.28 ±  4%      +0.0        0.32 ±  2%  perf-profile.children.cycles-pp.update_process_times
      0.15            +0.0        0.19        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.32 ±  3%      +0.0        0.36 ±  3%  perf-profile.children.cycles-pp.tick_nohz_handler
      0.18 ±  2%      +0.0        0.23 ±  4%  perf-profile.children.cycles-pp.xfs_bmapi_write
      0.27 ±  2%      +0.0        0.32        perf-profile.children.cycles-pp.xlog_ioend_work
      0.36 ±  4%      +0.0        0.41 ±  3%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.26 ±  2%      +0.0        0.31        perf-profile.children.cycles-pp.xlog_state_do_callback
      0.26 ±  2%      +0.0        0.31        perf-profile.children.cycles-pp.xlog_state_do_iclog_callbacks
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.xa_load
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.xfs_iext_lookup_extent
      0.02 ±141%      +0.1        0.07 ±  5%  perf-profile.children.cycles-pp.up_write
      0.31 ±  2%      +0.1        0.38 ±  2%  perf-profile.children.cycles-pp.xlog_cil_insert_items
      0.41 ±  4%      +0.1        0.47 ±  2%  perf-profile.children.cycles-pp.hrtimer_interrupt
      0.41 ±  3%      +0.1        0.48 ±  3%  perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
      0.13 ± 12%      +0.1        0.20 ±  8%  perf-profile.children.cycles-pp.xfs_log_ticket_ungrant
      0.30            +0.1        0.38 ±  3%  perf-profile.children.cycles-pp.copy_to_brd
      0.56 ±  3%      +0.1        0.64 ±  2%  perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
      0.35            +0.1        0.43 ±  3%  perf-profile.children.cycles-pp.brd_submit_bio
      0.95            +0.1        1.04        perf-profile.children.cycles-pp.mutex_spin_on_owner
      0.11 ± 11%      +0.1        0.21 ± 12%  perf-profile.children.cycles-pp.xlog_grant_add_space
      0.44            +0.1        0.55 ±  2%  perf-profile.children.cycles-pp.iomap_write_iter
      0.19 ±  5%      +0.1        0.30 ±  6%  perf-profile.children.cycles-pp.iomap_finish_ioends
      0.21 ± 11%      +0.1        0.35 ± 12%  perf-profile.children.cycles-pp.xfs_log_reserve
      0.22 ± 11%      +0.1        0.36 ± 11%  perf-profile.children.cycles-pp.xfs_trans_reserve
      0.40 ±  2%      +0.1        0.54 ±  2%  perf-profile.children.cycles-pp.xfs_iomap_write_unwritten
      0.57            +0.1        0.71 ±  2%  perf-profile.children.cycles-pp.iomap_file_buffered_write
      0.25 ± 10%      +0.1        0.39 ± 10%  perf-profile.children.cycles-pp.xfs_trans_alloc
      0.13 ± 11%      +0.2        0.32 ± 16%  perf-profile.children.cycles-pp.schedule_preempt_disabled
      0.23 ± 13%      +0.2        0.46 ± 12%  perf-profile.children.cycles-pp.sb_mark_inode_writeback
      0.25 ± 12%      +0.2        0.50 ± 12%  perf-profile.children.cycles-pp.sb_clear_inode_writeback
      0.59 ±  2%      +0.3        0.85 ±  2%  perf-profile.children.cycles-pp.xfs_end_io
      0.58 ±  2%      +0.3        0.84 ±  3%  perf-profile.children.cycles-pp.xfs_end_ioend
      0.46 ±  6%      +0.3        0.72 ±  6%  perf-profile.children.cycles-pp.md_end_clone_io
      0.30 ± 10%      +0.3        0.57 ±  9%  perf-profile.children.cycles-pp.__folio_start_writeback
      0.11 ± 11%      +0.3        0.38 ± 13%  perf-profile.children.cycles-pp.rwsem_down_read_slowpath
      0.43 ±  7%      +0.3        0.72 ±  7%  perf-profile.children.cycles-pp.iomap_writepage_map
      0.16 ±  9%      +0.3        0.46 ± 11%  perf-profile.children.cycles-pp.down_read
      0.44 ±  8%      +0.3        0.76 ±  7%  perf-profile.children.cycles-pp.__folio_end_writeback
      0.52 ±  7%      +0.4        0.88 ±  6%  perf-profile.children.cycles-pp.folio_end_writeback
      0.54 ±  7%      +0.4        0.90 ±  6%  perf-profile.children.cycles-pp.iomap_finish_ioend
      0.92 ±  2%      +0.4        1.30 ±  3%  perf-profile.children.cycles-pp.iomap_submit_ioend
      0.72 ±  3%      +0.4        1.16 ±  5%  perf-profile.children.cycles-pp.xlog_cil_commit
      0.82 ±  3%      +0.5        1.28 ±  5%  perf-profile.children.cycles-pp.__xfs_trans_commit
      0.92 ±  4%      +0.5        1.43 ±  6%  perf-profile.children.cycles-pp.xfs_vn_update_time
      0.94 ±  4%      +0.5        1.46 ±  6%  perf-profile.children.cycles-pp.kiocb_modified
      0.96 ±  4%      +0.5        1.48 ±  6%  perf-profile.children.cycles-pp.xfs_file_write_checks
      6.96 ±  2%      +0.5        7.49 ±  3%  perf-profile.children.cycles-pp.intel_idle
      1.45 ±  4%      +0.7        2.15 ±  5%  perf-profile.children.cycles-pp.iomap_writepages
      1.46 ±  4%      +0.7        2.16 ±  4%  perf-profile.children.cycles-pp.xfs_vm_writepages
      1.48 ±  4%      +0.7        2.18 ±  4%  perf-profile.children.cycles-pp.do_writepages
      1.51 ±  4%      +0.7        2.22 ±  4%  perf-profile.children.cycles-pp.filemap_fdatawrite_wbc
      1.51 ±  3%      +0.7        2.23 ±  4%  perf-profile.children.cycles-pp.__filemap_fdatawrite_range
      1.61 ±  3%      +0.8        2.36 ±  4%  perf-profile.children.cycles-pp.file_write_and_wait_range
     85.48            +0.8       86.24        perf-profile.children.cycles-pp.xfs_file_fsync
     87.06            +1.4       88.49        perf-profile.children.cycles-pp.xfs_file_buffered_write
     87.19            +1.5       88.65        perf-profile.children.cycles-pp.vfs_write
     87.20            +1.5       88.66        perf-profile.children.cycles-pp.ksys_write
     87.66            +1.5       89.14        perf-profile.children.cycles-pp.write
     87.50            +1.5       88.98        perf-profile.children.cycles-pp.do_syscall_64
     87.50            +1.5       88.99        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     56.76           +13.7       70.44        perf-profile.children.cycles-pp.osq_lock
     57.89           +13.9       71.74        perf-profile.children.cycles-pp.__mutex_lock
     60.36           +14.6       74.96        perf-profile.children.cycles-pp.__flush_workqueue
     61.49           +14.6       76.10        perf-profile.children.cycles-pp.xlog_cil_push_now
     68.74           +14.8       83.60        perf-profile.children.cycles-pp.xfs_log_force_seq
     64.98           +15.1       80.03        perf-profile.children.cycles-pp.xlog_cil_force_seq
     22.30 ±  3%     -13.1        9.22 ±  4%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      1.91 ±  9%      -1.4        0.46 ±  5%  perf-profile.self.cycles-pp.intel_idle_irq
      0.24 ±  2%      -0.1        0.18 ±  4%  perf-profile.self.cycles-pp._raw_spin_lock_irq
      0.18 ±  4%      -0.1        0.12 ±  6%  perf-profile.self.cycles-pp.available_idle_cpu
      0.37 ±  2%      -0.0        0.32 ±  2%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.20 ±  3%      -0.0        0.17 ±  4%  perf-profile.self.cycles-pp.update_load_avg
      0.14 ±  3%      -0.0        0.11 ±  3%  perf-profile.self.cycles-pp.__schedule
      0.09 ±  4%      -0.0        0.07 ±  8%  perf-profile.self.cycles-pp.prepare_task_switch
      0.10            -0.0        0.08 ±  4%  perf-profile.self.cycles-pp.task_h_load
      0.10 ±  5%      -0.0        0.08 ±  6%  perf-profile.self.cycles-pp.__switch_to_asm
      0.08 ±  4%      -0.0        0.06        perf-profile.self.cycles-pp.sched_mm_cid_migrate_to
      0.07 ±  5%      -0.0        0.05 ±  7%  perf-profile.self.cycles-pp.menu_select
      0.09 ±  5%      -0.0        0.08 ±  6%  perf-profile.self.cycles-pp.switch_mm_irqs_off
      0.06 ±  7%      -0.0        0.05        perf-profile.self.cycles-pp.__switch_to
      0.07 ±  7%      -0.0        0.05 ±  8%  perf-profile.self.cycles-pp.enqueue_entity
      0.10 ±  4%      -0.0        0.09 ±  7%  perf-profile.self.cycles-pp.update_curr
      0.05            +0.0        0.06        perf-profile.self.cycles-pp.rep_movs_alternative
      0.06            +0.0        0.07 ±  5%  perf-profile.self.cycles-pp.xas_load
      0.08 ±  4%      +0.0        0.10 ±  5%  perf-profile.self.cycles-pp.__flush_workqueue
      0.07            +0.0        0.08 ±  5%  perf-profile.self.cycles-pp.iomap_set_range_uptodate
      0.08 ±  5%      +0.0        0.10 ±  3%  perf-profile.self.cycles-pp.memcpy_orig
      0.05 ±  7%      +0.0        0.07 ±  5%  perf-profile.self.cycles-pp.down_read
      0.08 ±  5%      +0.0        0.11 ±  4%  perf-profile.self.cycles-pp.__mutex_lock
      0.09 ±  4%      +0.0        0.12 ±  6%  perf-profile.self.cycles-pp.xlog_cil_insert_items
      0.03 ± 70%      +0.0        0.06        perf-profile.self.cycles-pp.get_jiffies_update
      0.02 ± 99%      +0.0        0.06 ±  7%  perf-profile.self.cycles-pp.__folio_end_writeback
      0.15            +0.0        0.19        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.10 ± 12%      +0.1        0.16 ±  9%  perf-profile.self.cycles-pp.xfs_log_ticket_ungrant
      0.00            +0.1        0.06 ± 11%  perf-profile.self.cycles-pp.balance_dirty_pages_ratelimited_flags
      0.30 ±  2%      +0.1        0.37 ±  2%  perf-profile.self.cycles-pp.copy_to_brd
      0.95            +0.1        1.03        perf-profile.self.cycles-pp.mutex_spin_on_owner
      0.11 ± 11%      +0.1        0.20 ± 14%  perf-profile.self.cycles-pp.xlog_grant_add_space
      6.96 ±  2%      +0.5        7.49 ±  3%  perf-profile.self.cycles-pp.intel_idle
     56.27           +13.5       69.81        perf-profile.self.cycles-pp.osq_lock




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki




More information about the Linux-nvme mailing list