[PATCH v3 0/7] Add JSON metrics for arm CMN and Yitian710 DDR

Jing Zhang renyu.zj at linux.alibaba.com
Tue May 30 02:19:27 PDT 2023


Changes since v2:
- Refact cmn identifier and use model and revision to form identifier.
- Let "Compat" support matching multiple identifier.
- Improved the ali_drw PMU event alias Brief Description.
- Update ali_drw PMU metric usage in documentation.

Changes since RFC:
- Refact arm-cmn PMU identifier.
- Not add arm-cmn PMU aliasing currently because it's Eventcode is
  difficult to define.
- Rename ali_drw PMU identifier and Unit name.
- Divide ali_drw PMU metric and aliasing into two patches.

Add an identifier sysfs file for the yitian710 SoC DDR and arm CMN to
allow userspace to identify the specific implementation of the device,
so that the perf tool can match the corresponding uncore events and
metrics through the identifier. Then added several general CMN metrics
and yitian710 soc DDR metrics and events alias.


$perf list:
...
ali_drw:
  chi_rxdat
       [A packet at CHI RXDAT interface (write data). Unit: ali_drw]
  chi_rxrsp
       [A packet at CHI RXRSP interface. Unit: ali_drw]
  chi_txdat
       [A packet at CHI TXDAT interface (read data). Unit: ali_drw]
  chi_txreq
       [A packet at CHI TXREQ interface (request). Unit: ali_drw]
  cycle
       [The ddr cycle. Unit: ali_drw]
...
arm_cmn:
  mc_message_retry_rate
       [The memory controller request retries rate indicates whether the memory controller is the bottleneck. Unit: arm_cmn ]
  rni_actual_read_bandwidth.all
       [This event measure the actual bandwidth(MB/sec) that RN-I bridge sends to the interconnect. Unit: arm_cmn ]
  rni_actual_write_bandwidth.all
       [This event measures the actual write bandwidth(MB/sec) at RN-I bridges. Unit: arm_cmn ]
  rni_retry_rate
       [RN-I bridge retry rate indicates whether the memory controller is the bottleneck. Unit: arm_cmn ]
  sbsx_actual_write_bandwidth.all
       [sbsx actual write bandwidth(MB/sec). Unit: arm_cmn ]
  sf_hit_rate
       [Snoop filter hit rate can be used to measure the Snoop Filter efficiency. Unit: arm_cmn ]
  slc_miss_rate
       [The system level cache miss rate include. Unit: arm_cmn ]
ali_drw:
  ddr_read_bandwidth.all
       [The ddr read bandwidth(MB/s). Unit: ali_drw ]
  ddr_write_bandwidth.all
       [The ddr write bandwidth(MB/s). Unit: ali_drw ]
...

$perf stat -M ddr_read_bandwidth.all ./test

Performance counter stats for 'system wide':

            38,150      hif_rd        #  2.4 MB/s  ddr_read_bandwidth.all
     1,000,957,941 ns   duration_time

       1.000957941 seconds time elapsed

Jing Zhang (7):
  driver/perf: Add identifier sysfs file for CMN
  perf metric: Event "Compat" value supports matching multiple
    identifiers
  perf vendor events: Add JSON metrics for CMN
  driver/perf: Add identifier sysfs file for Yitian 710 DDR
  perf jevents: Add support for Yitian 710 DDR PMU aliasing
  perf vendor events: Add JSON metrics for Yitian 710 DDR
  docs: perf: Update metric usage for Alibaba's T-Head PMU driver

 Documentation/admin-guide/perf/alibaba_pmu.rst     |   5 +
 drivers/perf/alibaba_uncore_drw_pmu.c              |  27 ++
 drivers/perf/arm-cmn.c                             |  79 ++++-
 .../pmu-events/arch/arm64/arm/cmn/sys/metrics.json |  74 ++++
 .../arm64/freescale/yitian710/sys/ali_drw.json     | 373 +++++++++++++++++++++
 .../arm64/freescale/yitian710/sys/metrics.json     |  20 ++
 tools/perf/pmu-events/jevents.py                   |   2 +
 tools/perf/util/metricgroup.c                      |  24 +-
 8 files changed, 595 insertions(+), 9 deletions(-)
 create mode 100644 tools/perf/pmu-events/arch/arm64/arm/cmn/sys/metrics.json
 create mode 100644 tools/perf/pmu-events/arch/arm64/freescale/yitian710/sys/ali_drw.json
 create mode 100644 tools/perf/pmu-events/arch/arm64/freescale/yitian710/sys/metrics.json

-- 
1.8.3.1




More information about the linux-arm-kernel mailing list