[PATCH v6 00/12] perf tools: fix perf stat with large socket IDs

John Garry john.garry at huawei.com
Fri Dec 4 06:48:36 EST 2020


On 03/12/2020 15:39, Jiri Olsa wrote:

+

> On Thu, Nov 26, 2020 at 04:13:16PM +0200, James Clark wrote:
>> Changes since v5:
>>    * Fix test for cpu_map__get_die() by shifting id before testing.
>>    * Fix test for cpu_map__get_socket() by not using cpu_map__id_to_socket()
>>      which is only valid in CPU aggregation mode.
>>
>> James Clark (12):
>>    perf tools: Improve topology test
>>    perf tools: Use allocator for perf_cpu_map
>>    perf tools: Add new struct for cpu aggregation
>>    perf tools: Replace aggregation ID with a struct
>>    perf tools: add new map type for aggregation
>>    perf tools: drop in cpu_aggr_map struct
>>    perf tools: Start using cpu_aggr_id in map
>>    perf tools: Add separate node member
>>    perf tools: Add separate socket member
>>    perf tools: Add separate die member
>>    perf tools: Add separate core member
>>    perf tools: Add separate thread member
> 
> Acked-by: Jiri Olsa <jolsa at redhat.com>
> 

Tested-by: John Garry <john.garry at huawei.com>

I still think that vendors (like us) need to fix/improve their firmware 
tables so that we don't get silly big numbers for socket/package IDs, 
like S5418-D0, below:

$./perf stat -a --per-die

  Performance counter stats for 'system wide':

S36-D0   48   72,216.31 msec cpu-clock      #   47.933 CPUs utilized
S36-D0   48        174     context-switches #   0.002 K/sec
S36-D0   48         48     cpu-migrations   #   0.001 K/sec
S36-D0   48         0     page-faults    #   0.000 K/sec
S36-D0   48   7,991,698     cycles    #   0.000 GHz
S36-D0   48   4,750,040     instructions   #   0.59  insn per cycle
S36-D0    1   <not supported>     branches
S36-D0   48      32,928     branch-misses    #   0.00% of all branches
S5418-D0   48   72,189.54 msec cpu-clock     #   47.915 CPUs utilized
S5418-D0   48        176     context-switches  #   0.002 K/sec
S5418-D0   48         48     cpu-migrations   #   0.001 K/sec
S5418-D0   48         0     page-faults     #   0.000 K/sec
S5418-D0   48   5,677,218     cycles    #    0.000 GHz
S5418-D0   48   3,872,285     instructions   #  0.68  insn per cycle
S5418-D0    1   <not supported>     branches
S5418-D0   48      29,208     branch-misses   #  0.00% of all branches

       1.506615297 seconds time elapsed

but at least it works now. Thanks.

> 
>>
>>   tools/perf/builtin-stat.c      | 128 ++++++++++++------------
>>   tools/perf/tests/topology.c    |  64 ++++++++++--
>>   tools/perf/util/cpumap.c       | 171 ++++++++++++++++++++++-----------
>>   tools/perf/util/cpumap.h       |  55 ++++++-----
>>   tools/perf/util/stat-display.c | 102 ++++++++++++--------
>>   tools/perf/util/stat.c         |   2 +-
>>   tools/perf/util/stat.h         |   9 +-
>>   7 files changed, 337 insertions(+), 194 deletions(-)
>>
>> -- 
>> 2.28.0
>>
> 
> .
> 




More information about the linux-arm-kernel mailing list