[PATCH] Revert "perf cs-etm: Move definition of 'traceid_list' global variable from header file"

Andrey Zhizhikin andrey.z at gmail.com
Fri Nov 20 11:31:59 EST 2020


Hello Salvatore,

On Fri, Nov 20, 2020 at 4:53 PM Salvatore Bonaccorso <carnil at debian.org> wrote:
>
> Hi Andrey,
>
> On Fri, Nov 20, 2020 at 03:29:39PM +0100, Andrey Zhizhikin wrote:
> > Hello Salvatore,
> >
> > On Fri, Nov 20, 2020 at 2:34 PM Salvatore Bonaccorso <carnil at debian.org> wrote:
> > >
> > > Hi Andrey,
> > >
> > > On Fri, Nov 20, 2020 at 10:54:22AM +0100, Andrey Zhizhikin wrote:
> > > > On Fri, Nov 20, 2020 at 8:39 AM Salvatore Bonaccorso <carnil at debian.org> wrote:
> > > > >
> > > > > This reverts commit 168200b6d6ea0cb5765943ec5da5b8149701f36a upstream.
> > > > > (but only from 4.19.y)
> > > >
> > > > This revert would fail the build of 4.19.y with gcc10, I believe the
> > > > original commit was introduced to address exactly this case. If this
> > > > is intended behavior that 4.19.y is not compiled with newer gcc
> > > > versions - then this revert is OK.
> > >
> > > TTBOMK, this would not regress the build for newer gcc (specifically
> > > gcc10) as 4.19.158 is failing perf tool builds there as well (without
> > > the above commit reverted). Just as an example v4.19.y does not have
> > > cff20b3151cc ("perf tests bp_account: Make global variable static")
> > > which is there in v5.6-rc6 to fix build failures with 10.0.1.
> > >
> > > But it did regress builds with older gcc's as for instance used in
> > > Debian buster (gcc 8.3.0) since 4.19.152.
> > >
> > > Do I possibly miss something? If there is a solution to make it build
> > > with newer GCCs and *not* regress previously working GCC versions then
> > > this is surely the best outcome though.
> >
> > I guess (and from what I understand in Leo's reply), porting of
> > 95c6fe970a01 ("perf cs-etm: Change tuple from traceID-CPU# to
> > traceID-metadata") should solve the issue for both older and newer gcc
> > versions.
> >
> > The breakage is now in
> > [tools/perf/util/cs-etm-decoder/cs-etm-decoder.c] file (which uses
> > traceid_list inside). This is solved with the above commit, which
> > concealed traceid_list internally inside [tools/perf/util/cs-etm.c]
> > file and exposed to [tools/perf/util/cs-etm-decoder/cs-etm-decoder.c]
> > via cs_etm__get_cpu() call.
> >
> > Can you try porting that commit to see whether that solves your
> > regression?
>
> So something like the following will compile as well with the older
> gcc version.
>
> I realize: in mainline the order of the commits was:
>
> 95c6fe970a01 ("perf cs-etm: Change tuple from traceID-CPU# to traceID-metadata")
> 168200b6d6ea ("perf cs-etm: Move definition of 'traceid_list' global variable from header file")
>
> But to v4.19.y only 168200b6d6ea was backported, and while that was
> done I now realize the comment was also changed including the change
> from 95c6fe970a01.
>
> Thus the proposed backported patch would drop the change in
> tools/perf/util/cs-etm.c to the comment as this was already done.
> Technically the comment is currently wrong, because it reads:
>
> /* RB tree for quick conversion between traceID and metadata pointers */
>
> but backport of 95c6fe970a01 is not included.
>
> Would the right thing to do thus be:
>
> - Revert b801d568c7d8 "perf cs-etm: Move definition of 'traceid_list' global variable from header file"
> - Backport 95c6fe970a01 ("perf cs-etm: Change tuple from traceID-CPU# to traceID-metadata")
> - Backport 168200b6d6ea ("perf cs-etm: Move definition of 'traceid_list' global variable from header file")

Yes, I believe this would be the correct course of action here; this
should cover the regression you've encountered and should ensure that
perf builds on both the "old" and "new" gcc versions.
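
For illustration only, here is a minimal standalone sketch of the pattern
that 95c6fe970a01 relies on, as far as I understand it: the lookup table
stays file-local (static) in one .c file, and other files reach it only
through an accessor. Since no other translation unit can name the
variable, moving its definition out of the header no longer breaks
callers, and gcc 10 (which defaults to -fno-common) has no duplicate
definition to complain about at link time. All demo_* names, the flat
array standing in for the intlist RB tree, and the metadata layout are
assumptions made up for this sketch; this is not the perf code.

```c
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical index of the CPU field in a metadata row (assumption). */
enum { DEMO_MAGIC = 0, DEMO_CPU = 1 };

/* Hypothetical node: maps a trace ID to a metadata pointer. */
struct demo_node {
	uint8_t trace_chan_id;
	uint64_t *metadata;	/* plays the role of inode->priv */
};

/* Made-up metadata rows: { magic, cpu }. */
static uint64_t demo_meta_a[] = { 0x4040404040404040ULL, 2 };
static uint64_t demo_meta_b[] = { 0x4040404040404040ULL, 5 };

/*
 * File-local table. Because it is static, no other translation unit can
 * reference it directly; they have to call demo_get_cpu() instead.
 */
static struct demo_node demo_traceid_list[] = {
	{ 0x10, demo_meta_a },
	{ 0x12, demo_meta_b },
};

/*
 * Accessor in the spirit of cs_etm__get_cpu(): returns 0 and stores the
 * CPU number on success, -EINVAL if the trace ID is unknown.
 */
int demo_get_cpu(uint8_t trace_chan_id, int *cpu)
{
	size_t n = sizeof(demo_traceid_list) / sizeof(demo_traceid_list[0]);
	size_t i;

	for (i = 0; i < n; i++) {
		if (demo_traceid_list[i].trace_chan_id == trace_chan_id) {
			*cpu = (int)demo_traceid_list[i].metadata[DEMO_CPU];
			return 0;
		}
	}
	return -EINVAL;
}
```

A caller in another file would only need the prototype of demo_get_cpu()
from a header, mirroring how cs-etm-decoder.c only needs the
cs_etm__get_cpu() declaration from cs-etm.h after the backport.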

>
> ?
>
> Leo, is that what you were proposing?
>
> Regards,
> Salvatore
>
> From 7d6b3668d8ae5d3aea8827670ade8ac43b92db4a Mon Sep 17 00:00:00 2001
> From: Leo Yan <leo.yan at linaro.org>
> Date: Tue, 29 Jan 2019 20:28:39 +0800
> Subject: [PATCH] perf cs-etm: Change tuple from traceID-CPU# to
>  traceID-metadata
>
> commit 95c6fe970a0160cb770c5dce9f80311b42d030c0 upstream.
>
> If packet processing wants to know the packet is bound with which ETM
> version, it needs to access metadata to decide that based on metadata
> magic number; but we cannot simply to use CPU logic ID number as index
> to access metadata sequential array, especially when system have
> hotplugged off CPUs, the metadata array are only allocated for online
> CPUs but not offline CPUs, so the CPU logic number doesn't match with
> its index in the array.
>
> This patch is to change tuple from traceID-CPU# to traceID-metadata,
> thus it can use the tuple to retrieve metadata pointer according to
> traceID.
>
> For safe accessing metadata fields, this patch provides helper function
> cs_etm__get_cpu() which is used to return CPU number according to
> traceID; cs_etm_decoder__buffer_packet() is the first consumer for this
> helper function.
>
> Signed-off-by: Leo Yan <leo.yan at linaro.org>
> Reviewed-by: Mathieu Poirier <mathieu.poirier at linaro.org>
> Cc: Alexander Shishkin <alexander.shishkin at linux.intel.com>
> Cc: Jiri Olsa <jolsa at redhat.com>
> Cc: Mike Leach <mike.leach at linaro.org>
> Cc: Namhyung Kim <namhyung at kernel.org>
> Cc: Robert Walker <robert.walker at arm.com>
> Cc: Suzuki K Poulouse <suzuki.poulose at arm.com>
> Cc: coresight ml <coresight at lists.linaro.org>
> Cc: linux-arm-kernel at lists.infradead.org
> Link: http://lkml.kernel.org/r/20190129122842.32041-6-leo.yan@linaro.org
> Signed-off-by: Arnaldo Carvalho de Melo <acme at redhat.com>
> [Salvatore Bonaccorso: Drop comment change in tools/perf/util/cs-etm.h
> which was already changed with b801d568c7d8 ("perf cs-etm: Move
> definition of 'traceid_list' global variable from header file")]
> Signed-off-by: Salvatore Bonaccorso <carnil at debian.org>
> ---
>  .../perf/util/cs-etm-decoder/cs-etm-decoder.c |  8 +++---
>  tools/perf/util/cs-etm.c                      | 26 ++++++++++++++-----
>  tools/perf/util/cs-etm.h                      |  7 +++++
>  3 files changed, 30 insertions(+), 11 deletions(-)
>
> diff --git a/tools/perf/util/cs-etm-decoder/cs-etm-decoder.c b/tools/perf/util/cs-etm-decoder/cs-etm-decoder.c
> index 938def6d0bb9..f540037eb705 100644
> --- a/tools/perf/util/cs-etm-decoder/cs-etm-decoder.c
> +++ b/tools/perf/util/cs-etm-decoder/cs-etm-decoder.c
> @@ -278,14 +278,12 @@ cs_etm_decoder__buffer_packet(struct cs_etm_decoder *decoder,
>                               enum cs_etm_sample_type sample_type)
>  {
>         u32 et = 0;
> -       struct int_node *inode = NULL;
> +       int cpu;
>
>         if (decoder->packet_count >= MAX_BUFFER - 1)
>                 return OCSD_RESP_FATAL_SYS_ERR;
>
> -       /* Search the RB tree for the cpu associated with this traceID */
> -       inode = intlist__find(traceid_list, trace_chan_id);
> -       if (!inode)
> +       if (cs_etm__get_cpu(trace_chan_id, &cpu) < 0)
>                 return OCSD_RESP_FATAL_SYS_ERR;
>
>         et = decoder->tail;
> @@ -296,7 +294,7 @@ cs_etm_decoder__buffer_packet(struct cs_etm_decoder *decoder,
>         decoder->packet_buffer[et].sample_type = sample_type;
>         decoder->packet_buffer[et].exc = false;
>         decoder->packet_buffer[et].exc_ret = false;
> -       decoder->packet_buffer[et].cpu = *((int *)inode->priv);
> +       decoder->packet_buffer[et].cpu = cpu;
>         decoder->packet_buffer[et].start_addr = CS_ETM_INVAL_ADDR;
>         decoder->packet_buffer[et].end_addr = CS_ETM_INVAL_ADDR;
>
> diff --git a/tools/perf/util/cs-etm.c b/tools/perf/util/cs-etm.c
> index ad33b99f5d21..3275b8dc9344 100644
> --- a/tools/perf/util/cs-etm.c
> +++ b/tools/perf/util/cs-etm.c
> @@ -94,6 +94,20 @@ static int cs_etm__update_queues(struct cs_etm_auxtrace *etm);
>  static int cs_etm__process_timeless_queues(struct cs_etm_auxtrace *etm,
>                                            pid_t tid, u64 time_);
>
> +int cs_etm__get_cpu(u8 trace_chan_id, int *cpu)
> +{
> +       struct int_node *inode;
> +       u64 *metadata;
> +
> +       inode = intlist__find(traceid_list, trace_chan_id);
> +       if (!inode)
> +               return -EINVAL;
> +
> +       metadata = inode->priv;
> +       *cpu = (int)metadata[CS_ETM_CPU];
> +       return 0;
> +}
> +
>  static void cs_etm__packet_dump(const char *pkt_string)
>  {
>         const char *color = PERF_COLOR_BLUE;
> @@ -233,7 +247,7 @@ static void cs_etm__free(struct perf_session *session)
>         cs_etm__free_events(session);
>         session->auxtrace = NULL;
>
> -       /* First remove all traceID/CPU# nodes for the RB tree */
> +       /* First remove all traceID/metadata nodes for the RB tree */
>         intlist__for_each_entry_safe(inode, tmp, traceid_list)
>                 intlist__remove(traceid_list, inode);
>         /* Then the RB tree itself */
> @@ -1319,9 +1333,9 @@ int cs_etm__process_auxtrace_info(union perf_event *event,
>                                     0xffffffff);
>
>         /*
> -        * Create an RB tree for traceID-CPU# tuple. Since the conversion has
> -        * to be made for each packet that gets decoded, optimizing access in
> -        * anything other than a sequential array is worth doing.
> +        * Create an RB tree for traceID-metadata tuple.  Since the conversion
> +        * has to be made for each packet that gets decoded, optimizing access
> +        * in anything other than a sequential array is worth doing.
>          */
>         traceid_list = intlist__new(NULL);
>         if (!traceid_list) {
> @@ -1387,8 +1401,8 @@ int cs_etm__process_auxtrace_info(union perf_event *event,
>                         err = -EINVAL;
>                         goto err_free_metadata;
>                 }
> -               /* All good, associate the traceID with the CPU# */
> -               inode->priv = &metadata[j][CS_ETM_CPU];
> +               /* All good, associate the traceID with the metadata pointer */
> +               inode->priv = metadata[j];
>         }
>
>         /*
> diff --git a/tools/perf/util/cs-etm.h b/tools/perf/util/cs-etm.h
> index c7ef97b198c7..97c3152f5bfd 100644
> --- a/tools/perf/util/cs-etm.h
> +++ b/tools/perf/util/cs-etm.h
> @@ -66,6 +66,7 @@ static const u64 __perf_cs_etmv4_magic   = 0x4040404040404040ULL;
>  #ifdef HAVE_CSTRACE_SUPPORT
>  int cs_etm__process_auxtrace_info(union perf_event *event,
>                                   struct perf_session *session);
> +int cs_etm__get_cpu(u8 trace_chan_id, int *cpu);
>  #else
>  static inline int
>  cs_etm__process_auxtrace_info(union perf_event *event __maybe_unused,
> @@ -73,6 +74,12 @@ cs_etm__process_auxtrace_info(union perf_event *event __maybe_unused,
>  {
>         return -1;
>  }
> +
> +static inline int cs_etm__get_cpu(u8 trace_chan_id __maybe_unused,
> +                                 int *cpu __maybe_unused)
> +{
> +       return -1;
> +}
>  #endif
>
>  #endif
> --
> 2.29.2
>


-- 
Regards,
Andrey.


