[PATCH] coresight: etm: Make cycle count threshold user configurable

Al Grant Al.Grant at arm.com
Fri Aug 4 03:10:40 PDT 2023



> -----Original Message-----
> From: James Clark <james.clark at arm.com>
> Sent: Friday, August 4, 2023 10:23 AM
> To: Anshuman Khandual <Anshuman.Khandual at arm.com>; Al Grant
> <Al.Grant at arm.com>; linux-arm-kernel at lists.infradead.org
> Cc: Mike Leach <mike.leach at linaro.org>; coresight at lists.linaro.org; linux-
> doc at vger.kernel.org; linux-kernel at vger.kernel.org
> Subject: Re: [PATCH] coresight: etm: Make cycle count threshold user
> configurable
> 
> 
> 
> On 04/08/2023 09:45, Anshuman Khandual wrote:
> >
> >
> > On 8/4/23 13:34, Al Grant wrote:
> >>
> >>
> >>> -----Original Message-----
> >>> From: Anshuman Khandual <anshuman.khandual at arm.com>
> >>> Sent: Friday, August 4, 2023 5:47 AM
> >>> To: linux-arm-kernel at lists.infradead.org
> >>> Cc: Anshuman Khandual <Anshuman.Khandual at arm.com>; Mike Leach
> >>> <mike.leach at linaro.org>; coresight at lists.linaro.org;
> >>> linux-doc at vger.kernel.org; linux-kernel at vger.kernel.org
> >>> Subject: [PATCH] coresight: etm: Make cycle count threshold user
> >>> configurable
> >>>
> >>> Cycle counting is enabled, when requested and supported but with a
> >>> default threshold value ETM_CYC_THRESHOLD_DEFAULT i.e 0x100 getting
> >>> into TRCCCCTLR, representing the minimum interval between cycle
> >>> count trace packets.
> >>>
> >>> This makes cycle threshold user configurable, from the user space
> >>> via perf event attributes. Although it falls back using
> >>> ETM_CYC_THRESHOLD_DEFAULT, in case no explicit request. As expected it
> creates a sysfs file as well.
> >>>
> >>> /sys/bus/event_source/devices/cs_etm/format/cc_threshold
> >>>
> >>> New 'cc_threshold' uses 'event->attr.config3' as no more space is
> >>> available in 'event->attr.config1' or 'event->attr.config2'.
> >>>
> >>> Cc: Suzuki K Poulose <suzuki.poulose at arm.com>
> >>> Cc: Mike Leach <mike.leach at linaro.org>
> >>> Cc: James Clark <james.clark at arm.com>
> >>> Cc: Leo Yan <leo.yan at linaro.org>
> >>> Cc: coresight at lists.linaro.org
> >>> Cc: linux-arm-kernel at lists.infradead.org
> >>> Cc: linux-doc at vger.kernel.org
> >>> Cc: linux-kernel at vger.kernel.org
> >>> Signed-off-by: Anshuman Khandual <anshuman.khandual at arm.com>
> >>> ---
> >>>  Documentation/trace/coresight/coresight.rst        |  2 ++
> >>>  drivers/hwtracing/coresight/coresight-etm-perf.c   |  2 ++
> >>>  drivers/hwtracing/coresight/coresight-etm4x-core.c | 12
> >>> ++++++++++--
> >>>  3 files changed, 14 insertions(+), 2 deletions(-)
> >>>
> >>> diff --git a/Documentation/trace/coresight/coresight.rst
> >>> b/Documentation/trace/coresight/coresight.rst
> >>> index 4a71ea6cb390..b88d83b59531 100644
> >>> --- a/Documentation/trace/coresight/coresight.rst
> >>> +++ b/Documentation/trace/coresight/coresight.rst
> >>> @@ -624,6 +624,8 @@ They are also listed in the folder
> >>> /sys/bus/event_source/devices/cs_etm/format/
> >>>     * - timestamp
> >>>       - Session local version of the system wide setting:
> >>> :ref:`ETMv4_MODE_TIMESTAMP
> >>>         <coresight-timestamp>`
> >>> +   * - cc_treshold
> >>
> >> Spelling: cc_threshold
> >
> > Will fix this, besides does it require some more description for this
> > new config option i.e cc_threshold ?
> >
> >>
> >>> +     - Cycle count treshhold value
> >>>
> >>>  How to use the STM module
> >>>  -------------------------
> >>> diff --git a/drivers/hwtracing/coresight/coresight-etm-perf.c
> >>> b/drivers/hwtracing/coresight/coresight-etm-perf.c
> >>> index 5ca6278baff4..09f75dffae60 100644
> >>> --- a/drivers/hwtracing/coresight/coresight-etm-perf.c
> >>> +++ b/drivers/hwtracing/coresight/coresight-etm-perf.c
> >>> @@ -68,6 +68,7 @@ PMU_FORMAT_ATTR(preset,		"config:0-3");
> >>>  PMU_FORMAT_ATTR(sinkid,		"config2:0-31");
> >>>  /* config ID - set if a system configuration is selected */
> >>>  PMU_FORMAT_ATTR(configid,	"config2:32-63");
> >>> +PMU_FORMAT_ATTR(cc_threshold,	"config3:0-11");
> >>>
> >>>
> >>>  /*
> >>> @@ -101,6 +102,7 @@ static struct attribute *etm_config_formats_attr[] = {
> >>>  	&format_attr_preset.attr,
> >>>  	&format_attr_configid.attr,
> >>>  	&format_attr_branch_broadcast.attr,
> >>> +	&format_attr_cc_threshold.attr,
> >>>  	NULL,
> >>>  };
> >>>
> >>> diff --git a/drivers/hwtracing/coresight/coresight-etm4x-core.c
> >>> b/drivers/hwtracing/coresight/coresight-etm4x-core.c
> >>> index 9d186af81ea0..9a2766f68416 100644
> >>> --- a/drivers/hwtracing/coresight/coresight-etm4x-core.c
> >>> +++ b/drivers/hwtracing/coresight/coresight-etm4x-core.c
> >>> @@ -644,7 +644,7 @@ static int etm4_parse_event_config(struct
> >>> coresight_device *csdev,
> >>>  	struct etmv4_config *config = &drvdata->config;
> >>>  	struct perf_event_attr *attr = &event->attr;
> >>>  	unsigned long cfg_hash;
> >>> -	int preset;
> >>> +	int preset, cc_threshold;
> >>>
> >>>  	/* Clear configuration from previous run */
> >>>  	memset(config, 0, sizeof(struct etmv4_config)); @@ -667,7 +667,15
> >>> @@ static int etm4_parse_event_config(struct coresight_device *csdev,
> >>>  	if (attr->config & BIT(ETM_OPT_CYCACC)) {
> >>>  		config->cfg |= TRCCONFIGR_CCI;
> >>>  		/* TRM: Must program this for cycacc to work */
> >>> -		config->ccctlr = ETM_CYC_THRESHOLD_DEFAULT;
> >>> +		cc_treshold = attr->config3 & ETM_CYC_THRESHOLD_MASK;
> >>
> >> Spelling again
> >
> > Yikes, this does not even build. Seems like I had missed the
> > applicable config i.e CONFIG_CORESIGHT_SOURCE_ETM4X this time around.
> Apologies.
> >
> >>
> >>> +		if (cc_treshold) {
> >>> +			if (cc_treshold < drvdata->ccitmin)
> >>> +				config->ccctlr = drvdata->ccitmin;
> >>> +			else
> >>> +				config->ccctlr = cc_threshold;
> >>> +		} else {
> >>> +			config->ccctlr = ETM_CYC_THRESHOLD_DEFAULT;
> >>> +		}
> >>
> >> Consider dropping the check against CCITMIN. There are CPUs where
> >> CCITMIN is incorrect, e.g. see published errata 1490853 where the
> >> value 0x100 should be 0b100 i.e. 4. On these ETMs it is possible to
> >> set the timing threshold to four cycles instead of 256 cycles,
> >> providing much better timing resolution. The kernel currently does
> >> not work around this errata and uses the incorrect value of ccitmin.
> >> If you drop the check, and trust the value provided by userspace, you
> >> allow userspace to work around it.
> > Why ? We could just work around the errata #1490853 while initializing
> > the drvdata->ccitmin if that is where the problem exists.
> 
> 
> > I dont think
> > user space should be required to know about the erratas, and provide a
> 
> I think that becomes less true for the tracing and PMU stuff. If you are using it you
> likely need to know a lot about the platform you are working on anyway.
> 
> For example right now I'm trying to upstream some metric formulas which have a
> workaround where userspace needs to know the variant of the processor. It's not
> possible for the kernel to do anything about it.
> 
> In this case as it's only one known errata we could add the workaround.
> 
> Unless we expect there to be the same issue again in the future? Or we know
> there are already more CPUs than #1490853 mentions?

It's a common-mode failure affecting several CPUs, each with their
own set of affected/fixed versions, so there would be several MIDRs
to check. The ones that have been published in errata notices include:

- #1490853: Neoverse N1, fixed r4p1
- #1490853: Cortex-A76, fixed r4p1 (n.b. same number as above)
- #1619801: Neoverse V1, fixed r1p0
- #1502854: Cortex-X1, fixed r1p0
- #1491015: Cortex-A77, fixed r1p1

Hopefully that's the lot.

For this issue you don't really need to check the fix version, as the
number you'd put in ccitmin is the same anyway.

Al


> 
> > right value instead.
> > _______________________________________________
> > CoreSight mailing list -- coresight at lists.linaro.org To unsubscribe
> > send an email to coresight-leave at lists.linaro.org


More information about the linux-arm-kernel mailing list