[PATCH] ARM: perf: remove erroneous check on active_events

Ashwin Chaugule ashwinc at codeaurora.org
Thu Apr 28 11:21:30 EDT 2011


Hi Mark,

> From: Mark Rutland <mark.rutland at arm.com>
>
> When initialising a PMU, there is a check to protect against races with
> other CPUs filling all of the available event slots. Since armpmu_add
> checks that an event can be scheduled, we do not need to do this at
> initialisation time. Furthermore the current code is broken because it
> assumes that atomic_inc_not_zero will unconditionally increment
> active_events and then tries to decrement it again on failure.
> 
> This patch removes the broken, redundant code.

Nice find! I had a slightly different solution.

> 
> Signed-off-by: Mark Rutland <mark.rutland at arm.com>
> Acked-by: Will Deacon <will.deacon at arm.com>
> Cc: Will Deacon <will.deacon at arm.com>
> Cc: Jamie Iles <jamie at jamieiles.com>
> ---
>  arch/arm/kernel/perf_event.c |    5 -----
>  1 files changed, 0 insertions(+), 5 deletions(-)
> 
> diff --git a/arch/arm/kernel/perf_event.c b/arch/arm/kernel/perf_event.c
> index fd6403c..29a0cf8 100644
> --- a/arch/arm/kernel/perf_event.c
> +++ b/arch/arm/kernel/perf_event.c
> @@ -560,11 +560,6 @@ static int armpmu_event_init(struct perf_event *event)
>        event->destroy = hw_perf_event_destroy;
> 
>        if (!atomic_inc_not_zero(&active_events)) {
> -               if (atomic_read(&active_events) > armpmu->num_events) {
> -                       atomic_dec(&active_events);
> -                       return -ENOSPC;
> -               }
> -
>                mutex_lock(&pmu_reserve_mutex);
>                if (atomic_read(&active_events) == 0) {
>                        err = armpmu_reserve_hardware();
> --
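
As an aside on why the removed hunk was wrong: atomic_inc_not_zero() only
increments the counter when it is already non-zero, so on the path inside
that "if" there is nothing for the atomic_dec() to undo. A rough userspace
sketch of that behaviour, using C11 atomics as a stand-in for the kernel
primitives (purely illustrative, not the kernel implementation):

	#include <stdatomic.h>
	#include <stdio.h>

	/* Userspace analogue of the kernel's atomic_inc_not_zero():
	 * increment *v only if it is non-zero, and return non-zero
	 * iff the increment actually happened. */
	static int inc_not_zero(atomic_int *v)
	{
		int old = atomic_load(v);

		while (old != 0) {
			if (atomic_compare_exchange_weak(v, &old, old + 1))
				return 1;	/* incremented */
		}
		return 0;			/* counter was zero, left untouched */
	}

	int main(void)
	{
		atomic_int active_events = 0;

		/* With the counter at zero the call fails *without* touching
		 * it, so an unconditional atomic_dec() on this path would
		 * underflow the counter to -1. */
		int got = inc_not_zero(&active_events);

		printf("inc_not_zero -> %d, counter now %d\n",
		       got, atomic_load(&active_events));
		return 0;
	}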

While it's true that we check whether an event can be scheduled in add(), I
think we should still check it here: if there is no space left on the PMU,
returning -ENOSPC from init means no perf_event instance gets "allocated"
at all.

Whereas, if we skip this check in the init function, the perf_event
instance is allocated but may or may not get "armed" on the PMU in add().
That would be fine, except that the perf-core code "rotates" the linked
list of active events, and when an allocated event can't be armed on the
PMU, perf-core normalizes (scales) its count. This shows up as a
"(scaled ...)" annotation against the event's output in perf stat.
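
To make that concrete: perf-core estimates the full-period count from the
fraction of time the event was actually on the PMU, roughly
count * time_enabled / time_running. A minimal illustration with made-up
numbers (nothing below comes from a real run):

	#include <stdint.h>
	#include <stdio.h>

	/* Illustration of how a partially scheduled event gets scaled up.
	 * time_enabled/time_running stand in for the per-event times perf
	 * exposes; all values are invented for the example. */
	int main(void)
	{
		uint64_t raw_count    = 1000000; /* what the counter actually saw  */
		uint64_t time_enabled = 100;     /* event was enabled this long     */
		uint64_t time_running = 60;      /* ...but on the PMU only this long */

		uint64_t scaled = raw_count * time_enabled / time_running;

		printf("estimated count: %llu (on the PMU %.0f%% of the time)\n",
		       (unsigned long long)scaled,
		       100.0 * time_running / time_enabled);
		return 0;
	}

Because the estimate extrapolates from whatever slice of the run the event
happened to be scheduled for, it varies from run to run, which is where the
variance below comes from.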

In practice, when this happens I've seen the event's output carry anywhere
between 50% and 80% standard deviation, and many users don't like that.
So I think we should enforce, in the init function, support for only as
many events as there are counters, and return -ENOSPC otherwise.

If the event allocation fails there, it'll show up as <not counted>, which
I think is better than an output with a very high standard deviation.

How about something like this:-

if (!atomic_inc_not_zero(&active_events)) {
	mutex_lock(&pmu_reserve_mutex);
	if (atomic_read(&active_events) == 0)
		err = armpmu_reserve_hardware();
	if (!err)
		atomic_inc(&active_events);
	mutex_unlock(&pmu_reserve_mutex);
} else if (atomic_read(&active_events) > armpmu->num_events) {
	/* No free counter left for this event: undo the increment. */
	atomic_dec(&active_events);
	return -ENOSPC;
}



Cheers,
Ashwin


-- 
Sent by an employee of the Qualcomm Innovation Center, Inc.
The Qualcomm Innovation Center, Inc. is a member of the Code Aurora Forum.


