[RESEND PATCH v3] ARM: tegra124: pmu support

Kyle Huey me at kylehuey.com
Sat Jul 18 06:54:11 PDT 2015


On Fri, Jul 17, 2015 at 4:59 PM, Thierry Reding
<thierry.reding at gmail.com> wrote:
> On Mon, Jul 13, 2015 at 10:35:45AM -0700, Kyle Huey wrote:
>> This patch modifies the device tree for tegra124 based devices to enable
>> the Cortex A15 PMU.  The interrupt numbers are taken from NVIDIA TRM
>> DP-06905-001_v03p.  This patch was tested on a Jetson TK1.
>>
>> Updated for proper ordering and to add interrupt-affinity values.
>>
>> Signed-off-by: Kyle Huey <khuey at kylehuey.com>
>> ---
>>  arch/arm/boot/dts/tegra124.dtsi | 17 +++++++++++++----
>>  1 file changed, 13 insertions(+), 4 deletions(-)
>
> Is there any way to test this? What are the effects of adding this?

Yes.  This enables the ARM PMU driver for the Cortex A15, which allows
one to use hardware performance counters via the perf_event_open API.
For a simple test program, see
https://github.com/khuey/perf-counter-test/.  Without this patch, the
perf_event_open syscall will fail.  With this patch, the program will
print out the performance counter value for each iteration of the
loop. (IIRC on the A15 the branch counter was removed, so you may want
to replace 0xD with 0x8 which counts instructions executed if you want
to see a non-zero number there).  You also will see a message about
the PMU in the kernel log at startup after applying this patch.

I have also tested this extensively (including the interrupt features
of the PMU) on a more complex program.

> Does it enable using perf for profiling?

I have not tested it, but I believe you can use perf without this
patch if you do not use features that require hardware performance
counter support.  This patch would enable those features.

>> diff --git a/arch/arm/boot/dts/tegra124.dtsi b/arch/arm/boot/dts/tegra124.dtsi
>> index 13cc7ca..de07d7e 100644
>> --- a/arch/arm/boot/dts/tegra124.dtsi
>> +++ b/arch/arm/boot/dts/tegra124.dtsi
>> @@ -918,31 +918,40 @@
>>               #address-cells = <1>;
>>               #size-cells = <0>;
>>
>> -             cpu at 0 {
>> +             A15_0: cpu at 0 {
>>                       device_type = "cpu";
>>                       compatible = "arm,cortex-a15";
>>                       reg = <0>;
>>               };
>>
>> -             cpu at 1 {
>> +             A15_1: cpu at 1 {
>>                       device_type = "cpu";
>>                       compatible = "arm,cortex-a15";
>>                       reg = <1>;
>>               };
>>
>> -             cpu at 2 {
>> +             A15_2: cpu at 2 {
>>                       device_type = "cpu";
>>                       compatible = "arm,cortex-a15";
>>                       reg = <2>;
>>               };
>>
>> -             cpu at 3 {
>> +             A15_3: cpu at 3 {
>>                       device_type = "cpu";
>>                       compatible = "arm,cortex-a15";
>>                       reg = <3>;
>>               };
>>       };
>>
>> +     pmu {
>> +             compatible = "arm,cortex-a15-pmu";
>> +             interrupts = <GIC_SPI 144 IRQ_TYPE_LEVEL_HIGH>,
>> +                          <GIC_SPI 145 IRQ_TYPE_LEVEL_HIGH>,
>> +                          <GIC_SPI 146 IRQ_TYPE_LEVEL_HIGH>,
>> +                          <GIC_SPI 147 IRQ_TYPE_LEVEL_HIGH>;
>> +             interrupt-affinity = <&A15_0>, <&A15_1>, <&A15_2>, <&A15_3>;
>
> These labels look somewhat artificial to me, perhaps we could do
> something like the following instead?
>
>         interrupt-affinity = <&{/cpus/cpu at 0}>, ...;
>
> That's slightly more obvious and avoids the need to "invent" labels for
> the CPUs.
>
> No need to respin, I can fix that up when applying if nobody objects to
> using the alternative notation.
>
> Thierry

I have no objections.  I was not aware that the device tree syntax
supported that.  FWIW I cargo-culted my way to victory from
vexpress-v2p-ca9.dts here.

- Kyle



More information about the linux-arm-kernel mailing list