[PATCH v6 1/4] nmi_backtrace: add more trigger_*_cpu_backtrace() methods

Chris Metcalf cmetcalf at mellanox.com
Mon Aug 8 08:49:57 PDT 2016


On 8/8/2016 9:57 AM, Petr Mladek wrote:
> On Thu 2016-07-14 16:50:29, Chris Metcalf wrote:
>> Currently you can only request a backtrace of either all cpus, or
>> all cpus but yourself.  It can also be helpful to request a remote
>> backtrace of a single cpu, and since we want that, the logical
>> extension is to support a cpumask as the underlying primitive.
>>
>> This change modifies the existing lib/nmi_backtrace.c code to take
>> a cpumask as its basic primitive, and modifies the linux/nmi.h code
>> to use either the old "all/all_but_self" arch methods, or the new
>> "cpumask" method, depending on which is available.
> I triggered this function using
>    echo l >/proc/sysrq-trigger
>
>
> and got
>
> [  270.791328] -----------  All but itself: ---------------------
>
> [  270.791331] ===============================
> [  270.791331] [ INFO: suspicious RCU usage. ]
> [  270.791333] 4.8.0-rc1-4-default+ #3086 Not tainted
> [  270.791333] -------------------------------
> [  270.791335] ./include/linux/rcupdate.h:556 Illegal context switch in RCU read-side critical section!

Ah hah, you tested this with CPUMASK_OFFSTACK, which I didn't.
That explains why you got RCU kmalloc warnings.

>> +	cpumask_copy(mask, cpu_online_mask);
>> +	cpumask_clear_cpu(cpu, mask);
>> +	arch_trigger_cpumask_backtrace(mask);
>> +	put_cpu();
>> +	free_cpumask_var(mask);
>> +	return true;
> Also this looks too much code for an inlined function.
> It is rather slow and there is not a big gain. I would move
> the definition to lib/nmi_backtrace.c.

After some thought, I ended up just removing both cpumask allocation
sites.  For the allbutself() case, I just re-introduced the "include_self"
boolean that the code used to have.  If it is false when we get into the inner
nmi_trigger_cpumask_backtrace(), I just clear the cpu bit of the current
cpu.  It requires passing a funny boolean around with the mask, but the
alternative (if we don't want to allocate a mask on this path) is to
break apart the nmi_trigger_cpumask_backtrace() function so we can
piggy-back on its locking and its cpumask and set up the cpumask the
way we want, which I think is too much added ugliness.

For the trigger_single_cpu_backtrace() case, I remembered that there was
a cpumask_of() function that we can use that is fast and doesn't allocate,
even with CPUMASK_OFFSTACK, so I just used that instead.

> PS: I am sorry for sending this so late in the game. I was
> curious why the patch had not been upstream yet and. I made
> a closer look to give a Reviewed-by tag...

No worries - even a late review is much better than none!  I'll
send v7 shortly and please do let me know if it works for you.

-- 
Chris Metcalf, Mellanox Technologies
http://www.mellanox.com




More information about the linux-arm-kernel mailing list