[PATCH v6 4/8] crash: add generic infrastructure for crash hotplug support
Eric DeVolder
eric.devolder at oracle.com
Wed Apr 13 05:37:16 PDT 2022
On 4/12/22 21:41, Baoquan He wrote:
> On 04/11/22 at 08:54am, Eric DeVolder wrote:
>>
>>
>> On 4/11/22 04:20, Baoquan He wrote:
>>> Hi Eric,
>>>
>>> On 04/01/22 at 02:30pm, Eric DeVolder wrote:
>>> ... ...
>>>
>>>> +static void crash_hotplug_handler(unsigned int hp_action,
>>>> + unsigned long a, unsigned long b)
>>>
>>> I am still struggling to consider if these unused parameters should be
>>> kept or removed. Do you foresee or feel on which ARCH they could be used?
>>>
>>> Considering our elfcorehdr updating method, once memory or cpu changed,
>>> we will update elfcorehdr and cpu notes to reflect all existing memory
>>> regions and cpu in the current system. We could end up with having them
>>> but never being used. Then we may finally need to clean them up.
>>>
>>> If you have investigated and foresee or feel they could be used on a
>>> certain architecture, we can keep them for the time being.
>>
>> So 'hp_action' and 'a' are used within the existing patch series.
>> In crash_core.c, there is this bit of code:
>>
>> + kexec_crash_image->offlinecpu =
>> + (hp_action == KEXEC_CRASH_HP_REMOVE_CPU) ?
>> + (unsigned int)a : ~0U;
>>
>> which is referencing both 'hp_action' and using 'a' from the cpu notifier handler.
>> I looked into removing 'a' and setting offlinecpu directly, but I thought
>> it better that offlinecpu be set within the safety of the kexec_mutex.
>> Also, Sourabh Jain's work with PowerPC utilizing this framework directly
>> references hp_action in the arch-specific handler.
>>
>> The cpu and memory notifier handlers set hp_action accordingly. For cpu handler,
>> the 'a' is set with the impacted cpu. For memory handler, 'a' and 'b' form the
>> impacted memory range. I agree it looks like the memory range is currently
>> not useful.
>
> OK, memory handler doesn't need the action, memory regions. While cpu
> handler needs it to exclude the hot plugged cpu.
>
> We could have two ways to acheive this as below. How do you think about
> them?
>
> static void crash_hotplug_handler(unsigned int hp_action,
> unsigned long cpu)
>
> static int crash_memhp_notifier(struct notifier_block *nb,
> unsigned long val, void *v)
> {
> ......
> switch (val) {
> case MEM_ONLINE:
> crash_hotplug_handler(KEXEC_CRASH_HP_ADD_MEMORY,
> -1UL);
> break;
>
> case MEM_OFFLINE:
> crash_hotplug_handler(KEXEC_CRASH_HP_REMOVE_MEMORY,
> -1UL);
> break;
> }
> return NOTIFY_OK;
> }
>
> static int crash_cpuhp_online(unsigned int cpu)
> {
> crash_hotplug_handler(KEXEC_CRASH_HP_ADD_CPU, cpu);
> return 0;
> }
>
> static int crash_cpuhp_offline(unsigned int cpu)
> {
> crash_hotplug_handler(KEXEC_CRASH_HP_REMOVE_CPU, cpu);
> return 0;
> }
I'm OK with the above. Shall I post v7 or are you still looking at patches 7 and 8?
Thanks!
Eric
>
> OR,
>
> static void crash_hotplug_handler(unsigned int hp_action,
> int* cpu)
>
> static int crash_cpuhp_online(unsigned int cpu)
> {
> crash_hotplug_handler(KEXEC_CRASH_HP_ADD_CPU, NULL);
> return 0;
> }
>
> static int crash_cpuhp_offline(unsigned int cpu)
> {
> int dead_cpu = cpu;
> crash_hotplug_handler(KEXEC_CRASH_HP_REMOVE_CPU, &cpu);
> return 0;
> }
>
More information about the kexec
mailing list