[PATCH V2] notifier/panic: Introduce panic_notifier_filter

Andy Shevchenko andriy.shevchenko at linux.intel.com
Fri Jan 7 04:08:22 PST 2022


On Thu, Jan 06, 2022 at 05:00:07PM -0300, Guilherme G. Piccoli wrote:
> The kernel notifier infrastructure allows function callbacks to be
> added in multiple lists, which are then called in the proper time,
> like in a reboot or panic event. The panic_notifier_list specifically
> contains the callbacks that are executed during a panic event. As any
> other notifier list, the panic one has no filtering and all functions
> previously registered are executed.
> 
> The kdump infrastructure, on the other hand, enables users to set
> a crash kernel that is kexec'ed in a panic event, and vmcore/logs
> are collected in such crash kernel. When kdump is set, by default
> the panic notifiers are ignored - the kexec jumps to the crash kernel
> before the list is checked and callbacks executed.
> 
> There are some cases though in which kdump users might want to
> allow panic notifier callbacks to execute _before_ the kexec to
> the crash kernel, for a variety of reasons - for example, users
> may think kexec is very prone to fail and want to give a chance
> to kmsg dumpers to run (and save logs using pstore), or maybe
> some panic notifier is required to properly quiesce some hardware
> that must be used to the crash kernel. For these cases, we have
> the kernel parameter "crash_kexec_post_notifiers".
> 
> But there's a problem: currently it's an "all-or-nothing" situation,
> the kdump user choice is either to execute all panic notifiers or
> none of them. Given that panic notifiers may increase the risk of a
> kdump failure, this is a tough decision and may affect the debug of
> hard to reproduce bugs, if for some reason the user choice is to
> enable panic notifiers, but kdump then fails.
> 
> So, this patch aims to ease this decision: we hereby introduce a filter
> for the panic notifier list, in which users may select specifically
> which callbacks they wish to run, allowing a safer kdump. The allowlist
> should be provided using the parameter "panic_notifier_filter=a,b,..."
> where a, b are valid callback names. Invalid symbols are discarded.
> 
> Currently up to 16 symbols may be passed in this list, we consider
> that this numbers allows enough flexibility (and no matter what
> architecture is used, at most 30 panic callbacks are registered).
> In an experiment using a qemu x86 virtual machine, by default only
> six callbacks are registered in the panic notifier list.
> Once a valid callback name is provided in the list, such function
> is allowed to be registered/unregistered in the panic_notifier_list;
> all other panic callbacks are ignored. Notice that this filter is
> only for the panic notifiers and has no effect in the other notifiers.

...

> +static int __init panic_notifier_filter_setup(char *buf)
> +{
> +	char *func;
> +	unsigned long addr;
> +
> +	if (!buf)
> +		return -EINVAL;
> +
> +	while (buf) {
> +		func = strsep(&buf, ",");

Don't we have a parser of this format already?
Anyway, you may reduce code by

	unsigned long addr;
	char *func = buf;

	while ((func = strsep(&func, ",")) {

> +		addr = kallsyms_lookup_name(func);
> +
> +		if (!addr) {
> +			pr_info("panic_notifier_filter: invalid symbol %s\n", func);
> +			continue;
> +		}
> +
> +		if (panic_nf_count < PANIC_NF_MAX) {
> +			panic_nf_functions[panic_nf_count] = addr;
> +			panic_nf_count++;
> +			pr_debug("panic_notifier_filter: added symbol %s\n", func);
> +		} else {
> +			pr_warn("panic_notifier_filter: exceeded maximum notifiers (%d), aborting\n",
> +				PANIC_NF_MAX);
> +			panic_nf_count = 0;
> +			break;
> +		}
> +	}
> +
> +	return 0;
> +}
> +early_param("panic_notifier_filter", panic_notifier_filter_setup);

-- 
With Best Regards,
Andy Shevchenko





More information about the kexec mailing list