[BUG] zynq | CCF | SRCU

Sören Brinkmann soren.brinkmann at xilinx.com
Fri May 31 17:10:05 EDT 2013


Hi Mike,

On Fri, May 31, 2013 at 12:52:35PM -0700, Mike Turquette wrote:
> Quoting Sören Brinkmann (2013-05-31 12:12:07)
> > Hi,
> > 
> > we recently encountered some kernel panics when we compiled one of our
> > drivers as module and tested inserting/removing the module.
> > Trying to debug this issue, I could reproduce it on the mainline kernel
> > with a dummy module.
> > 
> > What happens is, that when on driver remove clk_notifier_unregister() is
> > called and no other notifier for that clock is registered, the kernel
> > panics.
> > I'm not sure what is going wrong here. If there is a bug (and if where)
> > or I'm just using the infrastructure the wrong way,... So, any hint is
> > appreciated.
> > 
> > I attach the output from the crashing system. The stacktrace indicates a
> > crash in 'srcu_readers_seq_idx()'.
> > I also attach the module I used to trigger the issue and a patch on top
> > of mainline commit a93cb29acaa8f75618c3f202d1cf43c231984644 which has
> > the DT modifications I need to make the module find its clock and boot
> > with my initramfs.
> > 
> 
> Soren,
> 
> I only took a quick look at this so the following is a shot in the dark.
> notifier_block->next should be protected by an RCU lock, and the way you
> open-code the initialization struck me as a bit weird.  Can you change
> your code to the following and let me know if it makes any difference?
> 
> static struct notifier_block nb = {
>         .notifier_call = clk_notif_dbg_cb;
> };
> 
> static int clk_notif_dbg_cb(struct notifier_block *nb,
>                 unsigned long event, void *data)
> {
>         pr_info("clk_notif_dbg_cb\n");
> 
>         return NOTIFY_OK;
> }
> 
> static int clk_notif_dbg_probe(struct platform_device *pdev)
> {
>         ...
>         if (clk_notifier_register(clk, &nb))
>                 dev_warn(&pdev->dev, "clk_notifier_register failed\n");
>         ...
> 
> 
> That is a small difference, but that style of initializing the
> notifier_block has always worked for me when using clk rate-change
> notifiers.  However I'm sure the bug you mention is far more evil and
> nefarious than that ;-)
I changed the module like this (I hope this is what you suggested):
	8,9d7
	< static struct notifier_block nb;
	< 
	17a16,19
	> static struct notifier_block nb = {
	>       .notifier_call = clk_notif_dbg_cb
	> };
	> 
	20,22d21
	<       nb.next = NULL;
	<       nb.notifier_call = clk_notif_dbg_cb;
	<

The result is the same. It crashes in srcu_readers_seq_idx().

	Thanks,
	Sören





More information about the linux-arm-kernel mailing list