mt8173-evb: failing to boot in linux-next and mainline
Eddie Huang
eddie.huang at mediatek.com
Mon Mar 28 04:51:16 PDT 2016
Hi Kevin,
On Thu, 2016-03-24 at 11:42 -0700, Kevin Hilman wrote:
> Hello,
>
> The mt8173-evb has been failing to boot in linux-next since 29 Feb,
> and is now causing failures in mainline[2] since the merge window is
> open and things are getting merged.
>
> Bisecting this has led to inconclusive results, so I'm still not sure
> what the real culprit is. The last few kernel messages for the failures are:
>
> [ 0.570979] NET: Registered protocol family 17
> [ 0.571615] 9pnet: Installing 9P2000 support
> [ 0.572211] Key type dns_resolver registered
> [ 0.573195] registered taskstats version 1
> [ 0.574170] hctosys: unable to open rtc device (rtc0)
> [ 0.581122] usb_vbus: disabling
>
> Thinking it was this regulator getting prematurely disabled, I set the
> 'regulator-always-on' property, which doesn't do much better, but does
> report a CPU stall:
>
> [ 0.556186] NET: Registered protocol family 17
> [ 0.556829] 9pnet: Installing 9P2000 support
> [ 0.557496] Key type dns_resolver registered
> [ 0.558669] registered taskstats version 1
> [ 0.559611] hctosys: unable to open rtc device (rtc0)
> [ 21.573504] INFO: rcu_preempt detected stalls on CPUs/tasks:
> [ 21.574227] 1-...: (1 GPs behind) idle=483/140000000000000/0 softirq=221/222 fqs=2638
> [ 21.575227] (detected by 3, t=5252 jiffies, g=-238, c=-239, q=204)
> [ 21.576017] Task dump for CPU 1:
> [ 21.576424] swapper/0 R running task 0 1 0 0x00000002
> [ 21.577316] Call trace:
> [ 21.577634] [<ffffff8008086c20>] __switch_to+0xd0/0xdc
> [ 21.578280] [<ffffffc077096900>] 0xffffffc077096900
> [ 33.321443] random: nonblocking pool is initialized
> [ 84.593487] INFO: rcu_preempt detected stalls on CPUs/tasks:
> [ 84.594207] 1-...: (1 GPs behind) idle=483/140000000000000/0 softirq=221/222 fqs=10861
> [ 84.595216] (detected by 3, t=21007 jiffies, g=-238, c=-239, q=204)
> [ 84.596017] Task dump for CPU 1:
> [ 84.596424] swapper/0 R running task 0 1 0 0x00000002
> [ 84.597315] Call trace:
> [ 84.597628] [<ffffff8008086c20>] __switch_to+0xd0/0xdc
> [ 84.598273] [<ffffffc077096900>] 0xffffffc077096900
>
> Anyways, I gave up on this for now hoping someone here would take a closer look.
>
> Thanks,
>
> Kevin
>
> [1] https://kernelci.org/boot/mt8173-evb/job/next/kernel/next-20160324/defconfig/defconfig/lab/lab-khilman/?_id=56f3b10759b51423934b0500
> [2] https://kernelci.org/boot/mt8173-evb/job/mainline/kernel/v4.5-12149-gaca04ce5dbda/defconfig/defconfig/lab/lab-khilman/?_id=56f3c33f59b5145aa14b04f2
>
Thanks for testing again. I can reproduce your case in my local
environment using 4.6-rc1. After checking, there are two ways to avoid
the hang:
1. Add clk_ignore_unused to the kernel command line
2. Revert patch "cc8ed76 soc: mediatek: SCPSYS: Fix double enabling of
regulators"
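As a rough illustration of the first workaround, here is a minimal sketch
that appends the parameter to a command line string only when it is not
already present. The function names and the example command line are
hypothetical; how the bootargs are actually set depends on the bootloader.

```python
def has_param(cmdline: str, param: str) -> bool:
    """Return True if `param` appears as a whole token in the command line."""
    return param in cmdline.split()


def add_param(cmdline: str, param: str = "clk_ignore_unused") -> str:
    """Append `param` to the command line unless it is already there."""
    if has_param(cmdline, param):
        return cmdline
    return f"{cmdline.strip()} {param}".strip()


if __name__ == "__main__":
    # Hypothetical command line; real bootargs depend on the board setup.
    print(add_param("console=ttyS0,921600n1 root=/dev/ram0"))
```

On a booted system, /proc/cmdline shows the command line the kernel was
started with, which is one way to confirm the parameter was passed.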
It seems there is a problem in the interaction between turning clocks
on/off and the power domains. We will try to fix this and send a patch
to the list once we have a solution.
Thanks
Eddie