next/master boot: 273 boots: 63 failed, 209 passed with 1 untried/unknown (next-20171106)

Jon Hunter jonathanh at nvidia.com
Fri Nov 10 01:18:55 PST 2017


On 09/11/17 22:57, Ben Skeggs wrote:
> On 11/10/2017 08:54 AM, Jon Hunter wrote:
>>
>> On 09/11/17 21:45, Jon Hunter wrote:
>>>
>>> On 09/11/17 19:03, Guillaume Tucker wrote:
>>> ...
>>>
>>>> Alright, so here's all the results I got all based on
>>>> next-20171109 and running on tegra124-nyan-big:
>>>>
>>>>   * plain multi_v7_defconfig, passes:
>>>>     https://lava.collabora.co.uk/scheduler/job/981295
>>>>
>>>>   * CONFIG_MODULES disabled, fails:
>>>>     https://lava.collabora.co.uk/scheduler/job/981342
>>>>
>>>>   * CONFIG_MODULES and CONFIG_DRM_NOUVEAU disabled, also fails:
>>>>     https://lava.collabora.co.uk/scheduler/job/981343
>>>
>>> This is the crash in the EC driver that I mentioned before. You need to
>>> add the fix for the EC driver to avoid this BUG_ON.
>>>
>>> I was able to bisect this manually dancing around the various bugs and
>>> it points to this commit ...
>>>
>>> commit 7313cfa4f6e30384fa04083698d1e865cf812a6a
>>> Author: Ben Skeggs <bskeggs at redhat.com>
>>> Date:   Wed Nov 1 03:56:19 2017 +1000
>>>
>>>     drm/nouveau/bar: move bar1 initialisation into its own function
>>>
>>>
>>> Unfortunately, I cannot revert cleanly on top of next-20171109 and so I
>>> cannot confirm.
>>>
>>> Ben, we are seeing a hang on Tegra when booting with CONFIG_DRM_NOUVEAU
>>> enabled. Apart from the above bisect result, I don't have much else to
>>> go on at the moment. Let me know if you have any thoughts or anything to
>>> test.
>>
>> Here is part of the crash dump I see ...
> 
> Hey,
> 
> Oops, I went to great care to make that series bisectable, but
> apparently this slipped through the cracks.
> 
> 48fe02478a0ddb89930f3595f8217fa2dfd98796 should fix that crash.

Thanks Ben. However, looking at next-20171109 this one is already in.
So maybe the bisect is still not getting me to the current issue. When
booting next-20171109 the last thing I see is ...

[    2.228178] nouveau 57000000.gpu: NVIDIA GK20A (0ea000a1)
[    2.233634] nouveau 57000000.gpu: imem: using IOMMU
[    2.238572] nouveau 57000000.gpu: Direct firmware load for nvidia/gk20a/fecs_inst.bin failed with error -2
[    2.248295] nouveau 57000000.gpu: Direct firmware load for nouveau/nvea_fuc409c failed with error -2
[    2.257479] nouveau 57000000.gpu: Direct firmware load for nouveau/fuc409c failed with error -2
[    2.266189] nouveau 57000000.gpu: gr: failed to load fuc409c

So no crash. I did see the crash after the bisect, but not in top of
tree. It appears to hang after the nouveau probe fails. Any thoughts
on how to debug further?

Cheers
Jon

-- 
nvpublic



More information about the linux-arm-kernel mailing list