[bug report] WARNING: CPU: 0 PID: 1 at kernel/resource.c:834 __insert_resource+0x84/0x110

Changhui Zhong czhong at redhat.com
Wed Mar 20 02:06:46 PDT 2024


On Tue, Mar 19, 2024 at 10:43 PM Baoquan He <bhe at redhat.com> wrote:
>
> On 03/19/24 at 01:59pm, Ilpo Järvinen wrote:
> > On Tue, 19 Mar 2024, Changhui Zhong wrote:
> >
> > > Hello,
> > >
> > > found a kernel warning issue at "kernel/resource.c:834
> > > __insert_resource+0x84/0x110" ,please help check,
> > >
> > > repo:https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
> > > branch: master
> > > commit HEAD:f6cef5f8c37f58a3bc95b3754c3ae98e086631ca
> > >
> > >  [    0.130164] ------------[ cut here ]------------
> > > [    0.130370] WARNING: CPU: 0 PID: 1 at kernel/resource.c:834
> > > __insert_resource+0x84/0x110
> > > [    0.131364] Modules linked in:
> > > [    0.132364] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 6.8.0+ #1
> > > [    0.133365] Hardware name: Dell Inc. PowerEdge R640/06DKY5, BIOS
> > > 2.15.1 06/15/2022
> > > [    0.134364] RIP: 0010:__insert_resource+0x84/0x110
> > > [    0.135364] Code: d0 4c 39 c1 76 b1 c3 cc cc cc cc 4c 8d 4a 30 48
> > > 8b 52 30 48 85 d2 75 b7 48 89 56 30 49 89 31 48 89 46 28 31 c0 c3 cc
> > > cc cc cc <0f> 0b 48 89 d0 c3 cc cc cc cc 49 89 d2 eb 1a 4d 39 42 08 77
> > > 19 4d
> > > [    0.136363] RSP: 0000:ffffb257400dfe08 EFLAGS: 00010246
> > > [    0.137363] RAX: ffff9e147ffca640 RBX: 0000000000000000 RCX: 0000000026000000
> > > [    0.138363] RDX: ffffffff86c45ee0 RSI: ffffffff86c45ee0 RDI: 0000000026000000
> > > [    0.139363] RBP: ffffffff8684d120 R08: 0000000035ffffff R09: 0000000035ffffff
> > > [    0.140363] R10: 000000002f31646f R11: 0000000059a7ffee R12: ffffffff86c45ee0
> > > [    0.141363] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
> > > [    0.142363] FS:  0000000000000000(0000) GS:ffff9e1277800000(0000)
> > > knlGS:0000000000000000
> > > [    0.143363] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > > [    0.144363] CR2: ffff9e1333601000 CR3: 0000000332220001 CR4: 00000000007706f0
> > > [    0.145363] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> > > [    0.146363] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
> > > [    0.147363] PKRU: 55555554
> > > [    0.148363] Call Trace:
> > > [    0.149364]  <TASK>
> > > [    0.150365]  ? __warn+0x7f/0x130
> > > [    0.151363]  ? __insert_resource+0x84/0x110
> > > [    0.152364]  ? report_bug+0x18a/0x1a0
> > > [    0.153364]  ? handle_bug+0x3c/0x70
> > > [    0.154363]  ? exc_invalid_op+0x14/0x70
> > > [    0.155363]  ? asm_exc_invalid_op+0x16/0x20
> > > [    0.156364]  ? __insert_resource+0x84/0x110
> > > [    0.157364]  ? add_device_randomness+0x75/0xa0
> > > [    0.158363]  insert_resource+0x26/0x50
> > > [    0.159364]  ? __pfx_insert_crashkernel_resources+0x10/0x10
> > > [    0.160363]  insert_crashkernel_resources+0x62/0x70
> >
> > Hi,
> >
> > This seems related to crashkernel stuff, I added a few Ccs related to
> > it.
> >
> > I don't know why you sent this only to linux-pci list as it seems likely
> > to be entirely unrelated to PCI.
>
> Too few info is provided. I guess this is happening on x86_64. Do you
> have the kernel config, and what kernel you are testing? What operation
> are you taking to trigger this?

yes,my server is x86_64 platform,base OS is RHEL9.5,the default kernel
is  5.14.0-428.el9.x86_64,
and this issue is triggered after compile and installed the upstream
kernel(6.8.0+)then reboot the machine.
I don't have the kernel config file now, if needed I can reinstall and
collect it,

>
> Below commit could be suspect, but not sure if it's the real criminal.
>
> commit 4a693ce65b186fddc1a73621bd6f941e6e3eca21
> Author: Huacai Chen <chenhuacai at kernel.org>
> Date:   Fri Dec 29 16:02:13 2023 +0800
>
>     kdump: defer the insertion of crashkernel resources
>
> Dave reported a similar one, he did kexec reboot firstly, then in 2nd
> kernel crashkernel reservation will trigger the iomem inserting error.
>
> [PATCH] x86/kexec: do not update E820 kexec table for setup_data
> https://lore.kernel.org/all/ZeZ2Kos-OOZNSrmO@darkstar.users.ipa.redhat.com/T/#u
>
> Can you try Dave's patch firstly? If it doesn't work, try reverting
> above Huacai's patch? it may need manual editing.
>

I don't know how to revert Huacai's patch, I don't know much about it,
so,I try to apply Dave‘s patch,but it failed:
```
patching file arch/x86/kernel/e820.c
Hunk #1 FAILED at 1015.
Hunk #2 succeeded at 1038 (offset 2 lines).
Hunk #3 succeeded at 1048 (offset 2 lines).
1 out of 3 hunks FAILED -- saving rejects to file arch/x86/kernel/e820.c.rej
```
maybe Dave need to write a new version to fix it,




More information about the kexec mailing list