RISC-V regression on Linux 6.7-rc1

Lad, Prabhakar prabhakar.csengg at gmail.com
Mon Dec 4 12:28:55 PST 2023


On Mon, Dec 4, 2023 at 1:16 PM Emil Renner Berthing
<emil.renner.berthing at canonical.com> wrote:
>
> Lad, Prabhakar wrote:
> > Hi,
> >
> > On Mon, Nov 20, 2023 at 3:54 PM Ron Economos <re at w6rz.net> wrote:
> > >
> > > Linux 6.7-rc1 fails to boot on the HiFive Unmatched running Ubuntu
> > > 22.04. During boot, there are many random oops and kernel panics.
> > > Reverting the patch series "riscv: Add remaining module relocations and
> > > tests" (commit b51fc88cb35e49) resolves the issue.
> > >
> > > Here's an example, but it does something different on each boot.
> > >
> > > Nov 17 21:33:38 riscv64 kernel: Unable to handle kernel paging request
> > > at virtual address ffffff97e6cb45fa
> > > Nov 17 21:33:38 riscv64 kernel: Unable to handle kernel paging request
> > > at virtual address 0000005b7d944e0e
> > > Nov 17 21:33:38 riscv64 kernel: Oops [#1]
> > > Nov 17 21:33:38 riscv64 kernel: Modules linked in: sch_fq_codel
> > > auth_rpcgss nfs_acl drm(+) lockd grace backlight sunrpc efi_pstore
> > > ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 >
> > > Nov 17 21:33:38 riscv64 kernel: CPU: 1 PID: 391 Comm: cloud-init Not
> > > tainted 6.7.0-rc1 #2
> > > Nov 17 21:33:38 riscv64 kernel: Hardware name: SiFive HiFive Unmatched
> > > A00 (DT)
> > > Nov 17 21:33:38 riscv64 kernel: epc : refill_obj_stock+0x4e/0x160
> > > Nov 17 21:33:38 riscv64 kernel:  ra : refill_obj_stock+0x4e/0x160
> > > Nov 17 21:33:38 riscv64 kernel: epc : ffffffff802d9264 ra :
> > > ffffffff802d9264 sp : ffffffd898e2fd10
> > > Nov 17 21:33:38 riscv64 kernel:  gp : ffffffff81c4bdd8 tp :
> > > ffffffd884f9ec00 t0 : 0000000000000000
> > > Nov 17 21:33:38 riscv64 kernel:  t1 : 0000000000000000 t2 :
> > > 0000000000000000 s0 : ffffffd898e2fd60
> > > Nov 17 21:33:38 riscv64 kernel:  s1 : ffffffdbfed02f70 a0 :
> > > ffffffd89a2b1ec0 a1 : 0000000000000000
> > > Nov 17 21:33:38 riscv64 kernel:  a2 : 0000000000000000 a3 :
> > > 0000000000000000 a4 : 0000000000000000
> > > Nov 17 21:33:38 riscv64 kernel:  a5 : 0000000000000000 a6 :
> > > 0000000000000000 a7 : 0000000000000000
> > > Nov 17 21:33:38 riscv64 kernel:  s2 : 31413797e6cb45fa s3 :
> > > 0000000000000108 s4 : 0000000200000022
> > > Nov 17 21:33:38 riscv64 kernel:  s5 : ffffffff81cbb1e8 s6 :
> > > ffffffd884f9ec00 s7 : 0000003fc0325028
> > > Nov 17 21:33:38 riscv64 kernel:  s8 : 0000000000000000 s9 :
> > > 0000002b10fdc170 s10: 0000002b10fe2290
> > > Nov 17 21:33:38 riscv64 kernel:  s11: 0000000000000000 t3 :
> > > 0000000000000000 t4 : 0000000000000000
> > > Nov 17 21:33:38 riscv64 kernel:  t5 : 0000000000000000 t6 : 0000000000000000
> > > Nov 17 21:33:38 riscv64 kernel: status: 0000000200000100 badaddr:
> > > ffffff97e6cb45fa cause: 000000000000000d
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff802d9264>]
> > > refill_obj_stock+0x4e/0x160
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff802dce26>]
> > > obj_cgroup_uncharge+0x1c/0x2a
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff802b97a6>]
> > > kmem_cache_free+0x1b2/0x548
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff802f358c>] __fput+0x132/0x252
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff802f3702>] ____fput+0x18/0x22
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80043e96>] task_work_run+0xa8/0xee
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff800bfc66>]
> > > exit_to_user_mode_loop.isra.0+0xf2/0x10e
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80c4f2ac>]
> > > syscall_exit_to_user_mode+0x54/0x64
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80c4eee0>]
> > > do_trap_ecall_u+0x5a/0x13a
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80c5a716>]
> > > ret_from_exception+0x0/0x66
> > > Nov 17 21:33:38 riscv64 kernel: Code: 639c 94be 689c 8963 0aa7 8526 f097
> > > ffff 80e7 26a0 (3783) 0009
> > > Nov 17 21:33:38 riscv64 kernel: ---[ end trace 0000000000000000 ]---
> > > Nov 17 21:33:38 riscv64 kernel: Oops [#2]
> > > Nov 17 21:33:38 riscv64 kernel: note: cloud-init[391] exited with irqs
> > > disabled
> > > Nov 17 21:33:38 riscv64 kernel: Modules linked in: sch_fq_codel
> > > auth_rpcgss nfs_acl drm(+) lockd grace backlight sunrpc efi_pstore
> > > ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 >
> > > Nov 17 21:33:38 riscv64 kernel: CPU: 2 PID: 376 Comm: modprobe Tainted:
> > > G      D            6.7.0-rc1 #2
> > > Nov 17 21:33:38 riscv64 kernel: Hardware name: SiFive HiFive Unmatched
> > > A00 (DT)
> > > Nov 17 21:33:38 riscv64 kernel: epc : __kmem_cache_alloc_node+0x286/0x2fa
> > > Nov 17 21:33:38 riscv64 kernel:  ra : __kmem_cache_alloc_node+0x5a/0x2fa
> > > Nov 17 21:33:38 riscv64 kernel: epc : ffffffff802bacb8 ra :
> > > ffffffff802baa8c sp : ffffffd889f07940
> > > Nov 17 21:33:38 riscv64 kernel:  gp : ffffffff81c4bdd8 tp :
> > > ffffffd898d84380 t0 : ffffffd88e1c3ae0
> > > Nov 17 21:33:38 riscv64 kernel:  t1 : 0000940000000000 t2 :
> > > 0000000000000000 s0 : ffffffd889f079a0
> > > Nov 17 21:33:38 riscv64 kernel:  s1 : ffffffd880001700 a0 :
> > > 26856bdb7d944dce a1 : 0000000000000717
> > > Nov 17 21:33:38 riscv64 kernel:  a2 : 0000000000008500 a3 :
> > > ffffffff81cbb1e8 a4 : 26856bdb7d944e0e
> > > Nov 17 21:33:38 riscv64 kernel:  a5 : 0e4e947ddb6b0026 a6 :
> > > 000000000000ff00 a7 : 0000000000000718
> > > Nov 17 21:33:38 riscv64 kernel:  s2 : 0000000000000000 s3 :
> > > 0000000000000dc0 s4 : 0000000000000080
> > > Nov 17 21:33:38 riscv64 kernel:  s5 : ffffffffffffffff s6 :
> > > ffffffff80466e7a s7 : ffffffff81c4c454
> > > Nov 17 21:33:38 riscv64 kernel:  s8 : 0000000000000dc0 s9 :
> > > ffffffff024528c8 s10: ffffffff81b1b7d0
> > > Nov 17 21:33:38 riscv64 kernel:  s11: ffffffff81b1b880 t3 :
> > > 0000000000000000 t4 : 0000000000000000
> > > Nov 17 21:33:38 riscv64 kernel:  t5 : 0000000000000000 t6 : ffffffd880d57554
> > > Nov 17 21:33:38 riscv64 kernel: status: 0000000200000120 badaddr:
> > > 0000005b7d944e0e cause: 000000000000000d
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff802bacb8>]
> > > __kmem_cache_alloc_node+0x286/0x2fa
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80247cac>] kmalloc_trace+0x30/0xac
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80466e7a>]
> > > eventfs_create_dir+0x46/0x158
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff8015c968>]
> > > event_create_dir+0xac/0x2e0
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff8015dcdc>]
> > > trace_module_notify+0x1d8/0x264
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80049e28>]
> > > notifier_call_chain+0x6c/0xe8
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80049f2c>]
> > > blocking_notifier_call_chain_robust+0x5a/0xc2
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff800c4390>]
> > > load_module+0x16dc/0x1d1a
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff800c4bfc>]
> > > init_module_from_file+0x82/0xc4
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff800c4dda>]
> > > __riscv_sys_finit_module+0x19c/0x33a
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80c4eed4>]
> > > do_trap_ecall_u+0x4e/0x13a
> > > Nov 17 21:33:38 riscv64 kernel: [<ffffffff80c5a716>]
> > > ret_from_exception+0x0/0x66
> > > Nov 17 21:33:38 riscv64 kernel: Code: 0813 f008 5613 0287 e7b3 0117 7633
> > > 0106 8893 0015 (6318) 8fd1
> > > Nov 17 21:33:38 riscv64 kernel: ---[ end trace 0000000000000000 ]---
> > > Nov 17 21:33:38 riscv64 systemd[1]: Finished Coldplug All udev Devices.
> > > Nov 17 21:33:38 riscv64 kernel: Unable to handle kernel paging request
> > > at virtual address 0000005b7d944e0e
> > > Nov 17 21:33:38 riscv64 kernel: Unable to handle kernel paging request
> > > at virtual address ffffff81e50b0eb6
> > > Nov 17 21:33:38 riscv64 kernel: Oops [#3]
> > >
> > Have there any patches posted to fix these issues? I'm still seeing
> > failures on RZ/Five (https://paste.debian.net/1300132/).
>
> This looks like it might be the module loading issue adressed here:
> https://lore.kernel.org/all/20231127-module_linking_freeing-v4-0-a2ca1d7027d0@rivosinc.com/
>
Thank you for the pointer. Patches do fix issues seen on RZ/Five.

Cheers,
Prabhakar



More information about the linux-riscv mailing list