[bug] aarch64 host no longer boots after 767507654c22 ("arch_numa: switch over to numa_memblks")
Zi Yan
ziy at nvidia.com
Tue Oct 29 08:07:21 PDT 2024
+tegra mailing list and maintainers
On 29 Oct 2024, at 8:47, Jan Stancek wrote:
> Hi,
>
> I'm seeing a regression on Nvidia IGX system, which no longer boots.
>
> bisect points at commit 767507654c22 ("arch_numa: switch over to numa_memblks").
> It hangs very early, with 4k or 64k pages, with no kernel messages printed:
>
> EFI stub: Booting Linux Kernel...
> EFI stub: Using DTB from configuration table
> EFI stub: Exiting boot services...
> <hangs here>
>
Is it possible to have earlycon output? It is hard to debug without any
information except kernel fails to boot.
Since the previous commit boots and I assume both kernels are compiled
with the same gcc toolchain, this should not be caused by the binuils
bug in 2.42[1]. Is your binutils version 2.42?
Thanks.
[1] https://sourceware.org/bugzilla/show_bug.cgi?id=31924
> Here's a log from successful boot with previous commit:
> https://people.redhat.com/jstancek/aarch64_numa_boot/console-log-good.txt
> and config: https://people.redhat.com/jstancek/aarch64_numa_boot/config
>
> # lscpu
> Architecture: aarch64
> CPU op-mode(s): 32-bit, 64-bit
> Byte Order: Little Endian
> CPU(s): 12
> On-line CPU(s) list: 0-11
> Vendor ID: ARM
> BIOS Vendor ID: NVIDIA
> Model name: Cortex-A78AE
> BIOS Model name: Not Specified Not Specified CPU @ 0.0GHz
> BIOS CPU family: 257
> Model: 1
> Thread(s) per core: 1
> Core(s) per cluster: 12
> Socket(s): 1
> Cluster(s): 1
> Stepping: r0p1
> CPU(s) scaling MHz: 100%
> CPU max MHz: 1971.2000
> CPU min MHz: 115.2000
> BogoMIPS: 62.50
> Flags: fp asimd evtstrm aes pmull sha1 sha2 crc32
> atomics fphp asimdhp cpuid asimdrdm lrcpc dcpop asimddp uscat ilrcpc
> flagm paca pacg
> Caches (sum of all):
> L1d: 768 KiB (12 instances)
> L1i: 768 KiB (12 instances)
> L2: 3 MiB (12 instances)
> L3: 6 MiB (3 instances)
> NUMA:
> NUMA node(s): 1
> NUMA node0 CPU(s): 0-11
> Vulnerabilities:
> Gather data sampling: Not affected
> Itlb multihit: Not affected
> L1tf: Not affected
> Mds: Not affected
> Meltdown: Not affected
> Mmio stale data: Not affected
> Reg file data sampling: Not affected
> Retbleed: Not affected
> Spec rstack overflow: Not affected
> Spec store bypass: Mitigation; Speculative Store Bypass
> disabled via prctl
> Spectre v1: Mitigation; __user pointer sanitization
> Spectre v2: Mitigation; CSV2, BHB
> Srbds: Not affected
> Tsx async abort: Not affected
>
> Regards,
> Jan
Best Regards,
Yan, Zi
More information about the linux-arm-kernel
mailing list