Failure to crashdump under Centos-6.4 x86_64 with kernel 3.12.30
Dave Anderson
anderson at redhat.com
Thu Jan 22 12:50:24 PST 2015
----- Original Message -----
> My kernel crashdumps don't work with vanilla 3.12.30 on a 256GB x86_64 intel box.
> Centos-6.4. With the standard makedumpfile, makedumpfile segfaults, and no output is written to disk.
> with makedumpfile 1.5.7, kexec-tools-2.0.8, and crash 7.0.2 I actually get vmcore files,
> but then crash rejects them with:
>
> crash vmcore /boot/vmlinux-3.12.30
>
> crash 7.0.2
> ...
> GNU gdb (GDB) 7.6
> ...
>
> crash: seek error: kernel virtual address: ffff88207fffcb60 type: "pglist node_id"
With respect to the crash error, since it's a seek error, it means that the page for
physical address 0x207fffcb60 (~130GB) is not included in the dumpfile. Are you getting any
other error messages prior to that one? Getting some debug output might show something
else of interest, for example by entering:
$ crash -d4 vmcore /boot/vmlinux-3.12.30
Also, for a sanity check, what happens when you do this:
$ crash --minimal vmcore /boot/vmlinux-3.12.30
...
crash> rd -a linux_banner
It should show something like this (taken from the only 3.12 dumpfile I have):
crash> rd -a linux_banner
ffffffff818000a0: Linux version 3.12.0-rc2+ (root at vm9-f19) (gcc version 4.8.1
ffffffff818000dc: 20130603 (Red Hat 4.8.1-1) (GCC) ) #2 SMP Wed Nov 6 13:25:13
ffffffff81800118: EST 2013
crash>
Dave
>
> the grub entry for this kernel is:
> kernel /boot/vmlinuz-3.12.30 ro root=LABEL=ROOT
> numa_balancing=disable intel_idle.max_cstate=0 processor.max_cstate=0
> mce=ignore_mce clocksource=tsc nohz=off nomodeset nodmraid rd_NO_LUKS
> rd_NO_LVM rd_NO_MD rd_NO_DM LANG=en_US.UTF-8 SYSFONT=latarcyrheb-sun16
> KEYBOARDTYPE=pc KEYTABLE=us crashkernel=256M at 256M console=ttyS1,115200n8
>
>
> CONFIG_CRASH_DUMP=y
> CONFIG_KEXEC_JUMP=y
> CONFIG_PHYSICAL_START=0x2000000
> CONFIG_RELOCATABLE=y
> CONFIG_PHYSICAL_ALIGN=0x1000000
>
> A similar issue appears with a grub crashkernel=128M at 32M
>
> Turning off all the makedumpfile exclusions (dump all pages) results in a much larger
> vmcore, but it still gets the same failures.
>
> Where do I start looking for the problems here?
> kernel 3.10.65 won't dump either, but an old 3.0.29 kernel dumps just fine.
> Is this a problem with some .config setting?
>
> Berkley
More information about the kexec
mailing list