kdump/kexec on EFI-enabled x2apic platforms

Jack Steiner steiner at sgi.com
Mon Mar 29 16:01:04 EDT 2010


All -

I just started debugging kdump/kexec on our UV platform and
have run into some problems. I suspect others have encountered these
same or similar problems. Any help would be appreciated.



Our platform uses EFI boot. It is Nehalem based & has a large number of cpus.
The BIOS enables x2apic mode and the kernel runs with interrupt remapping enabled.
Note that some apicids have more than 8 bits - x2apic mode is required.


I am able to successfully kexec the dump kernel but run into several problems.

	- because the initial kernel boots using EFI, BIOS does not build the legacy
	  tables that are required to locate the RSDP using the legacy method in
	  acpi_find_root_pointer(). (When booting with EFI, acpi_find_root_pointer() is
	  not used. The ACPI tables are found from pointers in EFI tables.)

	- it appears that kdump/kexec intentionally boots the kdump kernel
	  in a mode does does enable efi mode. (Am I correct here???)
	  This avoids the issues with EFI virtual mode. However, the result
	  is that ACPI tables are not found. From the dump kernel:

	  	ACPI Error: A valid RSDP was not found (20090903/tbxfroot-222)

	- Because ACPI tables are not found, the dump kernel does not transition
	  into x2apic mode. The hardware, however, is still in x2apic mode from the
	  initial kernel. Later in the boot of the dump kernel, read_apic_id()
	  tried to read memory-mapped apic registers instead of the MSRs that are
	  used in x2apic mode. This is not allowed & the dump kernel panics with:

		[    0.000000]  [<ffffffff81b52195>] early_idt_handler+0x55/0x68
		[    0.000000]  [<ffffffff8101f393>] ? native_apic_mem_read+0x3/0x10
		[    0.000000]  [<ffffffff8101a3c6>] ? read_apic_id+0x16/0x30
		[    0.000000]  [<ffffffff81b5f857>] init_apic_mappings+0xe7/0x137
		[    0.000000]  [<ffffffff81b559fd>] setup_arch+0x900/0xc33
		[    0.000000]  [<ffffffff81b52bae>] start_kernel+0x6f/0x4a1



I checked an Intel Nehalem whitebox using the Intel BIOS. The dump kernel does not
find the RDSP but the initial kernel does not enable x2apic mode either (possibly
because of an old BIOS - not sure).  As a result, the dump kernel does not hit
the panic shown above. The kdump kernel successfully boots w/o having discovered
ACPI tables.


How should I proceed?
	- should I be running the dump kernel with EFI mode enabled?
	- should I be fixing the issues with x2apic mode in a non-EFI dump kernel?
	- or should BIOS be building the tables necessary to support both EFI & non-EFI boot.


--- jack


Jack Steiner (steiner at sgi.com)		SGI - Silicon Graphics, Inc.




More information about the kexec mailing list