kdump kernel hangs on 2-way AMD Opteron Intellistation

Ankita Garg ankita at in.ibm.com
Fri Aug 1 08:13:45 EDT 2008


Hi,

On Fri, Aug 01, 2008 at 12:16:53PM +0530, Ankita Garg wrote:
> Hi Vivek,
> 
> On Thu, Jul 31, 2008 at 09:46:52AM -0400, Vivek Goyal wrote:
> > On Wed, Jul 30, 2008 at 05:45:50PM +0530, Ankita Garg wrote:
> > > Hi,
> > > 
> > > We have been using RT kernel as the kdump kernel for most of our work.
> > > This was working fine on 2.6.21.4-rt10 RT kernel. After which, we moved
> > > over to 2.6.24-rt1 RT kernel. Now, the kdump kernel hangs. Attaching the
> > > complete kdump kernel boot log with initcall_debug option passed to the
> > > kdump kernel.
> > > 
> > Can you please look at /proc/iomem in first kernel and make sure that
> > ACPI related areas are being passed to second kernel at command line
> > and are visible in second kernel's memory map?
> >
> 
> Yes, I checked that the ACPI related areas are being being passed to the
> second kernel. Just to be sure, providing the output of /proc/iomem from
> first kernel and second kernel commandline:
> 
> > Again CCing acpi mailing list.
> > 
> 
> Am also attaching the kdump kernel boot log when the following
> commandline option was passed to it:
> 
> > 
> PCI: Scanning behind PCI bridge 0000:81:00.1, config 838381, pass 1
> PCI: Bus scan for 0000:81 returning with max=83
> PCI: Scanning behind PCI bridge 0000:80:0e.0, config 878480, pass 0
> PCI: Scanning bus 0000:84
> PCI: Found 0000:84:00.0 [10de/01b3] 000604 01
> PCI: Fixups for bus 0000:84
> PCI: Scanning behind PCI bridge 0000:84:00.0, config 878584, pass 0
> PCI: Scanning bus 0000:85
> PCI: Found 0000:85:00.0 [10de/01b3] 000604 01
> PCI: Found 0000:85:01.0 [10de/01b3] 000604 01
> PCI: Fixups for bus 0000:85
> PCI: Scanning behind PCI bridge 0000:85:00.0, config 868685, pass 0
> PCI: Scanning bus 0000:86
> PCI: Found 0000:86:00.0 [10de/029f] 000300 00
> 

Looking at the above (ie where the kdump kernel apparently hung) and the
output of lspci,

#lspci
<snip>
83:04.0 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10)
83:04.1 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10)
84:00.0 PCI bridge: nVidia Corporation Unknown device 01b3 (rev a3)
85:00.0 PCI bridge: nVidia Corporation Unknown device 01b3 (rev a3)
85:01.0 PCI bridge: nVidia Corporation Unknown device 01b3 (rev a3)
86:00.0 VGA compatible controller: nVidia Corporation G70 [Quadro FX
4500 X2] (rev a1)
87:00.0 3D controller: nVidia Corporation G70 [Quadro FX 4500 X2] (rev
a1)

we thought that the PCI subsystem initialization somehow did not succeed
for the last nvidia card. Tried on another 2-way Opteron Intellistation
without the above card, and found that kdump succeeded. Considering
this, any idea on what could be going wrong here? Is the driver code
even invoked at the time the pci subsystem is being initialized? 

-- 
Regards,
Ankita Garg (ankita at in.ibm.com)
Linux Technology Center
IBM India Systems & Technology Labs, 
Bangalore, India   



More information about the kexec mailing list