<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

<html>

<head>

  <meta content="text/html;charset=us-ascii" http-equiv="Content-Type">

  <title></title>

</head>

<body bgcolor="#ffffff" text="#000000">

Vivek Goyal wrote:

<blockquote cite="mid20080418141028.GB1983@redhat.com" type="cite">

  <pre wrap="">On Thu, Apr 17, 2008 at 05:16:55PM -0700, Piet Delaney wrote:

  </pre>

  <blockquote type="cite">

    <pre wrap="">Hey Guys:

I've been using kgdb for a while with our 2.6.12 and now 2.6.16 kernel

as well as kdump/kexec with our 2.6.16 kernel. I'm a bit disappointed

with the visibility of local variables on the threads/tasks not currently

running on CPUs. Both crash, and the gdb macros that you guys wrote,

show the most important stuff but I'd prefer to be able to see everything

with gdb/ddd as I can with kgdb; including all local variables and formal

parameters at each stack frame.

A long time ago I used gdb on SunOS 4.1.4 and use to simply set $fp

and $sp from the saved information in the U-block to view a process.

I wish gdb would allow be to run your macros, btt for example, and extract

the stackp from task.thread.esp assign it temporally to $sp for the 

current task,

do the backtrace command and see everything. Changing $sp and $fp for a 

while

like I use to do with gdb on SunOS 4.1.4 and then using ddd+gdb to 

browse the

stack formals and locals would be nice. Just doing a 'set write on' 

isn't sufficient,

gdb wants a process and I can't see to satisfy it with simply setting 

the current

thread.

I was wondering if any of you guys have been thinking of anything like this

and had and hacks or ideas on how to see the locals and formals for all 

tasks.

One thought I had was a minor hack of the kexec code to do something 

like your gdb macros

and walk thru the task list and then append a ELF Notes, like done by 

crash_save_this_cpu(),

for each task. I have no idea if gdb has a limit on the number of 

elf_prstatus structures

that can be provided. I suppose I'd leave it a KEXEC config variable to 

enable this, as

some would argue that it's not as save as simply saving the regs for the 

active CPUs.

This would leave 'info threads' with gdb similar to 'ps' with crash and 

virtually identical

to the experience with kgdb.

    </pre>

  </blockquote>

  <pre wrap=""><!---->

IIUC, you are suggesting that we create elf notes even for non-active

tasks in vmcore.</pre>

</blockquote>

Yep.<br>

<br>

<blockquote cite="mid20080418141028.GB1983@redhat.com" type="cite">

  <pre wrap=""> We should not be doing that.

- It is not safe to traverse through task list after system has crashed.

  </pre>

</blockquote>

I agree it's not 100% safe, but for many developers it's a risk<br>

worth taking. For example we make a lot of changes in the<br>

TCP/IP stack for implementing a proxy for filtering network<br>

traffic. Virtually all bugs are ones we make in the networking<br>

code and having a precise view of each stack would be helpful.<br>

It's extremely unlikely that the task structures have been whacked<br>

in our case; and likely for many other developers.<br>

<br>

<blockquote cite="mid20080418141028.GB1983@redhat.com" type="cite">

  <pre wrap="">- We reserve the memory for elf notes at system boot. At that time we

  have no idea how many task system will have at the time of crash.

  </pre>

</blockquote>

How about reserving memory with each task structure for the<br>

ELF notes? With Solaris I reserved a page for each CPU to be<br>

able to store the register windows during a stack overflow so<br>

I could map it in during the stack overflow. It's not unreasonable<br>

to allocate the space with the task; especially if it's a kernel<br>

config option. If no one used it we could remove it and go for<br>

an alternate approach.<br>

<br>

<blockquote cite="mid20080418141028.GB1983@redhat.com" type="cite">

  <pre wrap="">

I think following can be a way forward for your requirement.

- Either gdb should provide a SunOS kind of facility where one can 

  provide stack pointer and switch the task context. ( I don't know

  if there is already a way to do that).

  </pre>

</blockquote>

I'll talk on the gdb mailing list about it. Perhaps we could<br>

initially get it in as a developer maintance function to allow <br>

the registers to be changed for a thread in a core dump. <br>

<br>

<blockquote cite="mid20080418141028.GB1983@redhat.com" type="cite">

  <pre wrap="">

- Or one can write a user space tool, which parses original vmcore,

  walks through task list, prepare elf notes for all the tasks and emit

  a new vmcore which is fetched to gdb.

  </pre>

</blockquote>

Sounds difficult. I was talking to Dave Anderson about having<br>

crash provide task info to his embedded gdb process but he<br>

wasn't supportive of that approach. <br>

<br>

Having a peace of code that has to walk thru the task list and<br>

modify the elf notes seems a lot harder than just having the kernel<br>

do it. More code that has to be maintained.&nbsp; <br>

<br>

Isn't there precedence with other kernels like freebsd and Solaris<br>

having the core dumps provide task information? If it was conditional<br>

we could see just how many developers find the risk of causing a problem<br>

during the crash dump a issue.<br>

<br>

Looks to me like the size of the ELF notes is quite small and wouldn't<br>

be a significant burden to add to the task structure.<br>

<br>

-piet<br>

<blockquote cite="mid20080418141028.GB1983@redhat.com" type="cite">

  <pre wrap="">

Thanks

Vivek

  </pre>

</blockquote>

<br>

</body>

</html>