[RFC] Handling kernel stack overflows
Adrian Bunk
bunk at stusta.de
Fri Aug 3 09:30:54 EDT 2007
On Fri, Aug 03, 2007 at 02:05:53PM +1000, Keith Owens wrote:
>...
> Long answer:
>
> * Define a config option to control whether or not extra kernel stacks
> are to be used. Set this config option by default on i386 and
> x86_64, unless EMBEDDED is set, in which case it becomes a user
> selectable option. It can never be set on IA64, the 'struct task'
> embedded in the stack prevents that. Decide about other
> architectures as required.
>
> * Create a tunable number of extra kernel stacks on each cpu as it
> boots. They are created on each cpu to avoid taking all the memory
> from any one node.
None of this should be in any way user selectable.
We are currently having problems although distributions ship with
4k stacks. Any option offering less than the default distributions
ship with, in the worst case hidden under EMBEEDED, could be called
CONFIG_NONWORKING_KERNEL since it will be an untested configuration that
will no longer be working after some time.
>...
> * If the usage threshold has been reached, do down_trylock() on the
> counting semaphore. If all of the extra stacks are in use then
> down_trylock() will fail, log a rate limited error and return -EIO.
> The caller will get an I/O error, but that is far better than
> overflowing the kernel stack and crashing the entire machine.
>
> * At each critical point, if the config option is true we check how
> much of the current stack is in use. If that figure is less than a
> threshold value then continue with normal processing on the current
> stack, no change. Checking the stack usage requires an arch specific
> routine.
>...
Even with the current stack usage in the kernel the threshold value must
be at a value that you can't do this with only 4 kB of initial stack
since you'd have to use extra stack when you have 3 kB left.
Why?
Consider that the highest stack usage of a single function of a network
device driver (sic) is currently over 2 kB. [1]
And since relaxed stack usage rules will result in driver authors
becoming more sloppy, only 3 kB of free stack won't always be enough...
cu
Adrian
[1] http://lkml.org/lkml/2006/12/13/87
--
"Is there not promise of rain?" Ling Tan asked suddenly out
of the darkness. There had been need of rain for many days.
"Only a promise," Lao Er said.
Pearl S. Buck - Dragon Seed
More information about the kexec
mailing list