[RFC PATCH 4/4] mm/vmalloc.c: Treat the entire kernel virtual space as vmalloc
Dave Hansen
dave.hansen at intel.com
Thu Nov 14 12:26:18 EST 2013
On 11/11/2013 03:26 PM, Laura Abbott wrote:
> With CONFIG_ENABLE_VMALLOC_SAVINGS, all lowmem is tracked in
> vmalloc. This means that all the kernel virtual address space
> can be treated as part of the vmalloc region. Allow vm areas
> to be allocated from the full kernel address range.
>
> Signed-off-by: Laura Abbott <lauraa at codeaurora.org>
> Signed-off-by: Neeti Desai <neetid at codeaurora.org>
> ---
> mm/vmalloc.c | 11 +++++++++++
> 1 files changed, 11 insertions(+), 0 deletions(-)
>
> diff --git a/mm/vmalloc.c b/mm/vmalloc.c
> index c7b138b..181247d 100644
> --- a/mm/vmalloc.c
> +++ b/mm/vmalloc.c
> @@ -1385,16 +1385,27 @@ struct vm_struct *__get_vm_area_caller(unsigned long size, unsigned long flags,
>   */
>  struct vm_struct *get_vm_area(unsigned long size, unsigned long flags)
>  {
> +#ifdef CONFIG_ENABLE_VMALLOC_SAVING
> +	return __get_vm_area_node(size, 1, flags, PAGE_OFFSET, VMALLOC_END,
> +				NUMA_NO_NODE, GFP_KERNEL,
> +				__builtin_return_address(0));
> +#else
>  	return __get_vm_area_node(size, 1, flags, VMALLOC_START, VMALLOC_END,
>  				NUMA_NO_NODE, GFP_KERNEL,
>  				__builtin_return_address(0));
> +#endif
>  }
>
>  struct vm_struct *get_vm_area_caller(unsigned long size, unsigned long flags,
>  				const void *caller)
>  {
> +#ifdef CONFIG_ENABLE_VMALLOC_SAVING
> +	return __get_vm_area_node(size, 1, flags, PAGE_OFFSET, VMALLOC_END,
> +				NUMA_NO_NODE, GFP_KERNEL, caller);
> +#else
>  	return __get_vm_area_node(size, 1, flags, VMALLOC_START, VMALLOC_END,
>  				NUMA_NO_NODE, GFP_KERNEL, caller);
> +#endif
>  }

Couple of nits: first of all, there's no reason to copy, paste, and
#ifdef this much code. This just invites one of the copies to bitrot.
I'd much rather see this:

#ifdef CONFIG_ENABLE_VMALLOC_SAVING
#define LOWEST_VMALLOC_VADDR PAGE_OFFSET
#else
#define LOWEST_VMALLOC_VADDR VMALLOC_START
#endif

Then just drop the #ifdef'd copies and pass LOWEST_VMALLOC_VADDR as the
start address in the single remaining call in each function.
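
To spell that out, here is a rough userspace sketch of the shape I mean.
It is not a kernel patch: mock_get_vm_area_node(), the cut-down
struct vm_struct, and the address values are all stand-ins made up for
illustration, and the real __get_vm_area_node() takes more arguments
than the mock does.

```c
#include <assert.h>

/* Hypothetical address-space layout, chosen only for illustration. */
#define PAGE_OFFSET   0xC0000000UL
#define VMALLOC_START 0xF0000000UL
#define VMALLOC_END   0xFF000000UL

/* Models the option being enabled; spelled as in the patch's #ifdef. */
#define CONFIG_ENABLE_VMALLOC_SAVING 1

#ifdef CONFIG_ENABLE_VMALLOC_SAVING
#define LOWEST_VMALLOC_VADDR PAGE_OFFSET
#else
#define LOWEST_VMALLOC_VADDR VMALLOC_START
#endif

/* Userspace stand-in; not the kernel's struct vm_struct. */
struct vm_struct {
	unsigned long size;
	unsigned long search_start;
	unsigned long search_end;
	const void *caller;
};

/* Mock of __get_vm_area_node(): only records the search window. */
static struct vm_struct *mock_get_vm_area_node(unsigned long size,
					       unsigned long start,
					       unsigned long end,
					       const void *caller)
{
	static struct vm_struct area;

	assert(start < end);
	area.size = size;
	area.search_start = start;
	area.search_end = end;
	area.caller = caller;
	return &area;
}

/* One call site per function; the #ifdef lives only in the macro above. */
struct vm_struct *get_vm_area(unsigned long size, unsigned long flags)
{
	(void)flags;	/* unused by the mock */
	return mock_get_vm_area_node(size, LOWEST_VMALLOC_VADDR, VMALLOC_END,
				     __builtin_return_address(0));
}

struct vm_struct *get_vm_area_caller(unsigned long size, unsigned long flags,
				     const void *caller)
{
	(void)flags;	/* unused by the mock */
	return mock_get_vm_area_node(size, LOWEST_VMALLOC_VADDR, VMALLOC_END,
				     caller);
}
```

With the option defined, both wrappers search from PAGE_OFFSET; with it
undefined they fall back to VMALLOC_START, and neither function body
has to change.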

Have you done any audits to make sure that the rest of the code that
deals with vmalloc addresses in the kernel is using is_vmalloc_addr()?
I'd be a bit worried that we might have picked up an assumption or two
that *all* vmalloc addresses are _above_ VMALLOC_START.

The percpu.c code looks like it might do this, and maybe the kcore code.
The vmalloc.c code itself has this in get_vmalloc_info():

> /*
>  * Some archs keep another range for modules in vmalloc space
>  */
> if (addr < VMALLOC_START)
> 	continue;

Seems like that would break as well.
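
To make the concern concrete, here is a small userspace sketch of that
kind of check. The struct area type, vmalloc_used(), and the addresses
are made up for illustration; this is not the real get_vmalloc_info()
and it only models the "skip anything below a lower bound" logic.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical address-space layout, chosen only for illustration. */
#define PAGE_OFFSET   0xC0000000UL
#define VMALLOC_START 0xF0000000UL
#define VMALLOC_END   0xFF000000UL

/* Simplified stand-in for one entry on the vmalloc area list. */
struct area {
	unsigned long addr;
	unsigned long size;
};

/*
 * Sum the sizes of all areas at or above 'lowest'. Passing
 * VMALLOC_START reproduces the quoted check; passing PAGE_OFFSET
 * models a lower bound that follows the new allocation range.
 */
static unsigned long vmalloc_used(const struct area *areas, size_t n,
				  unsigned long lowest)
{
	unsigned long used = 0;
	size_t i;

	assert(lowest < VMALLOC_END);
	for (i = 0; i < n; i++) {
		if (areas[i].addr < lowest)
			continue;	/* silently dropped from the total */
		used += areas[i].size;
	}
	return used;
}
```

Given one area at 0xC1000000 and one at 0xF0100000, the VMALLOC_START
bound counts only the second, while the PAGE_OFFSET bound counts both.
Whether areas in the old lowmem range *should* be counted is a separate
question, but the existing check would skip them without anyone
noticing.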
With this patch, VMALLOC_START loses enough of its meaning that I wonder
if we should even keep it around. It's the start of the _dedicated_
vmalloc space, but it's mostly useless and obscure enough that maybe we
should get rid of its use in common code.