[MAKEDUMPFILE PATCH] makedumpfile: elf_info: check for invalid physical address when finding max_paddr
HAGIO KAZUHITO(萩尾 一仁)
k-hagio-ab at nec.com
Wed Jun 16 20:28:10 PDT 2021
-----Original Message-----
>
> Kernel commit 464920104bf7adac12722035bfefb3d772eb04d8
> "/proc/kcore: update physical address for kcore ram and text" sets an
> invalid paddr (-1=0xffffffffffffffff) for PT_LOAD segments of regions
> that are not direct mapped:
> $ readelf -l /proc/kcore
>
> Elf file type is CORE (Core file)
> Entry point 0x0
> There are 4 program headers, starting at offset 64
>
> Program Headers:
>   Type           Offset             VirtAddr           PhysAddr
>                  FileSiz            MemSiz              Flags  Align
>   NOTE           0x0000000000000120 0x0000000000000000 0x0000000000000000
>                  0x0000000000002320 0x0000000000000000         0x0
>   LOAD           0x1000000000010000 0xd000000000000000 0xffffffffffffffff
>                                                        ^^^^^^^^^^^^^^^^^^
>                  0x0001f80000000000 0x0001f80000000000  RWE    0x10000
>   LOAD           0x0000000000010000 0xc000000000000000 0x0000000000000000
>                  0x00000003f0000000 0x00000003f0000000  RWE    0x10000
>   LOAD           0x3000000000010000 0xf000000000000000 0xffffffffffffffff
>                                                        ^^^^^^^^^^^^^^^^^^
>                  0x0000000000fc0000 0x0000000000fc0000  RWE    0x10000
>
> makedumpfile uses max_paddr to calculate the number of sections for the
> sparse memory model, so a wrong number is obtained when max_paddr=-1.
> This error can cause copying /proc/kcore to fail for RHEL-8.5 on a
> ppc64le machine [1]:
> $ uname -r
> 4.18.0-312.el8.ppc64le
> $ makedumpfile /proc/kcore vmcore1
> get_mem_section: Could not validate mem_section.
> get_mm_sparsemem: Can't get the address of mem_section.
>
> makedumpfile Failed.
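To make the effect of a bogus max_paddr concrete, here is a rough sketch of the section-count arithmetic. It is not makedumpfile's actual code: the helper name sections_for() is hypothetical, and the 16 MiB section size (SECTION_SIZE_BITS = 24) is an assumption that I believe matches ppc64 but have not checked against this particular kernel.

```c
#include <stdint.h>

/* Assumption: 16 MiB sparsemem sections (SECTION_SIZE_BITS = 24). */
#define SECTION_SIZE_BITS 24

/*
 * Hypothetical helper: number of sections needed to cover the
 * physical range [0, max_paddr), rounded up to whole sections.
 */
static uint64_t sections_for(uint64_t max_paddr)
{
	uint64_t section_size = UINT64_C(1) << SECTION_SIZE_BITS;

	return max_paddr / section_size + (max_paddr % section_size != 0);
}
```

Under that assumption, the one valid segment in the readelf output above (paddr 0x0, size 0x3f0000000) needs 1008 sections, while max_paddr = 0xffffffffffffffff asks for 2^40 of them, which is far beyond what the kernel's mem_section table can describe.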
IIUC, makedumpfile has NOT been developed to filter /proc/kcore so far,
except for the --mem-usage option, and this is the first patch for it.
It might be more handy than crash's snap.so extension, so I applied this:
https://github.com/makedumpfile/makedumpfile/commit/38d921a2ef50ebd36258097553626443ffe27496
Please note, though, that filtering /proc/kcore with makedumpfile (and
analyzing the resulting dumpfile with crash) may not yet be mature
enough for field or production use.
Thanks,
Kazu
>
> Let's check if the phys_start of the segment is a valid physical address
> to fix this problem.
>
> [1] https://bugzilla.redhat.com/show_bug.cgi?id=1965267
>
> Reported-by: Xiaoying Yan <yiyan at redhat.com>
> Signed-off-by: Coiby Xu <coxu at redhat.com>
> ---
> elf_info.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/elf_info.c b/elf_info.c
> index e8affb7..9444847 100644
> --- a/elf_info.c
> +++ b/elf_info.c
> @@ -628,7 +628,7 @@ get_max_paddr(void)
>
>  	for (i = 0; i < num_pt_loads; i++) {
>  		pls = &pt_loads[i];
> -		if (max_paddr < pls->phys_end)
> +		if (pls->phys_start != NOT_PADDR && max_paddr < pls->phys_end)
>  			max_paddr = pls->phys_end;
>  	}
>  	return max_paddr;
> --
> 2.31.1
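For readers who want to try the guarded loop outside the makedumpfile tree, the following is a minimal standalone sketch of the same idea. The struct layout and the function name max_valid_paddr() are hypothetical simplifications, and defining NOT_PADDR as UINT64_MAX is an assumption based on the -1 value shown in the readelf output, not a copy of makedumpfile's own definition.

```c
#include <stddef.h>
#include <stdint.h>

/* Assumption: "no valid physical address" is encoded as all bits set. */
#define NOT_PADDR UINT64_MAX

/* Simplified, hypothetical stand-in for makedumpfile's PT_LOAD record. */
struct pt_load_segment {
	uint64_t phys_start;
	uint64_t phys_end;
};

/* Highest phys_end among segments whose phys_start is a valid address. */
static uint64_t max_valid_paddr(const struct pt_load_segment *segs, size_t n)
{
	uint64_t max_paddr = 0;

	for (size_t i = 0; i < n; i++) {
		/* Skip segments that are not direct mapped. */
		if (segs[i].phys_start == NOT_PADDR)
			continue;
		if (max_paddr < segs[i].phys_end)
			max_paddr = segs[i].phys_end;
	}
	return max_paddr;
}
```

Fed the three LOAD segments from the report, with the two invalid ones marked by phys_start = NOT_PADDR, this returns the end of the direct-mapped segment (0x3f0000000) whatever the invalid segments carry in phys_end.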