[PATCH 2/2] Check PG_swapbacked for swap cache pages

Petr Tesarik ptesarik at suse.cz
Tue Apr 24 03:43:25 PDT 2018


Hi Masaki-san,

what is the status of this patch? Do you have any concerns?

TIA,
Petr T

On Fri, 13 Apr 2018 18:29:59 +0200
Petr Tesarik <ptesarik at suse.cz> wrote:

> When page cache is filtered out (dump level bitmap includes 2 or 4),
> makedumpfile checks the PG_swapcache bit, but since kernel commit
> 6326fec1122cde256bd2a8c63f2606e08e44ce1d (v4.10-rc1~7) this bit is
> an alias for PG_owner_priv_1, which is also used by filesystem
> code (PG_checked) and Xen (PG_pinned and PG_foreign).
> 
> With these kernels, the PG_swapcache flag is valid only if
> PG_swapbacked is set. A Linux kernel patch has already been
> submitted to export the value of PG_swapbacked in VMCOREINFO.
> 
> Since there are released kernels in the wild which do not export the
> value, a fallback is implemented. I considered these three situations:
> 
>   1. Kernels before v2.6.28-rc1~244:
>      PG_swapbacked does not exist, so it must not be checked.
>      Instead, check PG_swapcache, which is never overloaded for
>      another purpose.
> 
>   2. Kernels between v2.6.28-rc1~244 and v4.10-rc1~7:
>      It is sufficient to check only PG_swapcache, but PG_swapbacked
>      may also be checked (it is always set if PG_swapcache is set).
> 
>   3. Kernels since v4.10-rc1~7:
>      PG_swapbacked must be checked.
> 
> If PG_swapbacked value is known (exported or read from debuginfo),
> it is always safe to use it (case 2 or 3). If PG_swapbacked is not
> known, it is safe to ignore it for cases 1 and 2, but not 3.
> Thankfully, the new value of PG_swapcache (since v4.10-rc1~7) is
> less than PG_private (which is known), whereas the old value had
> always been greater than PG_private. Moreover, the flags between
> PG_private and PG_swapbacked haven't changed since v4.10-rc1~7, so
> PG_swapbacked can fall back to PG_private + 6 if unknown.
> 
> Without this patch, all Xen dumps are unusable, because PG_pinned is
> set for all page table pages.
> 
> Signed-off-by: Petr Tesarik <ptesarik at suse.com>
> ---
>  makedumpfile.c | 19 ++++++++++++++++++-
>  makedumpfile.h |  2 ++
>  2 files changed, 20 insertions(+), 1 deletion(-)
> 
> diff --git a/makedumpfile.c b/makedumpfile.c
> index 175ba68..ec04a88 100644
> --- a/makedumpfile.c
> +++ b/makedumpfile.c
> @@ -252,7 +252,18 @@ isHugetlb(unsigned long dtor)
>  static int
>  is_cache_page(unsigned long flags)
>  {
> -	return isLRU(flags) || isSwapCache(flags);
> +	if (isLRU(flags))
> +		return TRUE;
> +
> +	/* PG_swapcache is valid only if:
> +	 *   a. PG_swapbacked bit is set, or
> +	 *   b. PG_swapbacked did not exist (kernels before 4.10-rc1).
> +	 */
> +	if ((NUMBER(PG_swapbacked) == NOT_FOUND_NUMBER || isSwapBacked(flags))
> +	    && isSwapCache(flags))
> +		return TRUE;
> +
> +	return FALSE;
>  }
>  
>  static inline unsigned long
> @@ -1735,6 +1746,7 @@ get_structure_info(void)
>  	ENUM_NUMBER_INIT(PG_lru, "PG_lru");
>  	ENUM_NUMBER_INIT(PG_private, "PG_private");
>  	ENUM_NUMBER_INIT(PG_swapcache, "PG_swapcache");
> +	ENUM_NUMBER_INIT(PG_swapbacked, "PG_swapbacked");
>  	ENUM_NUMBER_INIT(PG_buddy, "PG_buddy");
>  	ENUM_NUMBER_INIT(PG_slab, "PG_slab");
>  	ENUM_NUMBER_INIT(PG_hwpoison, "PG_hwpoison");
> @@ -1988,6 +2000,9 @@ get_value_for_old_linux(void)
>  		NUMBER(PG_private) = PG_private_ORIGINAL;
>  	if (NUMBER(PG_swapcache) == NOT_FOUND_NUMBER)
>  		NUMBER(PG_swapcache) = PG_swapcache_ORIGINAL;
> +	if (NUMBER(PG_swapbacked) == NOT_FOUND_NUMBER
> +	    && NUMBER(PG_swapcache) < NUMBER(PG_private))
> +		NUMBER(PG_swapbacked) = NUMBER(PG_private) + 6;
>  	if (NUMBER(PG_slab) == NOT_FOUND_NUMBER)
>  		NUMBER(PG_slab) = PG_slab_ORIGINAL;
>  	if (NUMBER(PG_head_mask) == NOT_FOUND_NUMBER)
> @@ -2264,6 +2279,7 @@ write_vmcoreinfo_data(void)
>  	WRITE_NUMBER("PG_private", PG_private);
>  	WRITE_NUMBER("PG_head_mask", PG_head_mask);
>  	WRITE_NUMBER("PG_swapcache", PG_swapcache);
> +	WRITE_NUMBER("PG_swapbacked", PG_swapbacked);
>  	WRITE_NUMBER("PG_buddy", PG_buddy);
>  	WRITE_NUMBER("PG_slab", PG_slab);
>  	WRITE_NUMBER("PG_hwpoison", PG_hwpoison);
> @@ -2658,6 +2674,7 @@ read_vmcoreinfo(void)
>  	READ_NUMBER("PG_private", PG_private);
>  	READ_NUMBER("PG_head_mask", PG_head_mask);
>  	READ_NUMBER("PG_swapcache", PG_swapcache);
> +	READ_NUMBER("PG_swapbacked", PG_swapbacked);
>  	READ_NUMBER("PG_slab", PG_slab);
>  	READ_NUMBER("PG_buddy", PG_buddy);
>  	READ_NUMBER("PG_hwpoison", PG_hwpoison);
> diff --git a/makedumpfile.h b/makedumpfile.h
> index 6205ef3..fe306bb 100644
> --- a/makedumpfile.h
> +++ b/makedumpfile.h
> @@ -155,6 +155,7 @@ test_bit(int nr, unsigned long addr)
>  #define isPrivate(flags)	test_bit(NUMBER(PG_private), flags)
>  #define isCompoundHead(flags)   (!!((flags) & NUMBER(PG_head_mask)))
>  #define isSwapCache(flags)	test_bit(NUMBER(PG_swapcache), flags)
> +#define isSwapBacked(flags)	test_bit(NUMBER(PG_swapbacked), flags)
>  #define isHWPOISON(flags)	(test_bit(NUMBER(PG_hwpoison), flags) \
>  				&& (NUMBER(PG_hwpoison) != NOT_FOUND_NUMBER))
>  
> @@ -1881,6 +1882,7 @@ struct number_table {
>  	long	PG_head;
>  	long	PG_head_mask;
>  	long	PG_swapcache;
> +	long	PG_swapbacked;
>  	long	PG_buddy;
>  	long	PG_slab;
>  	long    PG_hwpoison;




More information about the kexec mailing list