[PATCH 3/3] makedumpfile: use cycle detection when parsing the prink log_buf

Thu Mar 10 06:33:44 PST 2022

Hi Dave,

On Wed, 9 Mar 2022 03:48:12 -0500
David Wysochanski <dwysocha at redhat.com> wrote:

> On Mon, Mar 7, 2022 at 12:23 PM Philipp Rudo <prudo at redhat.com> wrote:
> >
> > The old printk mechanism (> v3.5.0 and < v5.10.0) had a fixed size
> > buffer (log_buf) that contains all messages. The location for the next
> > message is stored in log_next_idx. In case the log_buf runs full
> > log_next_idx wraps around and starts overwriting old messages at the
> > beginning of the buffer. The wraparound is denoted by a message with
> > msg->len == 0.
> >
> > Following the behavior described above blindly in makedumpfile is
> > dangerous as e.g. a memory corruption could overwrite (parts of) the
> > log_buf. If the corruption adds a message with msg->len == 0 this leads
> > to an endless loop when dumping the dmesg with makedumpfile appending
> > the messages up to the corruption over and over again to the output file
> > until file system is full. Fix this by using cycle detection and aboard
> > once one is detected.
> >
> > While at it also verify that the index is within the log_buf and thus
> > guard against corruptions with msg->len != 0.
> >
> > Fixes: 36c2458 ("[PATCH] --dump-dmesg fix for post 3.5 kernels.")
> > Reported-by: Audra Mitchell <aubaker at redhat.com>
> > Suggested-by: Dave Wysochanski <dwysocha at redhat.com>
> > Signed-off-by: Philipp Rudo <prudo at redhat.com>
> > ---
> >  makedumpfile.c | 42 ++++++++++++++++++++++++++++++++++++++++--
> >  1 file changed, 40 insertions(+), 2 deletions(-)
> >
> > diff --git a/makedumpfile.c b/makedumpfile.c
> > index edf128b..2738d16 100644
> > --- a/makedumpfile.c
> > +++ b/makedumpfile.c
> > @@ -15,6 +15,7 @@
> >   */
> >  #include "makedumpfile.h"
> >  #include "print_info.h"
> > +#include "detect_cycle.h"
> >  #include "dwarf_info.h"
> >  #include "elf_info.h"
> >  #include "erase_info.h"
> > @@ -5528,10 +5529,11 @@ dump_dmesg()
> >         unsigned long index, log_buf, log_end;
> >         unsigned int log_first_idx, log_next_idx;
> >         unsigned long long first_idx_sym;
> > +       struct detect_cycle *dc = NULL;
> >         unsigned long log_end_2_6_24;
> >         unsigned      log_end_2_6_25;
> >         char *log_buffer = NULL, *log_ptr = NULL;
> > -       char *idx;
> > +       char *idx, *next_idx;
> >  
> 
> Would be clearer to call the above "next_ptr" rather than "next_idx"
> (as far as I know 'index' refers to 32-bit quantities).
> Same comment about the "idx" variable, maybe "ptr"?

Hmm... I stuck with idx as the kernel uses the same name. In my
opinion using the same name makes it easier to see that both variables
contain the same "quantity" even when the implementation is slightly
different (in the kernel idx is the offset in the log_buf just like it
was in makedumpfile before patch 2). But my opinion isn't very strong
on the naming. So when the consent is to rename the variable I'm open
to do it.

@Kazu: Do you have a preference here?

Same for your comments in patch 2.

> >         /*
> >          * log_end has been changed to "unsigned" since linux-2.6.25.
> > @@ -5679,12 +5681,47 @@ dump_dmesg()
> >                         goto out;
> >                 }
> >                 idx = log_buffer + log_first_idx;
> > +               dc = dc_init(idx, log_buffer, log_next);
> >                 while (idx != log_buffer + log_next_idx) {
> >                         log_ptr = log_from_idx(idx, log_buffer);
> >                         if (!dump_log_entry(log_ptr, info->fd_dumpfile,
> >                                             info->name_dumpfile))
> >                                 goto out;
> > -                       idx = log_next(idx, log_buffer);
> > +                       if (dc_next(dc, (void **) &next_idx)) {
> > +                               unsigned long len;
> > +                               char *first;
> > +
> > +                               /* Clear everything we have already written... */
> > +                               ftruncate(info->fd_dumpfile, 0);
> > +                               lseek(info->fd_dumpfile, 0, SEEK_SET);
> > +  
> 
> I'm not sure I understand why you're doing this.

That's because in every pass of the loop the entry is written to the
output file. So most likely the output file already contains more
than needed. Thus we somehow need to trim the file to the end of
the cycle. Thus I decided to go brute force and simply clear all
content from the file and write all we need a second time.

In our very specific case there are 2704 lines to the corruption. That
means that the function has already written 6798 (4095 + 2704 - 1) lines
till it detects the cycle. 

> > +                               /* ...and only write up to the corruption. */
> > +                               dc_find_start(dc, (void **) &first, &len);
> > +                               idx = log_buffer + log_first_idx;
> > +                               while (len) {  
> 
> I don't think this is correct.  It looks like "len" is the length of
> the loop segment, correct?
> But don't you want to print the whole buffer until the corruption?
> That means you need to print both the non-loop segment plus the loop segment.
> With the 'while(len)' you're only printing # of entries == loop segment.
> Look at the diagram here:
> https://listman.redhat.com/archives/crash-utility/2018-July/007582.html

True... I'll fix it in v2...

Thanks
Philipp

> 
> > +                                       log_ptr = log_from_idx(idx, log_buffer);
> > +                                       if (!dump_log_entry(log_ptr,
> > +                                                           info->fd_dumpfile,
> > +                                                           info->name_dumpfile))
> > +                                               goto out;
> > +                                       idx = log_next(idx, log_buffer);
> > +                                       len--;
> > +                               }
> > +                               ERRMSG("Cycle when parsing dmesg detected.\n");
> > +                               ERRMSG("The printk log_buf is most likely corrupted.\n");
> > +                               ERRMSG("log_buf = 0x%lx, idx = 0x%lx\n", log_buf, idx - log_buffer);
> > +                               close_files_for_creating_dumpfile();
> > +                               goto out;
> > +                       }
> > +                       if (next_idx < log_buffer ||
> > +                           next_idx > log_buffer + log_buf_len - SIZE(printk_log)) {
> > +                               ERRMSG("Index outside log_buf detected.\n");
> > +                               ERRMSG("The printk log_buf is most likely corrupted.\n");
> > +                               ERRMSG("log_buf = 0x%lx, idx = 0x%lx\n", log_buf, idx - log_buffer);
> > +                               close_files_for_creating_dumpfile();
> > +                               goto out;
> > +                       }
> > +                       idx = next_idx;
> >                 }
> >                 if (!close_files_for_creating_dumpfile())
> >                         goto out;
> > @@ -5694,6 +5731,7 @@ dump_dmesg()
> >  out:
> >         if (log_buffer)
> >                 free(log_buffer);
> > +       free(dc);
> >
> >         return ret;
> >  }
> > --
> > 2.35.1
> >  
>