[Hypervisor Live Update] Notes from March 9, 2026

David Rientjes rientjes at google.com
Sat Mar 14 18:06:28 PDT 2026


Hi everybody,

Here are the notes from the last Hypervisor Live Update call that happened 
on Monday, March 9.  Thanks to everybody who was involved!

These notes are intended to bring people up to speed who could not attend 
the call as well as keep the conversation going in between meetings.

----->o-----
Pasha updated on stateless KHO: it's now in Andrew's tree and ready to go 
for the next merge window.  He noted that deferred struct page 
initialization was being worked on as a next step and then attention will 
shift to optimizing start up performance.  We want to make sure KHO is not 
the long pole.

Pratyush noted that there is a weird interaction between HVO, sparse init, 
and KHO.  HugeTLB pages must be allocated before sparse init but when in 
KHO boot we are in scratch only which wouldn't allow for allocating such 
large amounts of memory.  Pasha noted that there is an exception available 
where we can ensure that memory is coming from the preserved tree.  
Pratyush said we should parse the bitmaps and make the memblock 
reservations but we don't set the order in struct page, then turn off 
scratch and do the hugepage allocation and then parse the bitmap again to 
set the orders.

----->o-----
We briefly talked about KHO enlightenment for ASI which wasn't deemed to 
be super urgent.  ASI can be merged without KHO enlightenment so this 
could be a later extension.

----->o-----
Pasha noted that he received some comments from Pratyush on the luod 
design[1] that he would be iterating through.  We welcomed anybody else to 
continue to provide feedback as well directly on the doc.  Pasha discussed 
with Jork and there will be a three-way meeting that will determine what 
the intercept will be with systemd.

We discussed versioning support and when the timelines for landing the 
support would be.  Pasha suggested that we would need this when we start 
making strong guarantees about supporting live update from version to 
version.  Pratyush was working on HugeTLB preservation support ahead of 
this.

----->o-----
Pratyush updated on HugeTLB preservation support and how it would be split 
into multiple parts.  Part one is selftests that he sent out to the 
mailing list for review.  He is currently blocked on the ordering issue 
mentioned above.  Pratyush suggested upstream code review from upstream 
HugeTLB experts would be very useful on the list.  We discussed handling 
this at the next Linux MM Alignment Session on Wednesday, March 18.

----->o-----
David Matlack is prepping v3 of the VFIO support.  He was working on 
preserving VFIO file descriptors but not preserving a lot of the device 
state itself.  A question arose upstream about userspace being aware that 
we're in this development phase where files are preserved but not 
everything on the device is preserved.  He didn't see a long-term need for 
a UAPI that expresses that it's saving bits and pieces of a device or 
letting userspace control what the kernel preserves and what it does not 
preseve.  He was thinking of doing this under CONFIG_EXPERIMENTAL.

----->o-----
Sami received some feedback for IOMMU persistence but otherwise no major 
updates.  There were some patches sent from Intel that would be a small 
dependency for hitless replacement that Sami is looking at.

----->o-----
Pasha brought up plans for the next steps for live update while keeping 
CPUs running through the reboot, which will be a long-term discussion.  
This will need to involve KVM folks in the upstream discussion.  Jason 
said it would be very useful to demonstrate kexec live update with a very 
simple VM with no VFIO.  David suggested a KVM selftest would be useful 
for this type of demo.

----->o-----
Next meeting will be on Monday, March 23 at 8am PDT (UTC-7), everybody is
welcome: https://meet.google.com/rjn-dmzu-hgq

Topics for the next meeting:

 - luod design feedback and intersection with systemd; implementation and
   next steps
 - improving KHO and boot time performance, including deferred struct page
   initialization
 - HugeTLB preservation support including selftests
   + debrief discussion from Linux MM Alignment Session
 - VFIO patch series review from the list 
 - phase 1 of IOMMU persistence patch series, then phase 2 status and
   updates
 - hitless replacement for iommu domains and patches from Intel
 - guest_memfd HugeTLB enlightenment: conversion support, HugeTLB support
   without restructuring, and HugeTLB support with restructuring
 - later: versioning support for luod to negotiate
 - later: ordering issues when disabling interrupts based on feedback from
   Nvidia
 - later: update on PCI preservation series and next steps
 - later: KHO enlightenment for ASI

Please let me know if you'd like to propose additional topics for
discussion, thank you!

[1] https://tinyurl.com/luoddesign



More information about the kexec mailing list