[Hypervisor Live Update] Notes from March 9, 2026
David Rientjes
rientjes at google.com
Sat Mar 14 18:06:28 PDT 2026
Hi everybody,
Here are the notes from the last Hypervisor Live Update call that happened
on Monday, March 9. Thanks to everybody who was involved!
These notes are intended to bring people up to speed who could not attend
the call as well as keep the conversation going in between meetings.
----->o-----
Pasha updated on stateless KHO: it's now in Andrew's tree and ready to go
for the next merge window. He noted that deferred struct page
initialization was being worked on as a next step and then attention will
shift to optimizing start up performance. We want to make sure KHO is not
the long pole.
Pratyush noted that there is a weird interaction between HVO, sparse init,
and KHO. HugeTLB pages must be allocated before sparse init but when in
KHO boot we are in scratch only which wouldn't allow for allocating such
large amounts of memory. Pasha noted that there is an exception available
where we can ensure that memory is coming from the preserved tree.
Pratyush said we should parse the bitmaps and make the memblock
reservations but we don't set the order in struct page, then turn off
scratch and do the hugepage allocation and then parse the bitmap again to
set the orders.
----->o-----
We briefly talked about KHO enlightenment for ASI which wasn't deemed to
be super urgent. ASI can be merged without KHO enlightenment so this
could be a later extension.
----->o-----
Pasha noted that he received some comments from Pratyush on the luod
design[1] that he would be iterating through. We welcomed anybody else to
continue to provide feedback as well directly on the doc. Pasha discussed
with Jork and there will be a three-way meeting that will determine what
the intercept will be with systemd.
We discussed versioning support and when the timelines for landing the
support would be. Pasha suggested that we would need this when we start
making strong guarantees about supporting live update from version to
version. Pratyush was working on HugeTLB preservation support ahead of
this.
----->o-----
Pratyush updated on HugeTLB preservation support and how it would be split
into multiple parts. Part one is selftests that he sent out to the
mailing list for review. He is currently blocked on the ordering issue
mentioned above. Pratyush suggested upstream code review from upstream
HugeTLB experts would be very useful on the list. We discussed handling
this at the next Linux MM Alignment Session on Wednesday, March 18.
----->o-----
David Matlack is prepping v3 of the VFIO support. He was working on
preserving VFIO file descriptors but not preserving a lot of the device
state itself. A question arose upstream about userspace being aware that
we're in this development phase where files are preserved but not
everything on the device is preserved. He didn't see a long-term need for
a UAPI that expresses that it's saving bits and pieces of a device or
letting userspace control what the kernel preserves and what it does not
preseve. He was thinking of doing this under CONFIG_EXPERIMENTAL.
----->o-----
Sami received some feedback for IOMMU persistence but otherwise no major
updates. There were some patches sent from Intel that would be a small
dependency for hitless replacement that Sami is looking at.
----->o-----
Pasha brought up plans for the next steps for live update while keeping
CPUs running through the reboot, which will be a long-term discussion.
This will need to involve KVM folks in the upstream discussion. Jason
said it would be very useful to demonstrate kexec live update with a very
simple VM with no VFIO. David suggested a KVM selftest would be useful
for this type of demo.
----->o-----
Next meeting will be on Monday, March 23 at 8am PDT (UTC-7), everybody is
welcome: https://meet.google.com/rjn-dmzu-hgq
Topics for the next meeting:
- luod design feedback and intersection with systemd; implementation and
next steps
- improving KHO and boot time performance, including deferred struct page
initialization
- HugeTLB preservation support including selftests
+ debrief discussion from Linux MM Alignment Session
- VFIO patch series review from the list
- phase 1 of IOMMU persistence patch series, then phase 2 status and
updates
- hitless replacement for iommu domains and patches from Intel
- guest_memfd HugeTLB enlightenment: conversion support, HugeTLB support
without restructuring, and HugeTLB support with restructuring
- later: versioning support for luod to negotiate
- later: ordering issues when disabling interrupts based on feedback from
Nvidia
- later: update on PCI preservation series and next steps
- later: KHO enlightenment for ASI
Please let me know if you'd like to propose additional topics for
discussion, thank you!
[1] https://tinyurl.com/luoddesign
More information about the kexec
mailing list