[RFC V2] IMA Log Snapshotting Design Proposal

Tushar Sugandhi tusharsu at linux.microsoft.com
Tue Nov 21 17:01:38 PST 2023



On 11/16/23 14:28, Paul Moore wrote:
> On Tue, Oct 31, 2023 at 3:15 PM Mimi Zohar <zohar at linux.ibm.com> wrote:
>> On Thu, 2023-10-19 at 11:49 -0700, Tushar Sugandhi wrote:
>>
>> [...]
>>> -----------------------------------------------------------------------
>>> | C.1 Solution Summary                                                |
>>> -----------------------------------------------------------------------
>>> To achieve the goals described in the section above, we propose the
>>> following changes to the IMA subsystem.
>>>
>>>       a. The IMA log from Kernel memory will be offloaded to some
>>>          persistent storage disk to keep the system running reliably
>>>          without facing memory pressure.
>>>          More details, alternate approaches considered etc. are present
>>>          in section "D.3 Choices for Storing Snapshots" below.
>>>
>>>       b. The IMA log will be divided into multiple chunks (snapshots).
>>>          Each snapshot would be a delta between the two instances when
>>>          the log was offloaded from memory to the persistent storage
>>>          disk.
>>>
>>>       c. Some UM process (like a remote-attestation-client) will be
>>>          responsible for writing the IMA log snapshot to the disk.
>>>
>>>       d. The same UM process would be responsible for triggering the IMA
>>>          log snapshot.
>>>
>>>       e. There will be a well-known location for storing the IMA log
>>>          snapshots on the disk.  It will be non-trivial for UM processes
>>>          to change that location after booting into the Kernel.
>>>
>>>       f. A new event, "snapshot_aggregate", will be computed and measured
>>>          in the IMA log as part of this feature.  It should help the
>>>          remote-attestation client/service to benefit from the IMA log
>>>          snapshot feature.
>>>          The "snapshot_aggregate" event is described in more details in
>>>          section "D.1 Snapshot Aggregate Event" below.
>>>
>>>       g. If the existing remote-attestation client/services do not change
>>>          to benefit from this feature or do not trigger the snapshot,
>>>          the Kernel will continue to have it's current functionality of
>>>          maintaining an in-memory full IMA log.
>>>
>>> Additionally, the remote-attestation client/services need to be updated
>>> to benefit from the IMA log snapshot feature.  These proposed changes
>>>
>>> are described in section "D.4 Remote-Attestation Client/Service Side
>>> Changes" below, but their implementation is out of scope for this
>>> proposal.
>>
>> As previously said on v1,
>>     This design seems overly complex and requires synchronization between the
>>     "snapshot" record and exporting the records from the measurement list. [...]
>>
>>     Concerns:
>>     - Pausing extending the measurement list.
>>
>> Nothing has changed in terms of the complexity or in terms of pausing
>> the measurement list.   Pausing the measurement list is a non starter.
> 
> The measurement list would only need to be paused for the amount of
> time it would require to generate the snapshot_aggregate entry, which
> should be minimal and only occurs when a privileged userspace requests
> a snapshot operation.  The snapshot remains opt-in functionality, and
> even then there is the possibility that the kernel could reject the
> snapshot request if generating the snapshot_aggregate entry was deemed
> too costly (as determined by the kernel) at that point in time.
> 
Thanks Paul for responding and sharing your thoughts.


Hi Mimi,
To address your concern about pausing the measurements -
We are not proposing to pause the measurements for the entire duration
of UM <--> Kernel interaction while taking a snapshot.

We are simply proposing to pause the measurements when we get the TPM
PCR quotes to add them to "snapshot_aggregate". (which should be a very
small time window). IMA already has this mechanism when two separate
modules try to add entry to IMA log - by using
mutex_lock(&ima_extend_list_mutex); in ima_add_template_entry.


We plan to use this existing locking functionality.
Hope this addresses your concern about pausing extending the measurement
list.

~Tushar

>> Userspace can already export the IMA measurement list(s) via the
>> securityfs {ascii,binary}_runtime_measurements file(s) and do whatever
>> it wants with it.  All that is missing in the kernel is the ability to
>> trim the measurement list, which doesn't seem all that complicated.
> 
>>From my perspective what has been presented is basically just trimming
> the in-memory measurement log, the additional complexity (which really
> doesn't look that bad IMO) is there to ensure robustness in the face
> of an unreliable userspace (processes die, get killed, etc.) and to
> establish a new, transitive root of trust in the newly trimmed
> in-memory log.
> 
> I suppose one could simplify things greatly by having a design where
> userspace  captures the measurement log and then writes the number of
> measurement records to trim from the start of the measurement log to a
> sysfs file and the kernel acts on that.  You could do this with, or
> without, the snapshot_aggregate entry concept; in fact that could be
> something that was controlled by userspace, e.g. write the number of
> lines and a flag to indicate if a snapshot_aggregate was desired to
> the sysfs file.  I can't say I've thought it all the way through to
> make sure there are no gotchas, but I'm guessing that is about as
> simple as one can get.
> 
> If there is something else you had in mind, Mimi, please share the
> details.  This is a very real problem we are facing and we want to
> work to get a solution upstream.
> 



More information about the kexec mailing list