[PATCH] ARM64: KVM: Fix coherent_icache_guest_page() for host with external L3-cache.
Catalin Marinas
catalin.marinas at arm.com
Thu Aug 29 06:52:35 EDT 2013
Hi Anup,
Jumping late into this thread (holidays).
On Fri, Aug 16, 2013 at 07:57:55AM +0100, Anup Patel wrote:
> The approach of flushing d-cache by set/way upon first run of VCPU will
> not work because for set/way operations ARM ARM says: "For set/way
> operations, and for All (entire cache) operations, the point is defined to be
> to the next level of caching". In other words, set/way operations work upto
> point of unification.
I don't understand where you got the idea that set/way operations work
up to the point of unification. This is incorrect, the set/way
operations work on the level of cache specified by bits 3:1 in the
register passed to the DC CISW instruction. For your L3 cache, those
bits would be 2 (and __flush_dcache_all() implementation does this
dynamically).
For the I-cache all operation, that's correct, it only invalidates to
the PoU but it doesn't need to go any further, you do the D-cache
maintenance for this (no point in duplicating functionality).
> Also, Guest Linux already does __flush_dcache_all() from __cpu_setup()
> at bootup time which does not work for us when L3-cache is enabled so,
> there is no point is adding one more __flush_dcache_all() upon first run of
> VCPU in KVM ARM64.
Do you mean the CPU is not aware of the L3 cache? Does CLIDR_EL1 report
an L3 cache on your implementation?
It's not clear to me whether your L3 cache is inner or outer (or a mix).
You say that D-cache maintenance to PoC flushes the L3 which looks to me
like an inner cache, in which cache it should be reported in the LoC
bits in CLIDR_EL1.
> IMHO, we are left with following options:
> 1. Flush all RAM regions of VCPU using __flush_dcache_range()
> upon first run of VCPU
> 2. Implement outer-cache framework for ARM64 and flush all
> caches + outer cache (i.e. L3-cache) upon first run of VCPU
Do you have specific instructions for flushing the L3 cache only? It's
not clear to me what an outer-cache framework would to on AArch64. It
was added on AArch32 for the L2x0/PL310 which need separate instructions
by physical address for flushing the cache. I really hope we don't get
these again on ARMv8 hardware.
> 3. Use an alternate version of flush_icache_range() which will
> flush d-cache by PoC instead of PoU. We can also ensure
> that coherent_icache_guest_page() function will be called
> upon Stage2 prefetch aborts only.
flush_icache_range() is meant to flush only to the PoU to ensure the I-D
cache coherency. I don't think we should change this.
--
Catalin
More information about the linux-arm-kernel
mailing list