[PATCH v2 1/3] cacheinfo: Set cache 'id' based on DT data
Jonathan Cameron
Jonathan.Cameron at huawei.com
Mon Jul 7 05:32:07 PDT 2025
On Mon, 7 Jul 2025 11:27:06 +0100
Ben Horgan <ben.horgan at arm.com> wrote:
> Hi James,
>
> On 7/4/25 18:38, James Morse wrote:
> > From: Rob Herring <robh at kernel.org>
> >
> > Use the minimum CPU h/w id of the CPUs associated with the cache for the
> > cache 'id'. This will provide a stable id value for a given system. As
> > we need to check all possible CPUs, we can't use the shared_cpu_map
> > which is just online CPUs. As there's not a cache to CPUs mapping in DT,
> > we have to walk all CPU nodes and then walk cache levels.
> >
> > The cache_id exposed to user-space has historically been 32 bits, and
> > is too late to change. This value is parsed into a u32 by user-space
> > libraries such as libvirt:
> > https://github.com/libvirt/libvirt/blob/master/src/util/virresctrl.c#L1588
> >
> > Give up on assigning cache-id's if a CPU h/w id greater than 32 bits
> > is found.
> >
> > Cc: Greg Kroah-Hartman <gregkh at linuxfoundation.org>
> > Cc: "Rafael J. Wysocki" <rafael at kernel.org>
> > Signed-off-by: Rob Herring <robh at kernel.org>
> > [ ben: converted to use the __free cleanup idiom ]
> > Signed-off-by: Ben Horgan <ben.horgan at arm.com>
> > [ morse: Add checks to give up if a value larger than 32 bits is seen. ]
> > Signed-off-by: James Morse <james.morse at arm.com>
> > ---
> > Use as a 32bit value has also been seen in DPDK patches here:
> > http://inbox.dpdk.org/dev/20241021015246.304431-2-wathsala.vithanage@arm.com/
> >
> > Changes since v1:
> > * Remove the second loop in favour of a helper.
> >
> > An open question from v1 is whether it would be preferable to use an
> > index into the DT of the CPU nodes instead of the hardware id. This would
> > save an arch specific swizzle - but the numbers would change if the DT
> > were changed. This scheme isn't sensitive to the order of DT nodes.
> >
> > ---
> > drivers/base/cacheinfo.c | 38 ++++++++++++++++++++++++++++++++++++++
> > 1 file changed, 38 insertions(+)
> >
> > diff --git a/drivers/base/cacheinfo.c b/drivers/base/cacheinfo.c
> > index cf0d455209d7..df593da0d5f7 100644
> > --- a/drivers/base/cacheinfo.c
> > +++ b/drivers/base/cacheinfo.c
> > @@ -8,6 +8,7 @@
> > #define pr_fmt(fmt) KBUILD_MODNAME ": " fmt
> >
> > #include <linux/acpi.h>
> > +#include <linux/bitfield.h>
> > #include <linux/bitops.h>
> > #include <linux/cacheinfo.h>
> > #include <linux/compiler.h>
> > @@ -183,6 +184,42 @@ static bool cache_node_is_unified(struct cacheinfo *this_leaf,
> > return of_property_read_bool(np, "cache-unified");
> > }
> >
> > +static bool match_cache_node(struct device_node *cpu,
> > + const struct device_node *cache_node)
> > +{
> > + for (struct device_node *cache __free(device_node) = of_find_next_cache_node(cpu);
> Looks like the creation of this helper function has upset the
> device_node reference counting. This first __free(device_node) will only
> cause of_node_put() to be called in the case of the early return from
> the loop. You've dropped the second __free(device_node) which accounts
> for 'cache' changing on each iteration.
Good catch - this behaves differently from many of the of_get_next* type
helpers in that it doesn't drop the reference to the previous iteration
within the call.
Maybe it should?
I checked a few of the call sites and some would be simplified if it did
others would need some more complex restructuring but might benefit as
well.
> > + cache != NULL; cache = of_find_next_cache_node(cache)) {
> > + if (cache == cache_node)
> > + return true;
> > + }
> > +
> > + return false;
> > +}
> > +
> > +static void cache_of_set_id(struct cacheinfo *this_leaf,
> > + struct device_node *cache_node)
> > +{
> > + struct device_node *cpu;
> > + u32 min_id = ~0;
> > +
> > + for_each_of_cpu_node(cpu) {
> > + u64 id = of_get_cpu_hwid(cpu, 0);
> > +
> > + if (FIELD_GET(GENMASK_ULL(63, 32), id)) {
> > + of_node_put(cpu);
> > + return;
> > + }
> > +
> > + if (match_cache_node(cpu, cache_node))
> > + min_id = min(min_id, id);
> > + }
> > +
> > + if (min_id != ~0) {
> > + this_leaf->id = min_id;
> > + this_leaf->attributes |= CACHE_ID;
> > + }
> > +}
> > +
> > static void cache_of_set_props(struct cacheinfo *this_leaf,
> > struct device_node *np)
> > {
> > @@ -198,6 +235,7 @@ static void cache_of_set_props(struct cacheinfo *this_leaf,
> > cache_get_line_size(this_leaf, np);
> > cache_nr_sets(this_leaf, np);
> > cache_associativity(this_leaf);
> > + cache_of_set_id(this_leaf, np);
> > }
> >
> > static int cache_setup_of_node(unsigned int cpu)
>
>
> Thanks,
>
> Ben
>
>
More information about the linux-arm-kernel
mailing list