[PATCH v5 3/8] drivers: cpuidle: implement DT based idle states infrastructure

Lorenzo Pieralisi lorenzo.pieralisi at arm.com
Thu Jun 26 09:01:35 PDT 2014


On Wed, Jun 25, 2014 at 04:59:49PM +0100, Mark Rutland wrote:
> On Wed, Jun 25, 2014 at 03:10:16PM +0100, Lorenzo Pieralisi wrote:

[...]

> > diff --git a/drivers/cpuidle/dt_idle_states.c b/drivers/cpuidle/dt_idle_states.c
> > new file mode 100644
> > index 0000000..5c16001c
> > --- /dev/null
> > +++ b/drivers/cpuidle/dt_idle_states.c
> > @@ -0,0 +1,283 @@
> > +/*
> > + * DT idle states parsing code.
> > + *
> > + * Copyright (C) 2014 ARM Ltd.
> > + * Author: Lorenzo Pieralisi <lorenzo.pieralisi at arm.com>
> > + *
> > + * This program is free software; you can redistribute it and/or modify
> > + * it under the terms of the GNU General Public License version 2 as
> > + * published by the Free Software Foundation.
> > + */
> > +
> > +#define pr_fmt(fmt) "DT idle-states: " fmt
> > +
> > +#include <linux/cpuidle.h>
> > +#include <linux/cpumask.h>
> > +#include <linux/errno.h>
> > +#include <linux/kernel.h>
> > +#include <linux/list.h>
> > +#include <linux/list_sort.h>
> > +#include <linux/module.h>
> > +#include <linux/of.h>
> > +#include <linux/slab.h>
> > +
> > +#include "dt_idle_states.h"
> > +
> > +struct state_elem {
> > +	struct list_head list;
> > +	struct device_node *node;
> > +	u32 val;
> > +};
> 
> Ah. So the fixed-size entry parameter requirement is because this code
> is in charge of allocating and freeing these structs?

Nope, I use this struct to sort the states and val is the value that
determines the order (ie power-rank) in this patch. If I used the
phandle lists for ordering nodes, this struct would disappear completely,
I have to check if that's feasible.

> > +
> > +static struct list_head head __initdata = LIST_HEAD_INIT(head);
> > +
> > +static bool __init state_cpu_valid(struct device_node *state_node,
> > +				   struct device_node *cpu_node)
> > +{
> > +	int i = 0;
> > +	struct device_node *cpu_state;
> > +
> > +	while ((cpu_state = of_parse_phandle(cpu_node,
> > +					     "cpu-idle-states", i++))) {
> > +		if (cpu_state && state_node == cpu_state) {
> 
> You can drop the cpu_state NULL check, it's implicit in the while loop.

Yep.

> > +			of_node_put(cpu_state);
> > +			return true;
> > +		}
> > +		of_node_put(cpu_state);
> > +	}
> > +	return false;
> > +}
> 
> Is it possible to use a bool ret variable to avoid the two of_node_put
> cases? Or does that end up making this larger?

No, I think you are right.

> > +static bool __init state_cpus_valid(const cpumask_t *cpus,
> > +				    struct device_node *state_node)
> > +{
> > +	int cpu;
> > +	struct device_node *cpu_node;
> > +
> > +	/*
> > +	 * Check if state is valid on driver cpumask cpus
> > +	 */
> > +	for_each_cpu(cpu, cpus) {
> > +		cpu_node = of_get_cpu_node(cpu, NULL);
> > +
> > +		if (!cpu_node) {
> > +			pr_err("Missing device node for CPU %d\n", cpu);
> > +			return false;
> > +		}
> > +
> > +		if (!state_cpu_valid(state_node, cpu_node))
> > +			return false;
> > +	}
> > +
> > +	return true;
> > +}
> 
> Doesn't this leave all the cpu node refcounts incremented? (it's painful
> to get device node refcounting right, I know).
> 
> I think you can use the similarly named of_cpu_device_node_get to find
> the CPU node. It uses the pointer stored in cpu->dev.of_node, so it
> doesn't have to walk the tree to find the CPU node. It also doesn't
> increment the refcount.
> 
> Unless this is too early for that?

I think I can use of_cpu_device_node_get(...), but I should still manage
refcount properly on that, which I am not doing here, good catch.

> > +static void __init init_state_node(struct cpuidle_driver *drv,
> > +				   struct device_node *state_node,
> > +				   int *cnt)
> > +{
> > +	struct cpuidle_state *idle_state;
> > +
> > +	pr_debug(" * %s...\n", state_node->full_name);
> > +
> > +	idle_state = &drv->states[*cnt];
> > +
> > +	if (of_property_read_u32(state_node, "wakeup-latency-us",
> > +				 &idle_state->exit_latency)) {
> 
> I'm not a fan of this construction, as the obvious reading is that we
> take the branch if we succeeded (which obviously isn't true as
> of_property_read_* return error codes). 
> 
> Could we change it to something like:
> 
> 	err = of_property_read_u32(state_node, "wakeup-latency-us",
> 				   &idle_state->exit_latency);
> 	if (err) {

You are right, I will update it.

> > +		u32 entry_latency, exit_latency;
> > +
> > +		if (of_property_read_u32(state_node, "entry-latency-us",
> > +					 &entry_latency)) {
> > +			pr_debug(" * %s missing entry-latency-us property\n",
> > +				 state_node->full_name);
> > +			return;
> > +		}
> 
> Returning without error code? Do the fields have sane default values?
> 
> Or is this safe because we didn't increment cnt?

The latter, but it isn't nice, agreed, it is just an internal interface
though. I will make it less opaque and easier to understand.

> > +
> > +		if (of_property_read_u32(state_node, "exit-latency-us",
> > +					 &exit_latency)) {
> > +			pr_debug(" * %s missing exit-latency-us property\n",
> > +				 state_node->full_name);
> > +			return;
> > +		}
> > +		/*
> > +		 * If wakeup-latency-us is missing, default to entry+exit
> > +		 * latencies as defined in idle states bindings
> > +		 */
> > +		idle_state->exit_latency = entry_latency + exit_latency;
> > +	}
> > +
> > +	if (of_property_read_u32(state_node, "min-residency-us",
> > +				 &idle_state->target_residency)) {
> > +		pr_debug(" * %s missing min-residency-us property\n",
> > +			     state_node->full_name);
> > +		return;
> > +	}
> > +
> > +	idle_state->flags = CPUIDLE_FLAG_TIME_VALID;
> > +	if (!of_property_read_bool(state_node, "timer-state-retained"))
> > +		idle_state->flags |= CPUIDLE_FLAG_TIMER_STOP;
> > + 	strncpy(idle_state->name, state_node->name, CPUIDLE_NAME_LEN);
> > +	strncpy(idle_state->desc, state_node->name, CPUIDLE_NAME_LEN);
> 
> Does the name make sense as a desc? Is a desc necessary?
> 
> CPUIDLE_DESC_LEN seems to exist, and is double CPUIDLE_NAME_LEN.

Yes, that's a copy and paste typo that I missed. BTW this code is likely
to disappear, since the way CPUidle driver manages these strings is changing.

As to is desc really needed, I need to check all existing drivers to
provide a complete answer.

> > +static void __init add_idle_states(struct cpuidle_driver *drv,
> > +				   struct device_node *idle_states)
> > +{
> > +	struct device_node *state_node;
> > +
> > +	for_each_child_of_node(idle_states, state_node) {
> > +		if ((!of_device_is_compatible(state_node, "arm,idle-state"))) {
> 
> Holy brackets batman! I think we can drop the outer ones given there's
> no assignment we want to supress warnings for.

Eheh sorry, should be a leftover, fixed.

> > +			pr_warn(" * %s: children of /cpus/idle-states must be \"arm,idle-state\" compatible\n",
> > +				     state_node->full_name);
> 
> Presumably the entire reason for having the compatible string is for
> future extensibility.
> 
> It would probably be better to have something like:
> 
> 	pr_warn("Node %s has unrecognised/missing compatible string\n",
> 		state_node->full_name);
> 

It makes sense, so I will change the pr_warn.

> > +			continue;
> > +		}
> > +		/*
> > +		 * If memory allocation fails, better bail out.
> > +		 * Initialized nodes are freed at initialization
> > +		 * completion in of_init_idle_driver().
> > +		 */
> > +		if ((add_state_node(drv->cpumask, state_node) == -ENOMEM))
> > +			break;
> 
> Can we not return? Or is the list sort important in the error case too?

Well, we might have a valid list of states that have to be sorted and I
think that's correct to break and not just return in that case.

Let's see if I can avoid the sorting altogether.

> > +	}
> > +	/*
> > +	 * Sort the states list before initializing the CPUidle driver
> > +	 * states array.
> > +	 */
> > +	list_sort(NULL, &head, state_cmp);
> > +}
> > +
> > +/**
> > + * dt_init_idle_driver() - Parse the DT idle states and initialize the
> > + *			   idle driver states array
> > + *
> > + * @drv:	  Pointer to CPU idle driver to be initialized
> > + * @state_nodes:  Array of struct device_nodes to be initialized if
> > + *		  init_nodes == true. Must be sized CPUIDLE_STATE_MAX
> > + * @start_idx:    First idle state index to be initialized
> > + * @init_nodes:   Boolean to request device nodes initialization
> > + *
> > + * On success the states array in the cpuidle driver contains
> > + * initialized entries in the states array, starting from index start_idx.
> > + * If init_nodes == true, on success the state_nodes array is initialized
> > + * with idle state DT node pointers, starting from index start_idx,
> > + * in a 1:1 relation with the idle driver states array.
> > + *
> > + * Return:
> > + *	0 on success
> > + *	<0 on failure
> > + */
> > +int __init dt_init_idle_driver(struct cpuidle_driver *drv,
> > +			       struct device_node *state_nodes[],
> > +			       unsigned int start_idx, bool init_nodes)
> > +{
> > +	struct device_node *idle_states_node;
> > +	int ret;
> > +
> > +	if (start_idx >= CPUIDLE_STATE_MAX) {
> > +		pr_warn("State index exceeds static CPU idle driver states array size\n");
> > +		return -EINVAL;
> > +	}
> > +
> > +	if (WARN(init_nodes && !state_nodes,
> > +		"Requested nodes stashing in an invalid nodes container\n"))
> > +		return -EINVAL;
> 
> That warning message is somewhat confusing, and I'm not sure I
> follow the logic.

It is a belt and braces check to make sure that, if the dt init code is
requested to fill in the state_nodes array (init_nodes == true), at least
the array base was passed and it is not a NULL pointer. I think I'd better
remove it and let the kernel oops if the interface is used wrongly, that would
be a kernel bug and there is not much to WARN about.

Thanks,
Lorenzo




More information about the linux-arm-kernel mailing list