[PATCH v3] irqchip/gic-v3: Ensure GICR_CTLR.EnableLPI=0 is observed before enabling
Marc Zyngier
marc.zyngier at arm.com
Fri Mar 23 02:19:35 PDT 2018
On Thu, 22 Mar 2018 19:41:09 +0000,
Shanker Donthineni wrote:
>
> Hi Marc,
>
> On 03/22/2018 10:51 AM, Marc Zyngier wrote:
> > On 22/03/18 01:58, Shanker Donthineni wrote:
> >> The definition of the GICR_CTLR.RWP control bit was expanded to indicate
> >> whether a change of GICR_CTLR.EnableLPI from 1 to 0 is still in progress
> >> or has completed. Software must observe GICR_CTLR.RWP==0 after clearing
> >> GICR_CTLR.EnableLPI from 1 to 0 and before writing GICR_PENDBASER and/or
> >> GICR_PROPBASER, otherwise behavior is UNPREDICTABLE.
> >>
> >> Signed-off-by: Shanker Donthineni <shankerd at codeaurora.org>
> >> ---
> >> Changes since v2:
> >> -Revert readl_relaxed_poll() usage since it's not usable in GICv3 probe().
> >> -Changes to pr_xxx messages.
> >>
> >> Changes since v1:
> >> -Moved LPI disable code to a separate function as Marc suggested.
> >> -Mark's suggestion to use readl_relaxed_poll_timeout() helper functions.
> >>
> >> drivers/irqchip/irq-gic-v3-its.c | 75 +++++++++++++++++++++++++++++++-------
> >> include/linux/irqchip/arm-gic-v3.h | 1 +
> >> 2 files changed, 62 insertions(+), 14 deletions(-)
> >>
> >> diff --git a/drivers/irqchip/irq-gic-v3-its.c b/drivers/irqchip/irq-gic-v3-its.c
> >> index 2cbb19c..c1e8a8e 100644
> >> --- a/drivers/irqchip/irq-gic-v3-its.c
> >> +++ b/drivers/irqchip/irq-gic-v3-its.c
> >> @@ -33,6 +33,7 @@
> >> #include <linux/of_platform.h>
> >> #include <linux/percpu.h>
> >> #include <linux/slab.h>
> >> +#include <linux/time64.h>
> >>
> >> #include <linux/irqchip.h>
> >> #include <linux/irqchip/arm-gic-v3.h>
> >
> > This hunk doesn't apply to my -next branch, but I don't think it is
> > actually required either...
> >
>
> I'll try to drop "#include <linux/time64.h>" in the next patch if USEC_PER_SEC
> is already provided by other header files, or rebase onto the -next branch.
>
> >> @@ -1875,16 +1876,6 @@ static void its_cpu_init_lpis(void)
> >> gic_data_rdist()->pend_page = pend_page;
> >> }
> >>
> >> - /* Disable LPIs */
> >> - val = readl_relaxed(rbase + GICR_CTLR);
> >> - val &= ~GICR_CTLR_ENABLE_LPIS;
> >> - writel_relaxed(val, rbase + GICR_CTLR);
> >> -
> >> - /*
> >> - * Make sure any change to the table is observable by the GIC.
> >> - */
> >> - dsb(sy);
> >> -
> >> /* set PROPBASE */
> >> val = (page_to_phys(gic_rdists->prop_page) |
> >> GICR_PROPBASER_InnerShareable |
> >> @@ -3287,13 +3278,69 @@ static bool gic_rdists_supports_plpis(void)
> >> return !!(gic_read_typer(gic_data_rdist_rd_base() + GICR_TYPER) & GICR_TYPER_PLPIS);
> >> }
> >>
> >> +static int redist_disable_lpis(void)
> >> +{
> >> + void __iomem *rbase = gic_data_rdist_rd_base();
> >> + u64 timeout = USEC_PER_SEC;
> >> + u64 val;
> >> +
> >> + if (!gic_rdists_supports_plpis()) {
> >> + pr_info("CPU%d: LPIs not supported\n", smp_processor_id());
> >> + return -ENXIO;
> >> + }
> >> +
> >> + val = readl_relaxed(rbase + GICR_CTLR);
> >> + if (!(val & GICR_CTLR_ENABLE_LPIS))
> >> + return 0;
> >> +
> >> + pr_warn("CPU%d: Booted with LPIs enabled, memory probably corrupted\n",
> >> + smp_processor_id());
> >> + add_taint(TAINT_CRAP, LOCKDEP_STILL_OK);
> >> +
> >> + /* Disable LPIs */
> >> + val &= ~GICR_CTLR_ENABLE_LPIS;
> >> + writel_relaxed(val, rbase + GICR_CTLR);
> >> +
> >> + /* Make sure any change to GICR_CTLR is observable by the GIC */
> >> + dsb(sy);
> >> +
> >> + /*
> >> + * Software must observe RWP==0 after clearing GICR_CTLR.EnableLPIs
> >> + * from 1 to 0 before programming GICR_PEND{PROP}BASER registers.
> >> + * Bail out the driver probe() in case of timeout.
> >> + */
> >> + while (readl_relaxed(rbase + GICR_CTLR) & GICR_CTLR_RWP) {
> >> + if (!timeout) {
> >> + pr_err("CPU%d: Failed to observe RWP==0 after disabling LPIs\n",
> >
> > I think you can simplify the message with something like:
> >
> > "Time-out disabling LPIs\n"
> >
> > Nobody apart from you and I really want to know about RWP...
> >
>
> I'll change.
>
> >> + smp_processor_id());
> >> + return -ETIMEDOUT;
> >> + }
> >> + udelay(1);
> >> + timeout--;
> >> + }
> >> +
> >> + /*
> >> + * After it has been written to 1, it is IMPLEMENTATION DEFINED whether
> >> + * the bit GICR_CTLR.EnableLPI becomes RES1 or can be cleared to 0.
> >> + * Bail out the driver probe() on systems where it's RES1.
> >> + */
> >> + if (readl_relaxed(rbase + GICR_CTLR) & GICR_CTLR_ENABLE_LPIS) {
> >> + pr_err("CPU%d: Failed to disable LPIs\n", smp_processor_id());
> >> + return -EBUSY;
> >> + }
> >> +
> >> + return 0;
> >> +}
> >> +
> >> int its_cpu_init(void)
> >> {
> >> if (!list_empty(&its_nodes)) {
> >> - if (!gic_rdists_supports_plpis()) {
> >> - pr_info("CPU%d: LPIs not supported\n", smp_processor_id());
> >> - return -ENXIO;
> >> - }
> >> + int ret;
> >> +
> >> + ret = redist_disable_lpis();
> >> + if (ret)
> >> + return ret;
> >
> > Just realised that this is totally broken.
> >
> > Why do we have this in the loop? Checking the LPI support for each ITS
> > was admittedly braindead (we only need to check it once per CPU), but
> > now trying to disable the LPIs each time we encounter an ITS is going to
> > make it go crazy and taint the kernel for no reason.
> >
>
> Sorry, I didn't quite understand the change you're recommending. I don't
> see any loop here; it just checks whether the ITS node list is empty.
>
> The function its_cpu_init() is called for each CPU coming online.
> We're trying to disable GICR LPIs before calling its_cpu_init_lpis() and
> its_cpu_init_collection(). The newly added function redist_disable_lpis()
> will be called only once per CPU, not once per ITS hardware instance.
> Is there something I'm missing here?
No, you're not. I just got confused by my own patches and completely
misread yours.
Sorry about that. I'll apply the patch directly with the above
changes.
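
For anyone following along without the hardware at hand, here is a
rough userspace sketch of the disable-then-poll sequence from the
quoted redist_disable_lpis(). It is only an illustration, not the
driver code: struct mock_rdist, mock_read_ctlr(), mock_write_ctlr()
and mock_disable_lpis() are hypothetical names that stand in for the
redistributor and the MMIO accessors, and the udelay() between polls
is left out. The bit positions (EnableLPIs at bit 0, RWP at bit 3) are
the ones the kernel header uses.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

#define GICR_CTLR_ENABLE_LPIS	(1u << 0)
#define GICR_CTLR_RWP		(1u << 3)

/* Hypothetical stand-in for one redistributor; not a kernel structure. */
struct mock_rdist {
	uint32_t ctlr;		/* simulated GICR_CTLR value */
	unsigned int rwp_reads;	/* reads that still see RWP==1 once it is set */
	int enable_is_res1;	/* non-zero: EnableLPIs cannot be cleared */
};

/* Simulated register read: RWP stays set for rwp_reads reads, then clears. */
static uint32_t mock_read_ctlr(struct mock_rdist *rd)
{
	if (rd->ctlr & GICR_CTLR_RWP) {
		if (rd->rwp_reads == 0)
			rd->ctlr &= ~GICR_CTLR_RWP;
		else
			rd->rwp_reads--;
	}
	return rd->ctlr;
}

/* Simulated register write: only the EnableLPIs bit is writable here. */
static void mock_write_ctlr(struct mock_rdist *rd, uint32_t val)
{
	int was_enabled = !!(rd->ctlr & GICR_CTLR_ENABLE_LPIS);
	int now_enabled = !!(val & GICR_CTLR_ENABLE_LPIS);

	if (was_enabled && rd->enable_is_res1)
		return;		/* the bit behaves as RES1, write has no effect */

	if (was_enabled && !now_enabled) {
		/* A 1 -> 0 transition starts an operation tracked by RWP. */
		rd->ctlr &= ~GICR_CTLR_ENABLE_LPIS;
		rd->ctlr |= GICR_CTLR_RWP;
	} else if (now_enabled) {
		rd->ctlr |= GICR_CTLR_ENABLE_LPIS;
	}
}

/* Same control flow as the quoted function, minus udelay() and logging. */
static int mock_disable_lpis(struct mock_rdist *rd, unsigned int timeout)
{
	uint32_t val;

	assert(rd != NULL);

	val = mock_read_ctlr(rd);
	if (!(val & GICR_CTLR_ENABLE_LPIS))
		return 0;	/* already disabled, nothing to do */

	mock_write_ctlr(rd, val & ~GICR_CTLR_ENABLE_LPIS);

	/* Wait for RWP==0 before anyone touches PROPBASER/PENDBASER. */
	while (mock_read_ctlr(rd) & GICR_CTLR_RWP) {
		if (!timeout)
			return -ETIMEDOUT;
		timeout--;
	}

	/* If the bit reads back as 1, it could not be cleared. */
	if (mock_read_ctlr(rd) & GICR_CTLR_ENABLE_LPIS)
		return -EBUSY;

	return 0;
}
```

With this model, a redistributor that clears RWP after a few reads
returns 0, one that never clears it within the budget returns
-ETIMEDOUT, and one where EnableLPIs is stuck at 1 returns -EBUSY,
which are the three outcomes the patch distinguishes.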
Thanks,
M.
--
Jazz is not dead, it just smell funny.