[PATCH v2] irqchip/gic-v3-its: Allow GIC ITS number more than MAX_NUMNODES
Hanjun Guo
hanjun.guo at linaro.org
Wed Jul 26 02:55:17 PDT 2017
On 2017/7/26 16:00, Marc Zyngier wrote:
> On 26/07/17 08:52, Hanjun Guo wrote:
>> On 2017/7/25 18:30, Marc Zyngier wrote:
>>> On 22/07/17 04:54, Hanjun Guo wrote:
>>>> From: Hanjun Guo <hanjun.guo at linaro.org>
>>>>
>>>> When running 4.13-rc1 on top of D05, I got the boot log:
>>>>
>>>> [ 0.000000] SRAT: PXM 0 -> ITS 0 -> Node 0
>>>> [ 0.000000] SRAT: PXM 0 -> ITS 1 -> Node 0
>>>> [ 0.000000] SRAT: PXM 0 -> ITS 2 -> Node 0
>>>> [ 0.000000] SRAT: PXM 1 -> ITS 3 -> Node 1
>>>> [ 0.000000] SRAT: ITS affinity exceeding max count[4]
>>>>
>>>> This is wrong on D05 as we have 8 ITSes with 4 NUMA nodes.
>>>>
>>>> So dynamically alloc the memory needed instead of using
>>>> its_srat_maps[MAX_NUMNODES], which count the number of
>>>> ITS entry(ies) in SRAT and alloc its_srat_maps as needed,
>>>> then build the mapping of numa node to ITS ID. Of course,
>>>> its_srat_maps will be freed after ITS probing because
>>>> we don't need that after boot.
>>>>
>>>> After doing this, I got what I wanted:
>>>>
>>>> [ 0.000000] SRAT: PXM 0 -> ITS 0 -> Node 0
>>>> [ 0.000000] SRAT: PXM 0 -> ITS 1 -> Node 0
>>>> [ 0.000000] SRAT: PXM 0 -> ITS 2 -> Node 0
>>>> [ 0.000000] SRAT: PXM 1 -> ITS 3 -> Node 1
>>>> [ 0.000000] SRAT: PXM 2 -> ITS 4 -> Node 2
>>>> [ 0.000000] SRAT: PXM 2 -> ITS 5 -> Node 2
>>>> [ 0.000000] SRAT: PXM 2 -> ITS 6 -> Node 2
>>>> [ 0.000000] SRAT: PXM 3 -> ITS 7 -> Node 3
>>>>
>>>> Fixes: dbd2b8267233 ("irqchip/gic-v3-its: Add ACPI NUMA node mapping")
>>>> Signed-off-by: Hanjun Guo <hanjun.guo at linaro.org>
>>>> Cc: Ganapatrao Kulkarni <ganapatrao.kulkarni at cavium.com>
>>>> Cc: Lorenzo Pieralisi <lorenzo.pieralisi at arm.com>
>>>> Cc: Marc Zyngier <marc.zyngier at arm.com>
>>>> ---
>>>>
>>>> v1->v2:
>>>> - Add NULL check in acpi_get_its_numa_node() for no ITS affinity case;
>>>> - Free the its_srat_maps after ITS probing.
>>>>
>>>> drivers/irqchip/irq-gic-v3-its.c | 39 ++++++++++++++++++++++++++++++++-------
>>>> 1 file changed, 32 insertions(+), 7 deletions(-)
>>>>
>>>> diff --git a/drivers/irqchip/irq-gic-v3-its.c b/drivers/irqchip/irq-gic-v3-its.c
>>>> index 3ccdf76..1d692aa 100644
>>>> --- a/drivers/irqchip/irq-gic-v3-its.c
>>>> +++ b/drivers/irqchip/irq-gic-v3-its.c
>>>> @@ -1847,13 +1847,16 @@ struct its_srat_map {
>>>> u32 its_id;
>>>> };
>>>>
>>>> -static struct its_srat_map its_srat_maps[MAX_NUMNODES] __initdata;
>>>> +static struct its_srat_map *its_srat_maps __initdata;
>>>> static int its_in_srat __initdata;
>>>>
>>>> static int __init acpi_get_its_numa_node(u32 its_id)
>>>> {
>>>> int i;
>>>>
>>>> + if (!its_srat_maps)
>>>> + return NUMA_NO_NODE;
>>>> +
>>>> for (i = 0; i < its_in_srat; i++) {
>>>> if (its_id == its_srat_maps[i].its_id)
>>>> return its_srat_maps[i].numa_node;
>>>> @@ -1861,6 +1864,12 @@ static int __init acpi_get_its_numa_node(u32 its_id)
>>>> return NUMA_NO_NODE;
>>>> }
>>>>
>>>> +static int __init gic_acpi_match_srat_its(struct acpi_subtable_header *header,
>>>> + const unsigned long end)
>>>> +{
>>>> + return 0;
>>>> +}
>>>> +
>>>> static int __init gic_acpi_parse_srat_its(struct acpi_subtable_header *header,
>>>> const unsigned long end)
>>>> {
>>>> @@ -1877,12 +1886,6 @@ static int __init gic_acpi_parse_srat_its(struct acpi_subtable_header *header,
>>>> return -EINVAL;
>>>> }
>>>>
>>>> - if (its_in_srat >= MAX_NUMNODES) {
>>>> - pr_err("SRAT: ITS affinity exceeding max count[%d]\n",
>>>> - MAX_NUMNODES);
>>>> - return -EINVAL;
>>>> - }
>>>> -
>>>
>>> So you're getting rid of that message when overflowing the array...
>>
>> This overflowing will not happen, because I scan the SRAT
>> to count the entry(ies) of ITS affinity first to alloc the
>> array, and then parse the same SRAT again to setup the mapping
>> of NUMA node to ITS, so is it fine for us to just remove the
>> check here?
>
> Removing that check is fine, as long as you make sure the allocation
> hasn't failed.
Sorry, just want to make sure I understand correctly. This function will
not be called if allocation failure, so do you mean we can keep the code
as it is?
Thanks
Hanjun
More information about the linux-arm-kernel
mailing list