Question about potential missed IPI events

Thu Nov 16 03:15:54 PST 2023

On 11/16/23 2:11 AM, Xiang W wrote:
> 在 2023-11-16星期四的 01:44 -0800，Bo Gan写道：
>> On 11/16/23 1:19 AM, Xiang W wrote:
>>> 在 2023-11-16星期四的 00:41 -0800，Bo Gan写道：
>>>>
>>>> Hi Xiang, Thank you so much for replying. I'd like to revise your chart a little bit:
>>>>
>>>> Case 1, xchg_ulong in `ipi_process` gets reordered before ipi_clear:
>>>>
>>>>      B                 A                C                 CLINT
>>>>      |                 |                |                  |
>>>>      | bit_set         |                |                  |
>>>>      | send -----------|----------------|----------------> |
>>>>      |                 | xchg           |                  |
>>>>      |                 |                | bit_set          |
>>>>      |                 |                | send ----------> |
>>>>      |                 | clear----------|----------------> |
>>>>
>>>> In this case, A would observe the ipi_data->ipi_type *eventually*, but when?
>>>> A won't process C's IPI request until another IPI comes in the future, which
>>>> might cause the IPI request to wait indefinitely. This is not an efficiency
>>>> problem, but a correctness problem. I think the fix should be adding specific
>>>> fence (need to reason about the correct one) between `ipi_dev->ipi_clear` and
>>>> `atomic_raw_xchg_ulong` in `ipi_process`
>>> Your statement is more correct.
>>>
>>
>> For case 1, can we agree there's an issue in the current code, and it needs to
>> be fixed?
> Yes!

Great, and let's fix it!

>>
>>>>
>>>>
>>>> Case 2, xchg_ulong doesn't get reordered, however:
>>>>
>>>>      B                 A                C                 CLINT
>>>>      |                 |                |                  |
>>>>      | bit_set         |                |                  |
>>>>      | send -----------|----------------|----------------> |
>>>>      |                 | xchg           |            /---> |
>>>>      |                 | clear----------|-----------/----> |
>>>>      |                 |                | bit_set  /       |
>>>>      |                 |                | send ----        |
>>> Why is there an upward-sloping arrow in your picture? Multiple cores
>>> observing atomic operations at the same address should not be
>>> inconsistent.
>>>
>>
>> Let me clarify case 2 a little bit more. Let me use a downward-sloping
>> arrow. In this case the clear sent from A took so long that it arrived
>> at CLINT after C's send.
>>
>>       B                 A                C                 CLINT
>>       |                 |                |                  |
>>       | bit_set ------> |                |                  |
>>       | send -----------|----------------|----------------> |
>>       |                 | xchg           |                  |
>>       |                 | clear----------|----------------> |
>>       |                 |          \     |                  |
>>       |                 | <---------\----|-bit_set          |
>>       |                 |            \   | send ----------> |
>>       |                 |             \  |                  |
>>       |                 |              --|----------------> |
>>                                   
>> Back to your question. Here we have two memory/IO locations, not one.
>> First is the `ipi_data->ipi_type`, second is the msip MMIO for hart A.
>> Essentially, can the following happen?
>>
>>    i.  Assume there's proper fence to ensure xchg orders before clear in A.
>>
>>    ii. From A's point of view, the xchg observes bit_set from B,
>>        but not bit_set from C.
>>
>>    iii.From CLINT's point of view, it observes send from B, then send from C,
>>        then clear from A.
>>
>> I don't know how to reason about this. The only thing I can think of is the
>> global memory order defined by RVWMO. Can we apply the global memory order
>> to the interaction between hart and clint?
> 
> I understand what you mean. The one I proposed is caused by inconsistent core
> speeds and has nothing to do with memory order. Do you think the memory order
> observed by hart and clint is inconsistent?
> 
> How to solve this? fence should only restrict memory ordering on the same core.
> 
> Regards,
> Xiang W

Yes, my issue is about memory ordering, and yes, I'm questioning the global
memory order observed by hart and clint. What kind of rules do they follow?
Where are the rules defined? In RISCV ISA? Or Sifive manual? If this is not
possible, I'm happy to be lectured on the reasoning.

Bo