[PATCH] irq: bcm2835: Re-implement the hardware IRQ handler.
Stephen Warren
swarren at wwwdotorg.org
Tue Oct 1 22:12:39 EDT 2013

On 09/26/2013 02:19 AM, Craig McGeachie wrote:
> On 09/24/2013 03:38 PM, Stephen Warren wrote:
>> I'm not actually sure that's a good change. If we read e.g. PEND1 and
>> saw 4 bits asserted, why wouldn't we optimize by handling each of those
>> 4 bits. Re-reading the PEND1 register for each bit, and presumably
>> seeing the exact same set of remaining bits still set (or even more),
>> just seems like a waste. We know we're going to have to process all the
>> interrupts, so why not just process them?
>
>
> On 09/25/2013 11:33 PM, Simon Arlott wrote:
>> On Sat, September 21, 2013 09:14, Craig McGeachie wrote:
>>> * Restore the flow of control that re-reads the base pending register
>>> after handling any interrupt. The current version handles all
>>> interrupts found in a GPU pending register before re-reading the
>>> base pending register. In the original Broadcom assembly code, there
>>> appear to be defect tracking numbers next to code inserted to create
>>> this behaviour.
>>
>> This was by design so that continuous interrupts in a bank did not impact
>> progress of interrupts in other banks. If there are defects with this
>> strategy, then check that they do not exist in your interrupt handlers
>> instead of the interrupt controller itself. Unless there is a real bug
>> in the interrupt controller, you're decreasing the performance by
>> re-reading the base pending register every time.
>
> I don't understand the concern with re-reading two volatile registers
> between each dispatch. Given the amount of processing that occurs
> between the call to handle_IRQ and the calls to the possibly multiple
> registered interrupt handlers, plus the processing that the handlers
> perform (even if they are implemented as top/bottom halves), I think the
> performance overhead of the two extra reads is vanishingly small. In
> fact, I think that focusing on eliminating them is premature
> optimisation. Developers are notoriously bad at identifying performance
> hotspots through visual inspection.
>
> The point about the registers being volatile is important. It's a C
> keyword for a very good reason.

Volatile as a keyword isn't especially useful for registers though,
since register IO tends to need various barriers as well, but anyway...

I do agree that it's likely best if the driver processes interrupts in
the priority order that the HW designers came up with. So, I'm open to
that change. This might make a difference to some time-critical
shortcuts like the PCM (audio) interrupt.
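For readers following the thread, the two dispatch strategies under
discussion can be sketched as a small userspace simulation. This is not
the bcm2835 driver code. The register layout (local sources in bits 0-7
of the base register, one summary bit for bank 1), the IRQ numbering and
every name below (sim_intc, sim_init, dispatch_drain_bank,
dispatch_reread) are assumptions made up for illustration, and plain
struct fields stand in for the real MMIO reads and their barriers.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define LOCAL_MASK 0xffu      /* hypothetical: base bits 0-7 are local sources */
#define BANK1_FLAG (1u << 8)  /* hypothetical: "something pending in bank 1" */
#define BANK1_IRQ  100        /* illustration only: bank 1 bit n is IRQ 100+n */
#define MAX_LOG    16

struct sim_intc {
    uint32_t base;    /* pending local sources */
    uint32_t pend1;   /* pending bank 1 sources */
    uint32_t late;    /* local sources that arrive during the first handler */
    unsigned reads;   /* number of simulated register reads */
    int log[MAX_LOG]; /* IRQ numbers in the order they were handled */
    size_t nlog;
};

void sim_init(struct sim_intc *c, uint32_t pend1, uint32_t late)
{
    c->base = 0;
    c->pend1 = pend1;
    c->late = late;
    c->reads = 0;
    c->nlog = 0;
}

static uint32_t read_base(struct sim_intc *c)
{
    c->reads++;
    return c->base | (c->pend1 ? BANK1_FLAG : 0u);
}

static uint32_t read_pend1(struct sim_intc *c)
{
    c->reads++;
    return c->pend1;
}

static int lowest_bit(uint32_t v)
{
    int n = 0;

    assert(v != 0);
    while (!(v & 1u)) {
        v >>= 1;
        n++;
    }
    return n;
}

/* Record the IRQ, clear its source, then let any "late" interrupt arrive. */
static void handle(struct sim_intc *c, int irq)
{
    assert(c->nlog < MAX_LOG);
    c->log[c->nlog++] = irq;
    if (irq >= BANK1_IRQ)
        c->pend1 &= ~(UINT32_C(1) << (irq - BANK1_IRQ));
    else
        c->base &= ~(UINT32_C(1) << irq);
    c->base |= c->late;
    c->late = 0;
}

/* Strategy A: read the bank once and handle every bit seen in that read. */
void dispatch_drain_bank(struct sim_intc *c)
{
    uint32_t stat;

    while ((stat = read_base(c)) != 0) {
        if (stat & LOCAL_MASK) {
            handle(c, lowest_bit(stat & LOCAL_MASK));
        } else if (stat & BANK1_FLAG) {
            uint32_t pend = read_pend1(c);

            while (pend) {
                handle(c, BANK1_IRQ + lowest_bit(pend));
                pend &= pend - 1; /* drop the bit just handled */
            }
        }
    }
}

/* Strategy B: handle one interrupt, then re-read the base register. */
void dispatch_reread(struct sim_intc *c)
{
    uint32_t stat;

    while ((stat = read_base(c)) != 0) {
        if (stat & LOCAL_MASK) {
            handle(c, lowest_bit(stat & LOCAL_MASK));
        } else if (stat & BANK1_FLAG) {
            uint32_t pend = read_pend1(c);

            if (pend)
                handle(c, BANK1_IRQ + lowest_bit(pend));
        }
    }
}
```

In this toy model, starting with bank 1 bits 3 and 5 pending and local
source 0 arriving during the first handler, dispatch_drain_bank handles
103, 105, 0 using four simulated register reads, while dispatch_reread
handles 103, 0, 105 using six. That is the trade-off being debated: a
couple of extra reads in exchange for dispatching in priority order.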