[PATCH] ARM: force linker to use PIC veneers
Ard Biesheuvel
ard.biesheuvel at linaro.org
Tue Mar 24 05:50:40 PDT 2015
On 24 March 2015 at 13:22, Dave Martin <Dave.Martin at arm.com> wrote:
> On Tue, Mar 24, 2015 at 11:16:24AM +0100, Ard Biesheuvel wrote:
>> When building a very large kernel, it is up to the linker to decide
>> when and where to insert stubs to allow calls to functions that are
>> out of range for the ordinary b/bl instructions.
>>
>> However, since the kernel is built as a position dependent binary,
>> these stubs (aka veneers) may contain absolute addresses, which will
>> break such veneer assisted far calls performed with the MMU off.
>>
>> For instance, the call from __enable_mmu() in the .head.text section
>> to __turn_mmu_on() in the .idmap.text section may be turned into
>> something like this:
>>
>> c0008168 <__enable_mmu>:
>> c0008168: f020 0002 bic.w r0, r0, #2
>> c000816c: f420 5080 bic.w r0, r0, #4096
>> c0008170: f000 b846 b.w c0008200 <____turn_mmu_on_veneer>
>> [...]
>> c0008200 <____turn_mmu_on_veneer>:
>> c0008200: 4778 bx pc
>> c0008202: 46c0 nop
>> c0008204: e59fc000 ldr ip, [pc]
>> c0008208: e12fff1c bx ip
>> c000820c: c13dfae1 teqgt sp, r1, ror #21
>> [...]
>> c13dfae0 <__turn_mmu_on>:
>> c13dfae0: 4600 mov r0, r0
>> [...]
>>
>> After adding --pic-veneer to the LDFLAGS, the veneer is emitted like
>> this instead:
>>
>> c0008200 <____turn_mmu_on_veneer>:
>> c0008200: 4778 bx pc
>> c0008202: 46c0 nop
>> c0008204: e59fc004 ldr ip, [pc, #4]
>> c0008208: e08fc00c add ip, pc, ip
>> c000820c: e12fff1c bx ip
>> c0008210: 013d7d31 teqeq sp, r1, lsr sp
>> c0008214: 00000000 andeq r0, r0, r0
>>
>> Note that this particular example is best addressed by moving
>> .head.text and .idmap.text closer together, but this issue could
>> potentially affect any code that needs to execute with the
>> MMU off.
>>
>> Signed-off-by: Ard Biesheuvel <ard.biesheuvel at linaro.org>
>
> Although that fixes the problem, wouldn't this introduce extra potential
> overhead for every call in the kernel?
>
It does not change whether a veneer is emitted or not, it only affects
the PIC nature of it.
So the overhead is 1 additional word for the add instruction, which I
think is a small price to pay for correctness, especially considering
that someone building such a big kernel obviously does not optimize
for size.
> How many such veneers get added in the your kernel configuration, and
> how many are actually necessary (i.e., calls between MMU-off code and
> elsewhere)?
>
Very few. In addition to the example (which will be addressed in
another way regardless) there are some resume functions that get
allocated in .data, and those would need it as well. I have also
proposed b_far/bl_far macros that could be used there as well.
The primary concern is that you can't really check whether any
problematic veneers have been emitted, unless all code that may run
with the MMU off is moved to the idmap.text section.
More information about the linux-arm-kernel
mailing list