Kernel related (?) user space crash at ARM11 MPCore
Russell King - ARM Linux
linux at arm.linux.org.uk
Sun Sep 20 05:31:39 EDT 2009
On Sun, Sep 20, 2009 at 09:39:00AM +0100, Catalin Marinas wrote:
> On Sat, 2009-09-19 at 23:40 +0100, Russell King - ARM Linux wrote:
> > On Mon, Aug 17, 2009 at 06:25:16PM +0100, Catalin Marinas wrote:
> > > Assuming that the dynamic linker does instruction modifications as well
> > > and expects the mprotect(RX) to flush the caches, the patch below
> > > appears to fix the problem (not intensively tested). Note that I don't
> > > say this is the right fix but it may work around the problem until
> > > further investigation into the dynamic linker.
> >
> > Having now re-read the start of the thread, and put all the pieces
> > together, the problem is not to do with SMP per-se, or Icache
> > problems.
>
> It's the I-D cache coherency.
You may be right, but the current evidence does not support that.
If what you say is true, then all current ARMv6 and ARMv7 non-SMP
systems would be affected. So far, the bug report is only against
SMP systems, where the cache is always forced to write allocate mode.
> > I'd like to request that someone who can prove that the program works
> > on ARMv6/v7 hardware does the following test:
> >
> > 1. boot the system with cachepolicy=writealloc
> > 2. re-test the program
>
> I don't think this would work. All the non-SMP v6/v7 processors I'm
> aware of only support read-allocate caches, even if you try to force
> write-allocate. On the SMP ones (Cortex-A9, ARM11MPCore), write-allocate
> is the default.
Are you sure - I thought some of them did support write allocate.
> I also recall that the cachepolicy argument was only affecting the
> kernel mapping rather than the user one. Is this still the case?
Since changing the ptebits stuff, it affects everything.
> > I'll bet that it will fail - because the writes to userspace to fix up
> > the dynamic linking end up sitting in the data cache rather than hitting
> > the underlying page. (I would try it myself, but I can't build EABI
> > userspace binaries here - only OABI stuff I'm afraid).
>
> I've explained this already in this thread.
This thread is soo big that I couldn't find an explaination. Besides,
my explaination appears to be different from yours.
> > I think what we need to do is to ensure that the copy_user_highpage
> > function is writing back data to the backing RAM, so it is visible
> > to the I-cache when COWs of executable pages occur. However, while
> > we can pass this the vma, the vm_flags can't currently be used to
> > detect COW of temporarily non-executable pages - which is what we
> > want to detect to avoid having to clean the cache on every page
> > copy.
>
> copy_user_highpage() would work if we can detect the VM_EXEC flag but in
> this case, the linker does mprotect(RW) before writing to the page (BTW,
> this function could be fixed as well for RWX pages).
"can't currently be used" - yes, I'm aware of this. We could arrange
to remember that the region had executable permission, and use that as
a trigger for additional handling in copy_user_highpage().
> I don't think it's recommended to clean the D-cache (and invalidate the
> I-cache) every time in copy_user_highpage, therefore cache maintenance
> via mprotect -> change_protection -> flush_cache_range may be a better
> option.
I really don't believe so - try it yourself - run some benchmarks on your
ARMv6 or v7 system, comparing the results both with and without the patch.
Especially pay attention to the process creation/shell script performance.
I think you'll find that with your patch, it'll be worse than ARM systems
running at similar clock rates with VIVT caches.
More information about the linux-arm-kernel
mailing list