[PATCH 0/4] memcpy optimized with strd/ldrd
Nicolas Pitre
nicolas.pitre at linaro.org
Tue Apr 3 10:48:33 EDT 2012
On Tue, 3 Apr 2012, Boojin Kim wrote:
> Nicolas Pitre wrote:
>
> > > >
> > > > Here's my version. Lightly tested.
> > > > I have no A15 hardware to run any performance comparison though.
> > > >
> > > I'm reviewing and testing your patch, but other work keeps interrupting the review.
> > > I will give you feedback within this week.
> > > Please wait a little longer.
> > > And thanks for your patches. :)
> >
> > FYI, it occurred to me that some corner cases might not be quite right
> > with regard to alignment for the STRD instruction. It seems that the
> > hardware on which I tested it (Marvell Dove CPU) copes with
> > misaligned STRDs when they're still 32-bit aligned. So I need to run
> > this code through a real validation harness on different hardware.
>
> Unfortunately, performance did not improve after applying your patches.
> I think something in patches 1 to 3 causes a performance degradation.
If you could identify which patch is responsible, that would be helpful.
Thanks.
Nicolas