[PATCH 0/4] memcpy optimized with strd/ldrd

Nicolas Pitre nicolas.pitre at linaro.org
Tue Apr 3 10:48:33 EDT 2012


On Tue, 3 Apr 2012, Boojin Kim wrote:

> Nicolas Pitre wrote:
> 
> > > >
> > > > Here's my version.  Lightly tested.
> > > > I have no A15 hardware to run any performance comparison though.
> > > >
> > > I'm reviewing and testing your patch. But, My other work disturbs to reviewing it.
> > > I will give you feedback soon within this week.
> > > Wait a little more.
> > > And, Thanks for your patches. :)
> >
> > FYI, it occurred to me that some corner cases might not be quite right
> > with regards to alignment for the STRD instruction.  It seems that the
> > hardware on which I tested it (Marvell Dove CPU) apparently copes with
> > misaligned SDRD's when they're still 32-bit aligned.  So I need to run
> > this code through a real validation harness on different hardware.
> 
> It's sad, but the performance result wasn't better after adapting your patch.
> I think something on 1~3 patch brings performance degreasing.

If you could identify which patch is responsible that would be helpful.

Thanks.


Nicolas



More information about the linux-arm-kernel mailing list