[PATCH] usb: ehci: fix update qtd->token in qh_append_tds
Greg KH
greg at kroah.com
Sat Aug 27 12:07:46 EDT 2011
On Sat, Aug 27, 2011 at 11:33:26PM +0800, Ming Lei wrote:
> Hi,
>
> On Sat, Aug 27, 2011 at 11:13 PM, Greg KH <greg at kroah.com> wrote:
> > On Sat, Aug 27, 2011 at 10:48:35PM +0800, ming.lei at canonical.com wrote:
> >> From: Ming Lei <ming.lei at canonical.com>
> >>
> >> This patch fixs one performance bug on ARM Cortex A9 dual core platform,
> >> which has been reported on quite a few ARM machines(OMAP4, Tegra 2, snowball...),
> >> see details from link of https://bugs.launchpad.net/bugs/709245.
> >>
> >> In fact, one mb() on ARM is enough to flush L2 cache, but
> >> 'dummy->hw_token = token;' after mb() is added just for obeying
> >> correct mb() usage.
> >
> > Really? A mb() should not be flushing any caches, it's just a memory
> > barrier. Or is ARM somehow "special" in this way?
>
> As Santosh pointed out, mb on ARM will flush L2 write buffer. The
> description here is wrong.
Then this can't be accepted as-is :)
> I think the below should make the writing reach into memory on all
> ARCH after ' token = dummy->hw_token;' is executed.
>
> dummy->hw_token = token;
> mb()
> token = dummy->hw_token;
>
> The above is the idea introduced to fix the problem.
Are you sure? Have you read the documentation about memory barriers to
confirm this?
> >> The patch has been tested ok on OMAP4 panda A1 board, the performance
> >> of 'dd' over usb mass storage can be increased from 4~5MB/sec to
> >> 14~16MB/sec after applying this patch.
> >
> > That's impressive, but I don't think this is really the proper way to do
> > this...
> >
> >> Signed-off-by: Ming Lei <ming.lei at canonical.com>
> >> ---
> >> drivers/usb/host/ehci-q.c | 14 ++++++++++++++
> >> 1 files changed, 14 insertions(+), 0 deletions(-)
> >>
> >> diff --git a/drivers/usb/host/ehci-q.c b/drivers/usb/host/ehci-q.c
> >> index 0917e3a..65b5021 100644
> >> --- a/drivers/usb/host/ehci-q.c
> >> +++ b/drivers/usb/host/ehci-q.c
> >> @@ -1082,6 +1082,20 @@ static struct ehci_qh *qh_append_tds (
> >> wmb ();
> >> dummy->hw_token = token;
> >>
> >> + /* The mb() below is added to make sure that
> >> + * 'token' can be writen into qtd, so that ehci
> >> + * HC can see the up-to-date qtd descriptor. On
> >> + * some archs(at least on ARM Cortex A9 dual core),
> >> + * writing into coherenet memory doesn't mean the
> >> + * value written can reach physical memory
> >> + * immediately, and the value may be buffered
> >> + * inside L2 cache. 'dummy->hw_token = token;'
> >> + * after mb() is added for obeying correct mb()
> >> + * usage.
> >> + * */
> >> + mb();
> >> + token = dummy->hw_token;
> >
> > Your comment does not match the code, so something is wrong here.
>
> If you mean "L2 cache flush", I confess to the mistaken description,
> and will update it later. If you mean others, could you help to point it out?
I mean others, please read the the last 3 lines of the comment and
compare that to the code lines you added.
greg k-h
More information about the linux-arm-kernel
mailing list