[PATCH v3 00/18] Eliminate duplicate barriers on weakly-ordered archs

Sinan Kaya okaya at codeaurora.org
Fri Mar 16 09:16:13 PDT 2018


Driver code calls wmb() immediately followed by writel() in multiple
places. writel() already includes a barrier on some architectures, such as
arm64.

As a result, the CPU executes two barriers back to back before performing
the register write.

Since the code already has an explicit barrier call, change writel() to
writel_relaxed().
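
To illustrate the pattern, here is a minimal userspace sketch, not kernel
code. The wmb(), writel() and writel_relaxed() below are stand-ins that only
count barriers, writel() is modeled as "barrier plus relaxed write" as on an
architecture like arm64, and fake_doorbell and the ring_doorbell_*() helpers
are hypothetical names that do not appear in any of the patched drivers.

```c
#include <stdint.h>

/* Userspace stand-ins, NOT the kernel definitions: they only count barriers. */
static unsigned int barrier_count;	/* barriers issued since last reset */
static uint32_t fake_doorbell;		/* hypothetical device register */

static void wmb(void)
{
	barrier_count++;
}

static void writel_relaxed(uint32_t val, volatile uint32_t *addr)
{
	*addr = val;
}

/* Assumed model of an arch whose writel() carries its own barrier. */
static void writel(uint32_t val, volatile uint32_t *addr)
{
	wmb();
	writel_relaxed(val, addr);
}

/* Before: explicit wmb() followed by writel() issues two barriers. */
static unsigned int ring_doorbell_before(uint32_t tail)
{
	barrier_count = 0;
	/* descriptor updates in memory would happen before this point */
	wmb();
	writel(tail, &fake_doorbell);
	return barrier_count;
}

/* After: the explicit wmb() is kept, the register write is relaxed. */
static unsigned int ring_doorbell_after(uint32_t tail)
{
	barrier_count = 0;
	/* descriptor updates in memory would happen before this point */
	wmb();
	writel_relaxed(tail, &fake_doorbell);
	return barrier_count;
}
```

In this model the "before" helper counts two barriers and the "after" helper
counts one, while the register ends up with the same value in both cases.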

I did a regex search for wmb() followed by writel() in each drivers
directory and scrubbed the ones I care about in this series.

I considered "ease of change", "popular usage" and "performance critical
path" as the determining criteria for my filtering.

The relaxed API has been used heavily on ARM for a long time, but it did
not exist on other architectures. For this reason, weakly-ordered
architectures have been paying a double penalty in order to use the common
drivers.

Now that the relaxed API is present on all architectures, we can go and
scrub all drivers to see what needs to change and what can remain.

We start with the most widely used ones and hope to increase the coverage
over time. It will take a while to cover all drivers.

Feel free to apply patches individually.

Changes since v2:
- https://www.spinics.net/lists/arm-kernel/msg641383.html
- compilation fix for ixgbevf
- add patches 7..18

Sinan Kaya (18):
  i40e/i40evf: Eliminate duplicate barriers on weakly-ordered archs
  ixgbe: eliminate duplicate barriers on weakly-ordered archs
  igbvf: eliminate duplicate barriers on weakly-ordered archs
  igb: eliminate duplicate barriers on weakly-ordered archs
  ixgbevf: keep writel() closer to wmb()
  ixgbevf: eliminate duplicate barriers on weakly-ordered archs
  drivers: net: cxgb: Eliminate duplicate barriers on weakly-ordered
    archs
  scsi: hpsa: Eliminate duplicate barriers on weakly-ordered archs
  fm10k: Eliminate duplicate barriers on weakly-ordered archs
  net: qla3xxx: Eliminate duplicate barriers on weakly-ordered archs
  qlcnic: Eliminate duplicate barriers on weakly-ordered archs
  bnx2x: Eliminate duplicate barriers on weakly-ordered archs
  net: cxgb4/cxgb4vf: Eliminate duplicate barriers on weakly-ordered
    archs
  net: cxgb3: Eliminate duplicate barriers on weakly-ordered archs
  RDMA/bnxt_re: Eliminate duplicate barriers on weakly-ordered archs
  IB/mlx4: Eliminate duplicate barriers on weakly-ordered archs
  RDMA/i40iw: Eliminate duplicate barriers on weakly-ordered archs
  infiniband: cxgb4: Eliminate duplicate barriers on weakly-ordered
    archs

 drivers/infiniband/hw/bnxt_re/qplib_rcfw.c          |  8 ++++----
 drivers/infiniband/hw/cxgb4/t4.h                    | 12 ++++++------
 drivers/infiniband/hw/i40iw/i40iw_ctrl.c            |  6 ++++--
 drivers/infiniband/hw/i40iw/i40iw_osdep.h           |  1 +
 drivers/infiniband/hw/i40iw/i40iw_uk.c              |  2 +-
 drivers/infiniband/hw/i40iw/i40iw_utils.c           | 11 +++++++++++
 drivers/infiniband/hw/mlx4/qp.c                     |  4 ++--
 drivers/net/ethernet/broadcom/bnx2x/bnx2x_vfpf.c    |  2 +-
 drivers/net/ethernet/chelsio/cxgb/sge.c             |  2 +-
 drivers/net/ethernet/chelsio/cxgb3/adapter.h        |  7 +++++++
 drivers/net/ethernet/chelsio/cxgb3/sge.c            | 19 ++++++++++---------
 drivers/net/ethernet/chelsio/cxgb4/cxgb4.h          |  6 ++++++
 drivers/net/ethernet/chelsio/cxgb4/cxgb4_main.c     | 13 +++++++------
 drivers/net/ethernet/chelsio/cxgb4/sge.c            |  8 ++++----
 drivers/net/ethernet/chelsio/cxgb4/t4_hw.c          |  2 +-
 drivers/net/ethernet/chelsio/cxgb4vf/adapter.h      | 14 ++++++++++++++
 drivers/net/ethernet/chelsio/cxgb4vf/sge.c          | 16 +++++++++-------
 drivers/net/ethernet/intel/fm10k/fm10k_main.c       |  2 +-
 drivers/net/ethernet/intel/i40e/i40e_txrx.c         |  8 ++++----
 drivers/net/ethernet/intel/i40evf/i40e_txrx.c       |  4 ++--
 drivers/net/ethernet/intel/igb/igb_main.c           |  4 ++--
 drivers/net/ethernet/intel/igbvf/netdev.c           |  4 ++--
 drivers/net/ethernet/intel/ixgbe/ixgbe_main.c       |  8 ++++----
 drivers/net/ethernet/intel/ixgbevf/ixgbevf.h        |  5 -----
 drivers/net/ethernet/intel/ixgbevf/ixgbevf_main.c   |  4 ++--
 drivers/net/ethernet/qlogic/qla3xxx.c               |  4 ++--
 drivers/net/ethernet/qlogic/qlcnic/qlcnic_83xx_hw.c |  2 +-
 drivers/scsi/hpsa.h                                 |  2 +-
 28 files changed, 110 insertions(+), 70 deletions(-)

-- 
2.7.4



