[PATCH net-next v13 3/3] net: hisilicon: new hip04 ethernet driver
Arnd Bergmann
arnd at arndb.de
Thu Jan 15 01:05:19 PST 2015
On Thursday 15 January 2015 10:54:24 Ding Tianhong wrote:
> On 2015/1/14 16:53, Arnd Bergmann wrote:
> > On Wednesday 14 January 2015 14:34:14 Ding Tianhong wrote:
> >> +#define HIP04_MAX_TX_COALESCE_USECS 200
> >> +#define HIP04_MIN_TX_COALESCE_USECS 100
> >> +#define HIP04_MAX_TX_COALESCE_FRAMES 200
> >> +#define HIP04_MIN_TX_COALESCE_FRAMES 100
> >
> > It's not important, but in case you are creating another version of the
> > patch, maybe the allowed range can be extended somewhat. The example values
> > I picked when I sent my suggestion were really made up. It's great if
> > they work fine, but users might want to tune this far more depending on
> > their workloads, How about these
> >
> > #define HIP04_MAX_TX_COALESCE_USECS 100000
> > #define HIP04_MIN_TX_COALESCE_USECS 1
> > #define HIP04_MAX_TX_COALESCE_FRAMES (TX_DESC_NUM - 1)
> > #define HIP04_MIN_TX_COALESCE_FRAMES 1
> >
>
> Is it really ok that the so big range may break the driver and hip04 could not work fine?
> I am not sure it is ok, I will fix it in next version.
Obviously, performance will suffer when you pick a bad setting for
a given workload. If the numbers are too low, you will increase
the CPU load but get better latency, so I see nothing wrong in
allowing '1' as the minimum. For the descriptor number, you can't
go above TX_DESC_NUM, but there is nothing wrong in going close to
it, it will just mean that the timer should fire before you get
there and you again get more interrupts.
The 100ms maximum delay is a bit extreme, and will definitely impact
TX latency in some workloads if a user sets this, but the system
will should still be usable, and I couldn't think of a better
limit. Feel free to set this to e.g. 10ms or 1ms if you feel more
comfortable with that.
For reference, with TX_DESC_NUM=256 and 1500 byte frames, you have
up to 300ms worth of data in a full tx queue on a 10mbit link,
or 3ms respectively for a 1gbit link. Because of BQL, the actual
queue length will normally be much shorter than this, but on a
tx-only workload won't shrink below the currently used
tx_coalesce_usecs value.
Arnd
More information about the linux-arm-kernel
mailing list