[PATCH v1] i2c: imx: Retry transfer on transient failure

Uwe Kleine-König u.kleine-koenig at pengutronix.de
Thu Jul 14 23:49:31 PDT 2022


Hello Francesco,

On Thu, Jul 14, 2022 at 09:34:08AM +0200, Francesco Dolcini wrote:
> On Thu, Jul 14, 2022 at 09:07:05AM +0200, Oleksij Rempel wrote:
> > On Wed, Jul 13, 2022 at 10:25:41PM +0200, Francesco Dolcini wrote:
> > > Hello Oleksij,
> > > 
> > > On Wed, Jul 13, 2022 at 05:57:23PM +0200, Oleksij Rempel wrote:
> > > > On Wed, Jul 13, 2022 at 03:43:29PM +0200, Francesco Dolcini wrote:
> > > > > On Wed, Jul 13, 2022 at 03:24:37PM +0200, Oleksij Rempel wrote:
> > > > > > On Wed, Jul 13, 2022 at 01:57:50PM +0200, Francesco Dolcini wrote:
> > > > > > > + oleksandr.suvorov at foundries.io
> > > > > > > 
> > > > > > > Hello all,
> > > > > > > 
> > > > > > > On Tue, Jul 12, 2022 at 12:05:04PM +0200, Francesco Dolcini wrote:
> > > > > > > > On Tue, Jul 12, 2022 at 11:05:14AM +0200, Uwe Kleine-König wrote:
> > > > > > > > > In which situations does this help? Please mention these in the
> > > > > > > > > commit log.
> > > > > > > > I'll do
> > > > > > > 
> > > > > > > > I did some investigation on this. Unfortunately we have had this change
> > > > > > > > lying around for a year; it was written by Oleksandr, and in the
> > > > > > > > meantime he moved to a new company. I added him to this email thread, so
> > > > > > > > he can comment in case he remembers more.
> > > > > > > 
> > > > > > > We introduced this change while working on OV5640 camera sensor on an
> > > > > > > apalis-imx6q evaluation board, without this change we had some sporadic
> > > > > > > i2c communication issues. Unfortunately I do not have any better
> > > > > > > details.
> > > > > > > 
> > > > > > > > To me it looks like having some (3? 5?) retries as a default is
> > > > > > > > more reasonable than never retrying, but I am not sure whether this should be
> > > > > > > > implemented as a default for all the i2c adapters. From what I was able
> > > > > > > > to see that would not be a trivial change (the retry parameter is coming
> > > > > > > > from the i2c_imx driver, there is no obvious way to have a default in
> > > > > > > > the i2c core).
> > > > > > > 
> > > > > > > Would it work for you to keep the change as it is (just getting rid
> > > > > > > of the useless define) and add a little bit more blurb to the commit
> > > > > > > message to include the various comments collected so far?
> > > > > > 
> > > > > > I assume, it is related to reset time or other reason where the camera
> > > > > > is not responding. In this case, amount of retries would depend on I2C
> > > > > > CLK speed and host CPU speed.
> > > > > > 
> > > > > 
> > > > > The retry on the I2C IMX driver would trigger only on tx arbitration
> > > > > failure, that would be the SDA being tied low by the slave in an
> > > > > unexpected moment, correct? 
> > > > 
> > > > If it is the case, it is better to understand why. Are there some
> > > > special timing requirements?
> > > > 
> > > > > If the camera does not respond it will just
> > > > > not ack the transaction and that would not be recovered by the retry
> > > > > in this change.
> > > > > 
> > > > > > Can this just be a layout/cabling issue with some noise on the SDA line? We
> > > > > > are talking about rather long board-to-board cables with various
> > > > > > signals on them. This is an issue that we had for sure in the past,
> > > > > > however I have a record of this only on a different camera.
> > > > 
> > > > > If it is a cabling issue, then I would take a look at the pinmux
> > > > > configuration. If it is so noisy that some errors are expected, then it would
> > > > > affect camera configuration as well. I mean, the system is potentially
> > > > > writing trash to the config register.
> > > 
> > > > I do not think that this is possible in the way you described: if SDA is
> > > > low when the master is driving it high, the master will just stop
> > > > transmitting and an arbitration lost interrupt is raised. I guess this
> > > > is just the same for any I2C controller; anyway, it is defined in
> > > > `35.7.4 I2C Status Register (I2Cx_I2SR)` in the i.MX6QDL reference manual.
> > > 
> > > > I guess it would still be theoretically possible that the master reads
> > > > garbage from the slave; I'm not aware of any mechanism to avoid it.
> > > 
> > > > That said, I do not have more details; for some unfortunate reason this
> > > > change has been lying in our downstream kernel for too long.
> > > 
> > > > Anyway, let's look at this in a different way. From what I was able to
> > > > understand digging into this topic, retrying on I2C arbitration loss is
> > > > just the normal thing to do and the I2C core has always supported
> > > > this; the comment in i2c-core is
> > > > /* Retry automatically on arbitration loss */.
> > > 
> > > Setting retries to something like 3 or 5 is just very common, various
> > > drivers have this value in the first commit or had it added later on (as
> > > Uwe already commented)
> > > 
> > > To me it seems like the most sensible thing to do, is there any reason
> > > why the i2c_imx driver should not do it?
> > 
> > Here we go:
> > https://www.i2c-bus.org/i2c-primer/analysing-obscure-problems/master-reports-arbitration-lost/
> > "Possible reasons are the same as the ones described in “No Acknowledge
> > From I2C Slave”"
> > 
> > > So, let's see what it can be:
> > https://www.i2c-bus.org/i2c-primer/analysing-obscure-problems/no-acknowledge-from-i2c-slave/
> > "Possible reasons are:
> > 
> > - The I2C slave could not correctly interpret the data on SDA because the SDA
> >   high or low-level voltages do not reach its appropriate input
> >   thresholds.
> > - The I2C slave missed an SCL cycle because the SCL high or low-level voltages
> >   do not reach its appropriate input thresholds.
> > - The I2C slave accidently interpreted a spike etc. as an SCL cycle.
> > 
> > With adequate serial resistors between master and slave, an analog shot
> > of the signals at the slave’s SDA and SCL pins provides a clue whether
> > the slave acknowledges and to which SCL clock pulse. The different SDA
> > low levels due to the serial resistor make it possible to distinguish
> > acknowledges from the slave from data bits from the master. "
> > 
> > > I interpret this to mean that setting the retry count to any non-zero value is a
> > > workaround for a broken circuit. It means that during the HW development phase we
> > > won't be able to detect HW issues if the retry count is not 0.
> > 
> > > IMHO, it is a board-specific configuration and should not be set by the driver.
> 
> Got your point, and in contrast to what you wrote, according to the I2C
> spec, a master is required ("must") to restart the transaction in case
> the arbitration is lost.
> 
> From I2C-bus specification and user manual [1], 3.1.8 Arbitration:
> 
> ```
> No information is lost during the arbitration process. A controller that loses the arbitration
> can generate clock pulses until the end of the byte in which it loses the arbitration and
> must restart its transaction when the bus is free.
> ```

I'd interpret that differently. IMHO this is not "After an arbitration
loss it's obligatory that the (previously aborted) message is repeated",
but more "On an arbitration loss, the master's transfer had no effect on
the slave, so if the message is still to be sent, it must be repeated
from its very beginning."

Otherwise I'm on Oleksij's side: Unless you have a multi-controller
setup, an arbitration loss is a problem with the signal integrity. And
increasing the retry count is only a workaround.

Best regards
Uwe

-- 
Pengutronix e.K.                           | Uwe Kleine-König            |
Industrial Linux Solutions                 | https://www.pengutronix.de/ |