[PATCH v4 06/20] coresight: add try_get_module() in coresight_grab_device()

Mathieu Poirier mathieu.poirier at linaro.org
Thu Jul 23 15:15:40 EDT 2020


On Thu, Jul 23, 2020 at 08:18:37PM +0200, Greg Kroah-Hartman wrote:
> On Thu, Jul 23, 2020 at 12:04:47PM -0600, Mathieu Poirier wrote:
> > Hi Greg,
> > 
> > On Thu, Jul 23, 2020 at 01:07:07PM +0200, Greg Kroah-Hartman wrote:
> > > On Thu, Jul 23, 2020 at 12:27:48PM +0800, Tingwei Zhang wrote:
> > > > When coresight device is in an active session, driver module of
> > > > that device should not be removed. Use try_get_module() in
> > > > coresight_grab_device() to prevent module to be unloaded.
> > > 
> > > Are you sure this works?  Why is it needed at all?  Why not just tear
> > > down the children properly when a module is removed so that you don't
> > > need this at all?
> > 
> > Using the terms parent and child is somewhat ambiguous...  This is not a
> > parent-child relationship but simply an association between devices, something
> > like port 1 on device "parent" is connected to port 2 on device "child".  The
> > parent-child nomenclature was chosen to reflect that a device appears before
> > another in a coresight path.  Otherwise there is no other relation between
> > devices, hence the choice of using try_get_module()/put_module() to prevent
> > drivers from being taken away.  I'd be happy to proceed differently but haven't
> > found better options.
> > 
> > Going back to parent/child, we could have chosen left/right, up/down or A/B, all
> > of which are just as confusion. 
> 
> Ok, thanks.
> 
> But this causes confusion for everyone as seen below:
> 
> > > > Signed-off-by: Tingwei Zhang <tingwei at codeaurora.org>
> > > > ---
> > > >  drivers/hwtracing/coresight/coresight.c | 27 +++++++++++++++++++++----
> > > >  1 file changed, 23 insertions(+), 4 deletions(-)
> > > > 
> > > > diff --git a/drivers/hwtracing/coresight/coresight.c b/drivers/hwtracing/coresight/coresight.c
> > > > index b7151c5f81b1..17bc76ea86ae 100644
> > > > --- a/drivers/hwtracing/coresight/coresight.c
> > > > +++ b/drivers/hwtracing/coresight/coresight.c
> > > > @@ -640,7 +640,7 @@ struct coresight_device *coresight_get_sink_by_id(u32 id)
> > > >   * don't appear on the trace path, they should be handled along with the
> > > >   * the master device.
> > > >   */
> > > > -static void coresight_grab_device(struct coresight_device *csdev)
> > > > +static int coresight_grab_device(struct coresight_device *csdev)
> > > >  {
> > > >  	int i;
> > > >  
> > > > @@ -648,10 +648,25 @@ static void coresight_grab_device(struct coresight_device *csdev)
> > > >  		struct coresight_device *child;
> > > >  
> > > >  		child  = csdev->pdata->conns[i].child_dev;
> > > > -		if (child && child->type == CORESIGHT_DEV_TYPE_HELPER)
> > > > +		if (child && child->type == CORESIGHT_DEV_TYPE_HELPER) {
> > > > +			if (!try_module_get(child->dev.parent->driver->owner))
> > > 
> > > Why the child's parent?  Why not the child itself?
> > 
> > The device structure of each coresight_device is not associated with a driver.
> > It is there to take advantages of device goodies such as dev.type, dev.group,
> > dev.release and dev.bus.  Coresight IP blocks are discovered on the AMBA bus and as
> > such amba_device::dev::driver holds the driver itself.  In coresight_register()
> > the association coresigth::dev::parent = amba_device::dev is made.
> > 
> > > 
> > > 
> > > > +				goto err;
> > > 
> > > What about the error given to you here?  Why throw that away?
> > > 
> > > >  			pm_runtime_get_sync(child->dev.parent);
> > > > +		}
> > > >  	}
> > > > +	if (!try_module_get(csdev->dev.parent->driver->owner))
> > > > +		goto err;
> > > 
> > > You don't reduce the child's parent's driver owner module reference
> > > here?
> > 
> > Here @parent is referencing the current device.  Now that helper devices
> > connected to any of its outgoing ports have been enabled (and a reference count 
> > to the helper device driver incremented), a reference count to the current device
> > driver can also be incremented. 
> 
> I mean the fact that your error handling does not seem to roll back the
> module reference count you got up above in the other loop.

Ah! You were talking about the error condition, while I thought you were
referring to the normal execution path.  I am in agreement with all your comments on error
handling. 

> 
> Or if it does, it's really really not obvious, and should at the very
> least, be commented as to how it's all cleaning up properly.
> 
> thanks,
> 
> greg k-h



More information about the linux-arm-kernel mailing list