[PATCH 3/4] coresight: tmc-etr: Fix race condition between sysfs and perf mode
Suzuki K Poulose
suzuki.poulose at arm.com
Mon Dec 2 03:31:41 PST 2024
On 02/12/2024 11:19, James Clark wrote:
>
>
> On 02/12/2024 9:46 am, Suzuki K Poulose wrote:
>> Hi Yicong
>>
>> On 02/12/2024 09:24, Yicong Yang wrote:
>>> From: Yicong Yang <yangyicong at hisilicon.com>
>>>
>>> When trying to run perf and sysfs mode simultaneously, the WARN_ON()
>>> in tmc_etr_enable_hw() is triggered sometimes:
>>>
>>> WARNING: CPU: 42 PID: 3911571 at drivers/hwtracing/coresight/
>>> coresight-tmc-etr.c:1060 tmc_etr_enable_hw+0xc0/0xd8 [coresight_tmc]
>>> [..snip..]
>>> Call trace:
>>> tmc_etr_enable_hw+0xc0/0xd8 [coresight_tmc] (P)
>>> tmc_enable_etr_sink+0x11c/0x250 [coresight_tmc] (L)
>>> tmc_enable_etr_sink+0x11c/0x250 [coresight_tmc]
>>> coresight_enable_path+0x1c8/0x218 [coresight]
>>> coresight_enable_sysfs+0xa4/0x228 [coresight]
>>> enable_source_store+0x58/0xa8 [coresight]
>>> dev_attr_store+0x20/0x40
>>> sysfs_kf_write+0x4c/0x68
>>> kernfs_fop_write_iter+0x120/0x1b8
>>> vfs_write+0x2c8/0x388
>>> ksys_write+0x74/0x108
>>> __arm64_sys_write+0x24/0x38
>>> el0_svc_common.constprop.0+0x64/0x148
>>> do_el0_svc+0x24/0x38
>>> el0_svc+0x3c/0x130
>>> el0t_64_sync_handler+0xc8/0xd0
>>> el0t_64_sync+0x1ac/0x1b0
>>> ---[ end trace 0000000000000000 ]---
>>>
>>> Since the enablement of sysfs mode is separeted into two critical
>>> regions,
>>> one for sysfs buffer allocation and another for hardware enablement,
>>> it's
>>> possible to race with the perf mode. Fix this by double check whether
>>> the perf mode's been used before enabling the hardware in sysfs mode.
>>
>> Thanks for the fix. Some minor comments below.
>>
>> It needs a Fixes tag.
>>
>>>
>>> Signed-off-by: Yicong Yang <yangyicong at hisilicon.com>
>>> ---
>>> .../hwtracing/coresight/coresight-tmc-etr.c | 30 +++++++++++++++++++
>>> 1 file changed, 30 insertions(+)
>>>
>>> diff --git a/drivers/hwtracing/coresight/coresight-tmc-etr.c b/
>>> drivers/hwtracing/coresight/coresight-tmc-etr.c
>>> index ad83714ca4dc..d382d95da5ff 100644
>>> --- a/drivers/hwtracing/coresight/coresight-tmc-etr.c
>>> +++ b/drivers/hwtracing/coresight/coresight-tmc-etr.c
>>> @@ -1230,6 +1230,36 @@ static int tmc_enable_etr_sink_sysfs(struct
>>> coresight_device *csdev)
>>> spin_lock_irqsave(&drvdata->spinlock, flags);
>>> + /*
>>> + * Since the sysfs buffer allocation and the hardware enablement
>>> is not
>>> + * in the same critical region, it's possible to race with the perf
>>> + * mode:
>>> + * [sysfs mode] [perf mode]
>>> + * tmc_etr_get_sysfs_buffer()
>>> + * spin_lock(&drvdata->spinlock)
>>> + * [sysfs buffer allocation]
>>> + * spin_unlock(&drvdata->spinlock)
>>> + * spin_lock(&drvdata->spinlock)
>>> + * tmc_etr_enable_hw()
>>> + * drvdata->etr_buf =
>>> etr_perf->etr_buf
>>> + * spin_unlock(&drvdata->spinlock)
>>> + * spin_lock(&drvdata->spinlock)
>>> + * tmc_etr_enable_hw()
>>> + * WARN_ON(drvdata->etr_buf) // WARN sicne etr_buf
>>> initialized at
>>> + * the perf side
>>> + * spin_unlock(&drvdata->spinlock)
>>> + *
>>> + * So check here before continue.
>>> + */
>>> + if (coresight_get_mode(csdev) == CS_MODE_PERF) {
>>> + drvdata->sysfs_buf = NULL;
>>> + spin_unlock_irqrestore(&drvdata->spinlock, flags);
>>> +
>>> + /* Free allocated memory out side of the spinlock */
>>> + tmc_etr_free_sysfs_buf(sysfs_buf);
>>> + return -EBUSY;
>>> + }
>>
>> With this in place, I think we should remove the check for
>> CS_MODE_PERF in get_etr_sysfs_buf() to avoid confusion (which I
>> believe opened up this race)
>>
>> Suzuki
>>
>
> Is it not simpler to set the mode to SYSFS before allocating the buffer
> in the first place? Then we don't need to free if it races and can't get
> into the intermediate state where it's a half initialized sysfs mode.
> The lock doesn't need to be held the whole time, just when setting the
> mode.
>
> Or maybe to make it more consistent with etm4_enable() use
> coresight_take_mode() outside of the lock.
>
> And then also clean up the perf mode check in get_etr_sysfs_buf().
>
> diff --git a/drivers/hwtracing/coresight/coresight-tmc-etr.c b/drivers/
> hwtracing/coresight/coresight-tmc-etr.c
> index a48bb85d0e7f..29c07832127b 100644
> --- a/drivers/hwtracing/coresight/coresight-tmc-etr.c
> +++ b/drivers/hwtracing/coresight/coresight-tmc-etr.c
> @@ -1219,13 +1219,17 @@ static int tmc_enable_etr_sink_sysfs(struct
> coresight_device *csdev)
> int ret = 0;
> unsigned long flags;
> struct tmc_drvdata *drvdata = dev_get_drvdata(csdev->dev.parent);
> - struct etr_buf *sysfs_buf = tmc_etr_get_sysfs_buffer(csdev);
> + struct etr_buf *sysfs_buf;
>
> + spin_lock_irqsave(&drvdata->spinlock, flags);
> +
> + if (coresight_get_mode(csdev) == CS_MODE_PERF)
> + return -EBUSY;
> +
> + sysfs_buf = tmc_etr_get_sysfs_buffer(csdev);
^^ This would try to loack the spinlock again. Or we should explicitly
assert that the spinlock is held in the tmc_etr_get_sysfs_buffer() and
release it and then allocate the buffer. Which again opens up a race
for a PERF session to take it over ?
Suzuki
> if (IS_ERR(sysfs_buf))
> return PTR_ERR(sysfs_buf);
>
> - spin_lock_irqsave(&drvdata->spinlock, flags);
> -
> /*
> * In sysFS mode we can have multiple writers per sink. Since
> this
> * sink is already enabled no memory is needed and the HW need
> not be
>
>
More information about the linux-arm-kernel
mailing list