[PATCH] coresight: etb10: restore atomic_t for shared reading state

James Clark james.clark at linaro.org
Fri May 29 01:49:14 PDT 2026



On 28/05/2026 5:52 pm, Runyu Xiao wrote:
> The etb10 miscdevice uses drvdata->reading as a shared exclusivity gate
> for userspace buffer access. etb_open() claims that gate with
> local_cmpxchg(), and etb_release() clears it with local_set().
> 
> That gate is shared per-device state rather than CPU-local state. A
> running system can reach it whenever /dev/<etb> is opened, closed, and
> reopened by different tasks while the device remains registered, so the
> same drvdata->reading variable may be claimed on one CPU and later
> cleared on another.
> 
> This code used to use atomic_t for the same gate, but commit
> 27b10da8fff2 ("coresight: etb10: moving to local atomic operations")
> changed it to local_t even though the access pattern remained cross-task
> and cross-CPU. Restore atomic_t together with atomic_cmpxchg() and
> atomic_set() so the exclusivity gate again uses a primitive intended
> for shared state.
> 
> The issue was found on Linux v6.18.21 by our static analysis tool while
> scanning surviving local_t-on-shared-state sites, and then manually
> reviewed against the live etb10 file-op path.
> 
> It was runtime-validated with a reproducible QEMU no-device KCSAN PoC
> that kept the same report-local contract:
> 
>    1. use one shared struct etb_drvdata carrier and its
>       drvdata->reading gate;
>    2. call etb_open() and etb_release() sequentially on that gate to
>       confirm the original claim/clear path;
>    3. bind the open side to CPU0 and the release side to CPU1 for the
>       same gate to show cross-CPU ownership;
>    4. run bound workers that repeatedly race etb_open() and
>       etb_release() on the same gate until KCSAN reports a target hit.
> 
> The harness recorded:
> 
>    L1 passed open=1 release=1
>    reading_after_open=1 reading_after_release=0
>    L2 passed open_cpu=0 release_cpu=1
>    cross_cpu_release=1 reading_after=0 open_ret=0
> 
> Representative KCSAN excerpt from the no-device validation run:
> 
>    BUG: KCSAN: data-race in etb_open.constprop.0.isra.0 [vuln_msv]
> 
>    write to 0xffffffffc0003810 of 4 bytes by task 216 on cpu 1:
>     etb_open.constprop.0.isra.0+0x38/0x80 [vuln_msv]
>     l3_worker_thread_fn+0x4f/0xf0 [vuln_msv]
>     kthread+0x17e/0x1c0
>     ret_from_fork+0x22/0x30
> 
>    read to 0xffffffffc0003810 of 4 bytes by task 215 on cpu 0:
>     etb_open.constprop.0.isra.0+0x18/0x80 [vuln_msv]
>     l3_worker_thread_fn+0x4f/0xf0 [vuln_msv]
>     kthread+0x17e/0x1c0
>     ret_from_fork+0x22/0x30
> 
>    value changed: 0x00000000 -> 0x00000001
> 
>    Reported by Kernel Concurrency Sanitizer on:
>    CPU: 0 PID: 215 Comm: etb10_l3_a Tainted: G           O       6.1.66 #2
> 
> This no-device harness is not a real ETB10 hardware end-to-end run, but
> it preserves the same shared drvdata->reading gate and the same
> etb_open()/etb_release() claim/clear contract. No real ETB10 hardware
> was available for runtime testing.
> 
> Build-tested with:
>    make olddefconfig
>    make -j"$(nproc)" drivers/hwtracing/coresight/coresight-etb10.o
> 
> Fixes: 27b10da8fff2 ("coresight: etb10: moving to local atomic operations")
> Cc: stable at vger.kernel.org
> Signed-off-by: Runyu Xiao <runyu.xiao at seu.edu.cn>
> ---
>   drivers/hwtracing/coresight/coresight-etb10.c | 6 +++---
>   1 file changed, 3 insertions(+), 3 deletions(-)
> 
> diff --git a/drivers/hwtracing/coresight/coresight-etb10.c b/drivers/hwtracing/coresight/coresight-etb10.c
> index 35db1b6093d1..98269ea6f7ae 100644
> --- a/drivers/hwtracing/coresight/coresight-etb10.c
> +++ b/drivers/hwtracing/coresight/coresight-etb10.c
> @@ -85,7 +85,7 @@ struct etb_drvdata {
>   	struct coresight_device	*csdev;
>   	struct miscdevice	miscdev;
>   	raw_spinlock_t		spinlock;
> -	local_t			reading;
> +	atomic_t		reading;
>   	pid_t			pid;
>   	u8			*buf;
>   	u32			buffer_depth;
> @@ -603,7 +603,7 @@ static int etb_open(struct inode *inode, struct file *file)
>   	struct etb_drvdata *drvdata = container_of(file->private_data,
>   						   struct etb_drvdata, miscdev);
>   
> -	if (local_cmpxchg(&drvdata->reading, 0, 1))
> +	if (atomic_cmpxchg(&drvdata->reading, 0, 1))
>   		return -EBUSY;
>   
>   	dev_dbg(&drvdata->csdev->dev, "%s: successfully opened\n", __func__);
> @@ -641,7 +641,7 @@ static int etb_release(struct inode *inode, struct file *file)
>   {
>   	struct etb_drvdata *drvdata = container_of(file->private_data,
>   						   struct etb_drvdata, miscdev);
> -	local_set(&drvdata->reading, 0);
> +	atomic_set(&drvdata->reading, 0);
>   
>   	dev_dbg(&drvdata->csdev->dev, "%s: released\n", __func__);
>   	return 0;


Reviewed-by: James Clark <james.clark at linaro.org>

Semi-related to this change, etb_read() doesn't have any lock when 
reading drvdata->buffer_dept or drvdata->buf. It locks in etb_dump(), 
but then unlocks before actually calling copy_to_user().

Seems like concurrent calls to etb_read() might end up with corrupt 
data, although I'm not sure if that would ever happen in practice 
because it only allows one open file handle.




More information about the linux-arm-kernel mailing list