[PATCH v2 2/2] firmware: arm_scmi: Support 'reg-io-width' property for shared memory
Florian Fainelli
florian.fainelli at broadcom.com
Fri Aug 16 10:39:42 PDT 2024
On 8/16/24 10:02, Cristian Marussi wrote:
> On Tue, Aug 13, 2024 at 11:07:47AM -0700, Florian Fainelli wrote:
>> Some shared memory areas might only support a certain access width,
>> such as 32-bit, which memcpy_{from,to}_io() does not adhere to at least
>> on ARM64 by making both 8-bit and 64-bit accesses to such memory.
>>
>> Update the shmem layer to support reading from and writing to such
>> shared memory area using the specified I/O width in the Device Tree. The
>> various transport layers making use of the shmem.c code are updated
>> accordingly to pass the I/O accessors that they store.
>>
>
> Hi Florian,
>
> I gave it ago at this on a JUNO regarding the mailbox/shmem transport
> without any issue. I'll have a go later on an OPTEE/shmem scenario too.
>
> This looks fundamentally good to me, since you moved all ops setup at
> setup time and you keep the pointers per-channel instead of global...
Thanks!
>
> A few remarks down below.
>
>> Signed-off-by: Florian Fainelli <florian.fainelli at broadcom.com>
[snip]
>
> ... I may be missing a lot here...bear with me...so...
>
> ... AFAIU, as suggested by Peng, you moved away from iowrite##s and ioread##s
> in favour of __raw_write/read##w so as to avoid the implicit barriers on each
> loop iteration...(I suppose..)
>
> ...but should we place some sort of final io barrier (similarly to iowrite)
> at the end of the loop ?
There is no leading or trailing barrier with the ARM64
memcpy_{to,from}io routines which is why this was carried forward as-is.
There is an implicit barrier with the iowrite32() in the tx_prepare(),
so I suppose we are somewhat safe on that part. Likewise with the
fetch_response()/fetch_notification() we have an implicit barrier within
the ioread32() and then there is a data dependency since we ought to be
consuming the response/notification.
For ARM 32-bit the implementation uses readb()/writeb() which does
include barriers.
>
>> +static inline void shmem_memcpy_toio##s(volatile void __iomem *to, \
>> + const void *from, \
>> + size_t count) \
>> +{ \
>> + while (count) { \
>> + __raw_write##w(*(u##s *)from, to); \
>> + from += amt; \
>> + to += amt; \
>> + count -= amt; \
>> + } \
>> +}
>
> ...same concern here
>
>> +static struct scmi_shmem_io_ops shmem_io_ops##s = { \
>> + .fromio = shmem_memcpy_fromio##s, \
>> + .toio = shmem_memcpy_toio##s, \
>> +};
The macro might be a tad too much given that we only support one width,
in case we needed to add a specific size in the future we could use a
macro again I suppose, for now, just inlined the implementation for the
4-byte / 32-bit size.
>> +
>
> There are a bunch of warn/errs from checkpatch --strict, beside the volatile
> here and on the previous typedefs, also about args reuse and trailing semicolon
> in these macros...
I don't think we can silence the volatile ones, checkpatch --strict did
not complain about the typedefs in my case, what did it look like in yours?
--
Florian
More information about the linux-arm-kernel
mailing list