[PATCHv2 net-next 05/16] net: mvpp2: introduce PPv2.2 HW descriptors and adapt accessors
Robin Murphy
robin.murphy at arm.com
Fri Feb 3 08:31:25 PST 2017
On 03/02/17 15:02, Thomas Petazzoni wrote:
> Hello,
>
> On Fri, 3 Feb 2017 14:05:13 +0000, Robin Murphy wrote:
>
>>> So do you have a better suggestion? The descriptors only have enough
>>> space to store a 40-bit virtual address, which is not enough to fit the
>>> virtual addresses used by Linux for SKBs. This is why I'm instead
>>> relying on the fact that the descriptors can store the 40-bit physical
>>> address, and convert it back to a virtual address, which should be fine
>>> on ARM64 because the entire physical memory is part of the kernel linear
>>> mapping.
>>
>> OK, that has nothing to do with DMA addresses then.
>
> Well, it has to do with DMA in the sense that those buffers are
> mapped with dma_map_single(). So the address that is given to us by the
> hardware as the "physical address of the RX buffer" is the one that we
> have initially given to the hardware and was the result of
> dma_map_single().
Ah, right. DMA addresses are not physical addresses. They may look very
much alike on many platforms, but that cannot be relied upon. On some
platforms, some of the "address bits" actually encode things like bus
attributes. If the device is behind an IOMMU, the DMA address can be
entirely arbitrary, and the only way to convert it back to a CPU address
(physical or virtual) is to walk the IOMMU page table.
>>> Russell proposal of testing the size of a virtual address
>>> pointer instead would solve this I believe, correct?
>>
>> AFAICS, even that shouldn't really be necessary - for all VA/PA
>> combinations of 32/32, 32/40 and 64/40, storing virt_to_phys() of the
>> SKB VA won't overflow 40 bits,
>
> I'm already lost here. Why are you talking about virt_to_phys() ? See
> above: we have the dma_addr_t returned from dma_map_single(), and we
> need to find back the corresponding virtual address, because there is
> not enough room in the HW descriptors to store a full 64-bit VA.
That's the wrong approach. You need to keep track of the VA which you
gave to dma_map_single() in the first place; *that* can be packed into
40 bits with a round-trip through a CPU physical address, assuming that
this "VA" field in the hardware descriptor is in addition to the actual
DMA address (and that the hardware descriptor is something different to
the rx_desc struct to which you could presumably just add an extra pointer).
>> so a corresponding phys_to_virt() at the other end can't go wrong
>> either. If you really do want to special-case things based on VA
>> size, though, either CONFIG_64BIT or sizeof(void *) would indeed be
>> infinitely more useful than the unrelated DMA address width - I know
>> this driver's never going to run on SPARC64, but that's one example
>> of where the above logic would lead to precisely the truncated VA
>> it's trying to avoid.
>
> What is different on SPARC64 here?
void * and phys_addr_t are 64-bit, but dma_addr_t is 32-bit. The more
realistic concern, though, is that this using dma_to_phys() - which is a
DMA-API-internal helper for SWIOTLB - will break the driver's
COMPILE_TEST dependency.
> The situation we have is the following:
>
> - On systems where VAs are 32-bit wide, we have enough room to store
> the VA in the HW descriptor. So when we receive a packet, the HW
> descriptor provides us directly with the VA of the network packet,
> and the DMA address of the packet. We can dma_unmap_single() the
> packet, and do its processing.
>
> - On systems where VAs are 64-bit wide, we don't have enough room to
> store the VA in the HW descriptor. However, on 64-bit systems, the
> entire physical memory is mapped in the kernel linear mapping, so
> phys_to_virt() is valid on any physical address. And we use this
> property to retrieve the full 64-bit VA using the DMA address that
> we get from the HW descriptor.
>
> Since what we get from the HW descriptor is a DMA address, that's
> why we're using phys_to_virt(dma_to_phys(...)).
Marvell's product page tells me that Armada7k/8k include an IOMMU; at
some point, somebody's surely going to want to use it, and any invalid
assumptions about physical addresses are going to go bang. You need to
treat CPU and DMA addresses as entirely separate, but since any CPU
address used for streaming DMA - i.e. dma_map_*() - must be a valid
linear map address, virtual and physical are relatively interchangeable
if the latter works out more convenient.
Robin.
>
> Best regards,
>
> Thomas
>
More information about the linux-arm-kernel
mailing list