On 02.09.25 15:31, Alex Deucher wrote:
> On Tue, Sep 2, 2025 at 9:27 AM Christian König <[email protected]>
> wrote:
>>
>> On 02.09.25 15:25, Alex Deucher wrote:
>>> On Tue, Sep 2, 2025 at 3:38 AM Christian König <[email protected]>
>>> wrote:
>>>>
>>>> On 02.09.25 05:29, Srinivasan Shanmugam wrote:
>>>>> Add mmio_remap bookkeeping to amdgpu_device and introduce
>>>>> amdgpu_ttm_mmio_remap_bo_init()/fini() to manage a kernel-owned,
>>>>> one-page (4K) BO in AMDGPU_GEM_DOMAIN_MMIO_REMAP.
>>>>>
>>>>> Bookkeeping:
>>>>> - adev->rmmio_remap.bo : kernel-owned singleton BO
>>>>>
>>>>> The BO is allocated during TTM init when a remap bus address is available
>>>>> (adev->rmmio_remap.bus_addr) and PAGE_SIZE <= AMDGPU_GPU_PAGE_SIZE (4K),
>>>>> and freed during TTM fini.
>>>>>
>>>>> v2:
>>>>> - Check mmio_remap bus address (adev->rmmio_remap.bus_addr) instead of
>>>>> rmmio_base. (Alex)
>>>>> - Skip quietly if PAGE_SIZE > AMDGPU_GPU_PAGE_SIZE or no bus address
>>>>> (no warn). (Alex)
>>>>> - Use `amdgpu_bo_create()` (not *_kernel) - Only with this The object
>>>>> is stored in adev->mmio_remap.bo and will later be exposed to
>>>>> userspace via a GEM handle. (Christian)
>>>>>
>>>>> v3:
>>>>> - Remove obvious comment before amdgpu_ttm_mmio_remap_bo_fini() call.
>>>>> (Alex)
>>>>>
>>>>> v4:
>>>>> - Squash bookkeeping into this patch
>>>>> - Place longer declaration first; clear bp via memset
>>>>> - Reserve + pin + kmap(+kunmap) the BO at init; unpin in fini
>>>>> (Christian)
>>>>>
>>>>> Suggested-by: Christian König <[email protected]>
>>>>> Suggested-by: Alex Deucher <[email protected]>
>>>>> Signed-off-by: Srinivasan Shanmugam <[email protected]>
>>>>> ---
>>>>> drivers/gpu/drm/amd/amdgpu/amdgpu.h | 1 +
>>>>> drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c | 87 +++++++++++++++++++++++++
>>>>> 2 files changed, 88 insertions(+)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu.h
>>>>> b/drivers/gpu/drm/amd/amdgpu/amdgpu.h
>>>>> index ddd472e56f69..24501d3fbefe 100644
>>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu.h
>>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu.h
>>>>> @@ -752,6 +752,7 @@ typedef void (*amdgpu_block_wreg_t)(struct
>>>>> amdgpu_device*, uint32_t, uint32_t, u
>>>>> struct amdgpu_mmio_remap {
>>>>> u32 reg_offset;
>>>>> resource_size_t bus_addr;
>>>>> + struct amdgpu_bo *bo;
>>>>> };
>>>>>
>>>>> /* Define the HW IP blocks will be used in driver , add more if
>>>>> necessary */
>>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
>>>>> b/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
>>>>> index 1a68ba17a62d..0d03e3a6f92d 100644
>>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
>>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
>>>>> @@ -1854,6 +1854,87 @@ static void amdgpu_ttm_pools_fini(struct
>>>>> amdgpu_device *adev)
>>>>> adev->mman.ttm_pools = NULL;
>>>>> }
>>>>>
>>>>> +/**
>>>>> + * amdgpu_ttm_mmio_remap_bo_init - Allocate the singleton 4K MMIO_REMAP
>>>>> BO
>>>>> + * @adev: amdgpu device
>>>>> + *
>>>>> + * Allocates a one-page (4K) GEM BO in AMDGPU_GEM_DOMAIN_MMIO_REMAP when
>>>>> the
>>>>> + * hardware exposes a remap base (adev->rmmio_remap.bus_addr) and the
>>>>> host
>>>>> + * PAGE_SIZE is <= AMDGPU_GPU_PAGE_SIZE (4K). The BO is created as a
>>>>> regular
>>>>> + * GEM object (amdgpu_bo_create).
>>>>> + *
>>>>> + * Return:
>>>>> + * * 0 on success or intentional skip (feature not present/unsupported)
>>>>> + * * negative errno on allocation failure
>>>>> + */
>>>>> +static int amdgpu_ttm_mmio_remap_bo_init(struct amdgpu_device *adev)
>>>>> +{
>>>>> + struct amdgpu_bo_param bp;
>>>>> + int r;
>>>>
>>>>> + void *kptr;
>>>>
>>>> kptr should potentially be saved in amdgpu_mmio_remap.
>>>>
>>>>> +
>>>>> + /* Skip if HW doesn't expose remap, or if PAGE_SIZE >
>>>>> AMDGPU_GPU_PAGE_SIZE (4K). */
>>>>> + if (!adev->rmmio_remap.bus_addr || PAGE_SIZE > AMDGPU_GPU_PAGE_SIZE)
>>>>> + return 0;
>>>>> +
>>>>> + memset(&bp, 0, sizeof(bp));
>>>>> +
>>>>> + /* Create exactly one GEM BO in the MMIO_REMAP domain. */
>>>>> + bp.type = ttm_bo_type_device; /* userspace-mappable
>>>>> GEM */
>>>>> + bp.size = AMDGPU_GPU_PAGE_SIZE; /* 4K */
>>>>> + bp.byte_align = AMDGPU_GPU_PAGE_SIZE;
>>>>> + bp.domain = AMDGPU_GEM_DOMAIN_MMIO_REMAP;
>>>>> + bp.flags = 0;
>>>>> + bp.resv = NULL;
>>>>> + bp.bo_ptr_size = sizeof(struct amdgpu_bo);
>>>>> +
>>>>> + r = amdgpu_bo_create(adev, &bp, &adev->rmmio_remap.bo);
>>>>> + if (r)
>>>>> + return r;
>>>>> +
>>>>> + r = amdgpu_bo_reserve(adev->rmmio_remap.bo, false);
>>>>
>>>> The last parameter should probably be true here.
>>>>
>>>>> + if (r)
>>>>> + goto err_unref;
>>>>> +
>>>>> + r = amdgpu_bo_pin(adev->rmmio_remap.bo,
>>>>> AMDGPU_GEM_DOMAIN_MMIO_REMAP);
>>>>> + if (r)
>>>>> + goto err_unres;
>>>>> +
>>>>> + r = amdgpu_bo_kmap(adev->rmmio_remap.bo, &kptr);
>>>
>>> Can't we just skip this? We don't need the CPU address in the kernel.
>>
>> I thought you suggested to use the remapped HDP registers for the HDP flush
>> in the kernel as well?
>>
>> If we don't want to do this we can just skip this.
>
> In the kernel we just use the existing mmio memory map via the WREG()
> macros. Using this other buffer would just complicate things.
Ok in this case I misunderstood you. @Srini please remove the kmap again.
Thanks,
Christian.
>
> Alex
>
>>
>> Christian.
>>
>>>
>>> Alex
>>>
>>>>> + if (r)
>>>>> + goto err_unpin;
>>>>> +
>>>>> + amdgpu_bo_kunmap(adev->rmmio_remap.bo);
>>>>> + amdgpu_bo_unreserve(adev->rmmio_remap.bo);
>>>>> + return 0;
>>>>> +
>>>>> +err_unpin:
>>>>> + amdgpu_bo_unpin(adev->rmmio_remap.bo);
>>>>> +err_unres:
>>>>> + amdgpu_bo_unreserve(adev->rmmio_remap.bo);
>>>>> +err_unref:
>>>>> + amdgpu_bo_unref(&adev->rmmio_remap.bo);
>>>>> + adev->rmmio_remap.bo = NULL;
>>>>> + return r;
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * amdgpu_ttm_mmio_remap_bo_fini - Free the singleton MMIO_REMAP BO
>>>>> + * @adev: amdgpu device
>>>>> + *
>>>>> + * Frees the kernel-owned MMIO_REMAP BO if it was allocated by
>>>>> + * amdgpu_ttm_mmio_remap_bo_init().
>>>>> + */
>>>>> +static void amdgpu_ttm_mmio_remap_bo_fini(struct amdgpu_device *adev)
>>>>> +{
>>>>> + if (!amdgpu_bo_reserve(adev->rmmio_remap.bo, false)) {
>>>>
>>>> Same here.
>>>>
>>>> Apart from that looks good to me, feel free to add my rb.
>>>>
>>>> Regards,
>>>> Christian.
>>>>
>>>>> + amdgpu_bo_unpin(adev->rmmio_remap.bo);
>>>>> + amdgpu_bo_unreserve(adev->rmmio_remap.bo);
>>>>> + }
>>>>> + amdgpu_bo_unref(&adev->rmmio_remap.bo);
>>>>> + adev->rmmio_remap.bo = NULL;
>>>>> +}
>>>>> +
>>>>> /*
>>>>> * amdgpu_ttm_init - Init the memory management (ttm) as well as various
>>>>> * gtt/vram related fields.
>>>>> @@ -2028,6 +2109,11 @@ int amdgpu_ttm_init(struct amdgpu_device *adev)
>>>>> return r;
>>>>> }
>>>>>
>>>>> + /* Allocate the singleton MMIO_REMAP BO (4K) if supported */
>>>>> + r = amdgpu_ttm_mmio_remap_bo_init(adev);
>>>>> + if (r)
>>>>> + return r;
>>>>> +
>>>>> /* Initialize preemptible memory pool */
>>>>> r = amdgpu_preempt_mgr_init(adev);
>>>>> if (r) {
>>>>> @@ -2091,6 +2177,7 @@ void amdgpu_ttm_fini(struct amdgpu_device *adev)
>>>>> amdgpu_bo_free_kernel(&adev->mman.sdma_access_bo, NULL,
>>>>> &adev->mman.sdma_access_ptr);
>>>>>
>>>>> + amdgpu_ttm_mmio_remap_bo_fini(adev);
>>>>> amdgpu_ttm_fw_reserve_vram_fini(adev);
>>>>> amdgpu_ttm_drv_reserve_vram_fini(adev);
>>>>>
>>>>
>>