[AMD Official Use Only] Hello Christian,
Thank you for your comment. Others have suggested that it isn't necessary to keep checking such a low probability issue and we may let it fail if we continuously hit this condition. So I've sent a v3 patch which change the while loop into a one-time if condition. Could you review that version? I believe you could find it in another review request email. Thank you. Yubiao Wang -----Original Message----- From: Koenig, Christian <christian.koe...@amd.com> Sent: Tuesday, June 29, 2021 7:16 PM To: Wang, YuBiao <yubiao.w...@amd.com>; amd-gfx@lists.freedesktop.org Cc: Grodzovsky, Andrey <andrey.grodzov...@amd.com>; Quan, Evan <evan.q...@amd.com>; Chen, Horace <horace.c...@amd.com>; Tuikov, Luben <luben.tui...@amd.com>; Deucher, Alexander <alexander.deuc...@amd.com>; Xiao, Jack <jack.x...@amd.com>; Zhang, Hawking <hawking.zh...@amd.com>; Liu, Monk <monk....@amd.com>; Xu, Feifei <feifei...@amd.com>; Wang, Kevin(Yang) <kevin1.w...@amd.com> Subject: Re: [PATCH 1/1] drm/amdgpu: Read clock counter via MMIO to reduce delay Am 29.06.21 um 11:47 schrieb YuBiao Wang: > [Why] > GPU timing counters are read via KIQ under sriov, which will introduce > a delay. > > [How] > It could be directly read by MMIO. > > v2: Add additional check to prevent carryover issue. > > Signed-off-by: YuBiao Wang <yubiao.w...@amd.com> > --- > drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c | 13 +++++++++++-- > 1 file changed, 11 insertions(+), 2 deletions(-) > > diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c > b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c > index ff7e9f49040e..191b9e3ee3ea 100644 > --- a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c > +++ b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c > @@ -7610,6 +7610,7 @@ static int gfx_v10_0_soft_reset(void *handle) > static uint64_t gfx_v10_0_get_gpu_clock_counter(struct amdgpu_device *adev) > { > uint64_t clock; > + uint64_t clock_lo, clock_hi, hi_check; > > amdgpu_gfx_off_ctrl(adev, false); > mutex_lock(&adev->gfx.gpu_clock_mutex); > @@ -7620,8 +7621,16 @@ static uint64_t gfx_v10_0_get_gpu_clock_counter(struct > amdgpu_device *adev) > ((uint64_t)RREG32_SOC15(SMUIO, 0, > mmGOLDEN_TSC_COUNT_UPPER_Vangogh) << 32ULL); > break; > default: > - clock = (uint64_t)RREG32_SOC15(SMUIO, 0, > mmGOLDEN_TSC_COUNT_LOWER) | > - ((uint64_t)RREG32_SOC15(SMUIO, 0, > mmGOLDEN_TSC_COUNT_UPPER) << 32ULL); > + clock_hi = RREG32_SOC15_NO_KIQ(SMUIO, 0, > mmGOLDEN_TSC_COUNT_UPPER); > + clock_lo = RREG32_SOC15_NO_KIQ(SMUIO, 0, > mmGOLDEN_TSC_COUNT_LOWER); > + hi_check = RREG32_SOC15_NO_KIQ(SMUIO, 0, > mmGOLDEN_TSC_COUNT_UPPER); > + // If carry happens, continuously read until no carry happens > + while (hi_check != clock_hi) { > + clock_lo = RREG32_SOC15_NO_KIQ(SMUIO, 0, > mmGOLDEN_TSC_COUNT_LOWER); > + clock_hi = hi_check; > + hi_check = RREG32_SOC15_NO_KIQ(SMUIO, 0, > mmGOLDEN_TSC_COUNT_UPPER); > + } This could be refined into: do { clock_hi =READ_... clock_lo = READ_.... } while (unlikely(clock_hi != READ_....)) Apart from that looks like a good idea to me. Regards, Christian. > + clock = (uint64_t)clock_lo | ((uint64_t)clock_hi << 32ULL); > break; > } > mutex_unlock(&adev->gfx.gpu_clock_mutex); _______________________________________________ amd-gfx mailing list amd-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/amd-gfx