Best Regards
Dennis Li
*From:* Koenig, Christian <christian.koe...@amd.com>
*Sent:* Thursday, March 18, 2021 4:59 PM
*To:* Li, Dennis <dennis...@amd.com>; amd-gfx@lists.freedesktop.org;
Deucher, Alexander <alexander.deuc...@amd.com>; Kuehling, Felix
<felix.kuehl...@amd.com>; Zhang, Hawking <hawking.zh...@amd.com>
*Subject:* RE: [PATCH 0/4] Refine GPU recovery sequence to enhance
its stability
Exactly that's what you don't seem to understand.
The GPU reset doesn't complete the fences we wait for. It only
completes the hardware fences as part of the reset.
So waiting for a fence while holding the reset lock is illegal and
needs to be avoided.
Lockdep also complains about this when it is used correctly. The only
reason it doesn't complain here is that you use an open-coded
atomic+wait_event instead of a proper locking primitive.
Regards,
Christian.
------------------------------------------------------------------------
*From:* Li, Dennis <dennis...@amd.com>
*Sent:* Thursday, March 18, 2021 9:28 AM
*To:* Koenig, Christian <christian.koe...@amd.com>;
amd-gfx@lists.freedesktop.org; Deucher, Alexander
<alexander.deuc...@amd.com>; Kuehling, Felix
<felix.kuehl...@amd.com>; Zhang, Hawking <hawking.zh...@amd.com>
*Subject:* RE: [PATCH 0/4] Refine GPU recovery sequence to enhance
its stability
>>> Those two steps need to be exchanged, otherwise it is possible
that new delayed work items etc. are started before the lock is taken.
What about adding a check for adev->in_gpu_reset in the work items? If
the two steps are exchanged, it may introduce a deadlock. For
example, if a user thread holds the read lock and is waiting for a
fence, and the recovery thread tries to take the write lock before
completing the fences, the recovery thread will be blocked forever.
Best Regards
Dennis Li
-----Original Message-----
From: Koenig, Christian <christian.koe...@amd.com>
Sent: Thursday, March 18, 2021 3:54 PM
To: Li, Dennis <dennis...@amd.com>; amd-gfx@lists.freedesktop.org;
Deucher, Alexander <alexander.deuc...@amd.com>; Kuehling, Felix
<felix.kuehl...@amd.com>; Zhang, Hawking <hawking.zh...@amd.com>
Subject: Re: [PATCH 0/4] Refine GPU recovery sequence to enhance its
stability
Am 18.03.21 um 08:23 schrieb Dennis Li:
> We have defined two variables, in_gpu_reset and reset_sem, in the
adev object. The atomic variable in_gpu_reset is used to prevent the
recovery thread from re-entering and to make the lower-level functions
return earlier once recovery starts, but it cannot block other
threads while the recovery thread accesses the hardware. The r/w
semaphore reset_sem is used to solve the synchronization issues
between the recovery thread and the other threads.
>
> The original solution locked register access in the lower-level
functions, which introduces the following issues:
>
> 1) Many lower-level functions are used by both the recovery thread
and other threads. Firstly, we must identify all of these functions,
and it is easy to miss some. Secondly, each of these functions needs
to select which lock (read lock or write lock) to take according to
the thread context it runs in; if the thread context isn't
considered, the added lock can easily introduce a deadlock. Besides
that, developers will often forget to add locks to new functions.
>
> 2) Performance drop, because the lower-level functions are called very frequently.
>
> 3) It easily introduces false-positive lockdep complaints, because
the write lock covers a large range in the recovery thread, while the
low-level functions holding the read lock may also be protected by
other locks in other threads.
>
> Therefore the new solution tries to add lock protection at the kfd
ioctl entry points instead. Its goal is that no threads except the
recovery thread or its children (for xgmi) access the hardware during
GPU reset and resume. The recovery thread is therefore refined as follows:
>
> Step 0: atomic_cmpxchg(&adev->in_gpu_reset, 0, 1)
> 1) If it fails, a recovery thread is already running and the
current thread exits directly;
> 2) If it succeeds, enter the recovery path;
>
> Step 1: Cancel all delayed work items, stop the drm scheduler,
complete all outstanding fences and so on. This tries to stop or
pause the other threads.
>
> Step 2: Call down_write(&adev->reset_sem) to take the write lock,
which blocks the recovery thread until the other threads release
their read locks.
Those two steps need to be exchanged, otherwise it is possible that
new delayed work items etc. are started before the lock is taken.
Just to make it clear: until this is fixed, the whole patch set is a NAK.
Regards,
Christian.
>
> Step 3: Normally, only the recovery thread is now accessing the
hardware, so it is safe to do the GPU reset.
>
> Step 4: Do the post-GPU-reset work, such as calling all IPs' resume functions;
>
> Step 5: Atomically set adev->in_gpu_reset to 0, wake up the other
threads and release the write lock. The recovery thread exits normally.
>
> Other threads call amdgpu_read_lock to synchronize with the
recovery thread. If a thread finds that in_gpu_reset is 1, it
releases the read lock if it holds one, and then blocks to wait for
the recovery-finished event. If the thread successfully holds the
read lock and in_gpu_reset is 0, it continues; it will then exit
normally or be stopped by the recovery thread in step 1.
>
> Dennis Li (4):
> drm/amdgpu: remove reset lock from low level functions
> drm/amdgpu: refine the GPU recovery sequence
> drm/amdgpu: instead of using down/up_read directly
> drm/amdkfd: add reset lock protection for kfd entry functions
>
> drivers/gpu/drm/amd/amdgpu/amdgpu.h | 6 +
> drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c | 14 +-
> drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 173 +++++++++++++-----
> .../gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c | 8 -
> drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c | 4 +-
> drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 9 +-
> drivers/gpu/drm/amd/amdgpu/mxgpu_ai.c | 5 +-
> drivers/gpu/drm/amd/amdgpu/mxgpu_nv.c | 5 +-
> drivers/gpu/drm/amd/amdkfd/kfd_chardev.c | 172 ++++++++++++++++-
> drivers/gpu/drm/amd/amdkfd/kfd_priv.h | 3 +-
> drivers/gpu/drm/amd/amdkfd/kfd_process.c | 4 +
> .../amd/amdkfd/kfd_process_queue_manager.c | 17 ++
> 12 files changed, 345 insertions(+), 75 deletions(-)
>