回复: 回复: [RFC] a new approach to detect which ring is the real black sheep upon TDR reported

2021-02-27 Thread Liu, Monk
[AMD Official Use Only - Internal Distribution Only] >> So gfx job hangs because it has a dependency on buggy compute job which >> already is hanging ? No, there is no dependency between this gfx job and that compute job from a software perspective , but the CU is shared thus gfx is affected by

Re: 回复: [RFC] a new approach to detect which ring is the real black sheep upon TDR reported

2021-02-27 Thread Andrey Grodzovsky
On 2021-02-26 10:56 p.m., Liu, Monk wrote: [AMD Official Use Only - Internal Distribution Only] H Andrey The scenario I hit here is not the one you mentioned, let me explain it with more details by another much easier understood example: Consider ring you have a job1 on KCQ, but the timeo

[no subject]

2021-02-27 Thread CCF_100
get 058960 ___ amd-gfx mailing list amd-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/amd-gfx

Re: 回复: [RFC] a new approach to detect which ring is the real black sheep upon TDR reported

2021-02-27 Thread Christian König
Am 27.02.21 um 04:50 schrieb Liu, Monk: [AMD Official Use Only - Internal Distribution Only] the code I pasted is to illustrate why the innocent job is already taken out in the mirror list thus my suggested proposal won’t work unless we don’t delete the job in sched_job_timeout() routine, an

Re: [bug] tab crash on media playback

2021-02-27 Thread Christian König
Am 26.02.21 um 18:31 schrieb Cory Bolar: On 2/26/21 11:15 AM, Nirmoy wrote: Please try this[1] debug patch and let us know if you see anything interesting in dmesg. [1] https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnirmoy%2Flinux%2Fcommit%2F72c7a7e16284b309