[AMD Official Use Only - Internal Distribution Only]
>> So the gfx job hangs because it has a dependency on a buggy compute job which
>> is already hanging?
No, there is no dependency between this gfx job and that compute job from a
software perspective, but the CU is shared, so gfx is affected by
On 2021-02-26 10:56 p.m., Liu, Monk wrote:
Hi Andrey,
The scenario I hit here is not the one you mentioned. Let me explain
it in more detail with another, much simpler example:
Consider ring you have a job1 on KCQ, but the timeo
___
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx
On 27.02.21 at 04:50, Liu, Monk wrote:
The code I pasted illustrates why the innocent job has already been
taken out of the mirror list, so my proposal won’t work
unless we stop deleting the job in the sched_job_timeout() routine, an
On 26.02.21 at 18:31, Cory Bolar wrote:
On 2/26/21 11:15 AM, Nirmoy wrote:
Please try this[1] debug patch and let us know if you see anything
interesting in dmesg.
[1]
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnirmoy%2Flinux%2Fcommit%2F72c7a7e16284b309