RE: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-21 Thread Huang, Trigger
> >Regards >Sunil K From: Khatri, Sunil Sent: Wednesday, August 21, 2024 12:54 AM To: Alex Deucher Cc: Huang, Trigger ; amd-gfx@lists.freedesktop.org; Deucher, Alexander Subject: Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo On 8/20/2024 9:31 PM, Alex Deuche

Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-20 Thread Khatri, Sunil
: Huang, Trigger ; amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander Subject: Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo On 8/19/2024 3:23 PM, trigger.hu...@amd.com wrote: From: Trigger Huang Do the coredump immediately after a job timeout to get a closer representation

Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-20 Thread Alex Deucher
August 19, 2024 6:31 PM > >> To: Huang, Trigger ; amd-gfx@lists.freedesktop.org > >> Cc: Deucher, Alexander > >> Subject: Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job > >> tmo > >> > >> > >> On 8/19/2024 3:23 PM, trigger

Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-20 Thread Khatri, Sunil
/amdgpu: Do core dump immediately when job tmo On 8/19/2024 3:23 PM, trigger.hu...@amd.com wrote: From: Trigger Huang Do the coredump immediately after a job timeout to get a closer representation of GPU's error status. V2: This will skip printing vram_lost as the GPU reset is not happene

Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-20 Thread Alex Deucher
Original Message- > >>> From: Khatri, Sunil > >>> Sent: Monday, August 19, 2024 6:31 PM > >>> To: Huang, Trigger ; amd-gfx@lists.freedesktop.org > >>> Cc: Deucher, Alexander > >>> Subject: Re: [PATCH 2/2] drm/amdgpu: Do core dump

Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-20 Thread Khatri, Sunil
Cc: Deucher, Alexander Subject: Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo On 8/19/2024 3:23 PM, trigger.hu...@amd.com wrote: From: Trigger Huang Do the coredump immediately after a job timeout to get a closer representation of GPU's error status. V2: This will

Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-20 Thread Alex Deucher
op.org > > Cc: Deucher, Alexander > > Subject: Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job > > tmo > > > > > > On 8/19/2024 3:23 PM, trigger.hu...@amd.com wrote: > > > From: Trigger Huang > > > > > > Do the cor

RE: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-20 Thread Huang, Trigger
> -Original Message- > From: Khatri, Sunil > Sent: Monday, August 19, 2024 6:31 PM > To: Huang, Trigger ; amd-gfx@lists.freedesktop.org > Cc: Deucher, Alexander > Subject: Re: [PATCH 2/2] drm/amdgpu: Do core dum

Re: [PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-19 Thread Khatri, Sunil
On 8/19/2024 3:23 PM, trigger.hu...@amd.com wrote: From: Trigger Huang Do the coredump immediately after a job timeout to get a closer representation of the GPU's error status. V2: This will skip printing vram_lost as the GPU reset has not happened yet (Alex) V3: Unconditionally call the core du

[PATCH 2/2] drm/amdgpu: Do core dump immediately when job tmo

2024-08-19 Thread Trigger.Huang
From: Trigger Huang Do the coredump immediately after a job timeout to get a closer representation of the GPU's error status. V2: This will skip printing vram_lost as the GPU reset has not happened yet (Alex) V3: Unconditionally call the core dump as we care about all the reset functions (soft-recove