[PATCH v8 4/6] drm/xe/guc: Move xe_hw_engine_snapshot creation back to xe_hw_engine.c

2025-02-13 Thread Alan Previn
to respect "xe_hw_engine" namespace (Rodrigo) Signed-off-by: Alan Previn Reviewed-by: Zhanjun Dong --- drivers/gpu/drm/xe/xe_devcoredump.c | 2 +- drivers/gpu/drm/xe/xe_guc_capture.c | 30 --- drivers/gpu/drm/xe/xe_guc_capture.h | 1 - drivers/gpu/drm/xe/x

[PATCH v8 5/6] drm/xe/xe_hw_engine: Update xe_hw_engine capture for debugfs/gt_reset

2025-02-13 Thread Alan Previn
manual captures that were attached to a job vs late, raw jobless ones. v8:- Rename the enum xe_guc_capture_snapshot_source to xe_engine_capture_source to match the defines (Matthew Brost/John Harrison). - Minor patch header comment improvement. (Alan Previn) v7:- Fix mis

[PATCH v8 3/6] drm/xe/guc: Split engine state print between xe_hw_engine vs xe_guc_capture

2025-02-13 Thread Alan Previn
node's register list. v7: - Fix function name to respect "xe_hw_engine" name space. (Rodrigo) - Remove additional newline in engine dump (Jose Souza) + ensure changes didn't break mesa's aubinator tool (Rodrigo) Signed-off-by: Alan Previn

[PATCH v8 6/6] drm/xe/guc: Update comments on GuC-Err-Capture flows

2025-02-13 Thread Alan Previn
Update the comments on GuC-Err-Capture flows with the updated function names. Signed-off-by: Alan Previn Reviewed-by: Zhanjun Dong --- drivers/gpu/drm/xe/xe_guc_capture.c | 42 - 1 file changed, 29 insertions(+), 13 deletions(-) diff --git a/drivers/gpu/drm/xe

[PATCH v8 0/6] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2025-02-13 Thread Alan Previn
ensures 'xe_devcoredump_snapshot' owns 'xe_hw_engine_snapshot generation' and the latter owns 'xe_guc_capture_snapshot' retrieval (with GuC-Error-Capture as its helper). Alan Previn (6): drm/xe/guc: Rename __guc_capture_parsed_output drm/xe/guc: Don't store cap

[PATCH v8 1/6] drm/xe/guc: Rename __guc_capture_parsed_output

2025-02-13 Thread Alan Previn
yright header fix in xe_guc_capture_snapshot_types.h (Rodrigo) Signed-off-by: Alan Previn Reviewed-by: Rodrigo Vivi Reviewed-by: Zhanjun Dong --- drivers/gpu/drm/xe/xe_devcoredump_types.h | 2 +- drivers/gpu/drm/xe/xe_guc_capture.c | 83 ++- drivers/gpu/drm/xe/xe_guc_capt

[PATCH v8 2/6] drm/xe/guc: Don't store capture nodes in xe_devcoredump_snapshot

2025-02-13 Thread Alan Previn
anual-sourced capture node is found during xe_hw_engine printing because this can be valid in some code-paths such as for gt-reset events. (John Harrison) Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_devcoredump.c | 3 - drivers/gpu/drm/xe/xe_devcoredump_types.h

Re: [PATCH v6 2/6] drm/xe/guc: Don't store capture nodes in xe_devcoredump_snapshot

2025-02-12 Thread Teres Alexis, Alan Previn
alan: I will respin this rev with the changes mentioned below - thanks Zhanjun for the time in reviewing this. :) On Mon, 2025-02-10 at 18:41 -0500, Dong, Zhanjun wrote: > See my comments inline below. > > Regards, > Zhanjun Dong > > On 2025-01-28 1:36 p.m., Alan Previn

Re: [PATCH v7 6/6] drm/xe/guc: Update comments on GuC-Err-Capture flows

2025-02-12 Thread Teres Alexis, Alan Previn
On Tue, 2025-02-11 at 18:09 -0500, Dong, Zhanjun wrote: > > > alan:snip > >    * > > - * User Sysfs / Debugfs > > - * > > - *  --> xe_devcoredump_read-> > > + * User Devcoredump Sysfs > > + * -- > > + *  --> xe_devcoredump_read-> (user cats devcore

Re: [PATCH v7 3/6] drm/xe/guc: Split engine state print between xe_hw_engine vs xe_guc_capture

2025-02-12 Thread Teres Alexis, Alan Previn
On Tue, 2025-02-11 at 17:48 -0500, Dong, Zhanjun wrote: > > > On 2025-02-10 6:32 p.m., Alan Previn wrote: > > Relocate the xe_engine_snapshot_print function from xe_guc_capture.c > > into xe_hw_engine.c but split out the GuC-Err-Capture register printing > > portion

Re: [PATCH V9] drm/xe/uapi: Use hint for guc to set GT frequency

2025-02-12 Thread Teres Alexis, Alan Previn
On Wed, 2025-02-12 at 17:48 +0530, Tejas Upadhyay wrote: > Allow user to provide a low latency hint. When set, KMD sends a hint > to GuC which results in special handling for that process. SLPC will > ramp the GT frequency aggressively every time it switches to this > process. > > We need to enabl

Re: [PATCH v7 5/6] drm/xe/xe_hw_engine: Update xe_hw_engine capture for debugfs/gt_reset

2025-02-11 Thread Teres Alexis, Alan Previn
On Mon, 2025-02-10 at 15:32 -0800, Teres Alexis, Alan Previn wrote: > xe_hw_engine_print is called by debugfs to do an immediate raw > --- a/drivers/gpu/drm/xe/xe_guc_capture_snapshot_types.h > +++ b/drivers/gpu/drm/xe/xe_guc_capture_snapshot_types.h > @@ -12,7 +12,11 @@ >  stru

[PATCH v7 4/6] drm/xe/guc: Move xe_hw_engine_snapshot creation back to xe_hw_engine.c

2025-02-10 Thread Alan Previn
to respect "xe_hw_engine" namespace (Rodrigo) Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_devcoredump.c | 2 +- drivers/gpu/drm/xe/xe_guc_capture.c | 30 --- drivers/gpu/drm/xe/xe_guc_capture.h | 1 - drivers/gpu/drm/xe/xe_hw_engine.c | 38 ++

[PATCH v7 0/6] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2025-02-10 Thread Alan Previn
ensures 'xe_devcoredump_snapshot' owns 'xe_hw_engine_snapshot generation' and the latter owns 'xe_guc_capture_snapshot' retrieval (with GuC-Error-Capture as its helper). Alan Previn (6): drm/xe/guc: Rename __guc_capture_parsed_output drm/xe/guc: Don't store cap

[PATCH v7 5/6] drm/xe/xe_hw_engine: Update xe_hw_engine capture for debugfs/gt_reset

2025-02-10 Thread Alan Previn
bless. v7:- Fix mismatch func name vs comment (kernel robot) - Differentiate between early manual captures that have a job association vs raw manual captures that may not have a job association like in gt-reset events. (John Harrison). Signed-off-by: Alan Previn --- driver

[PATCH v7 3/6] drm/xe/guc: Split engine state print between xe_hw_engine vs xe_guc_capture

2025-02-10 Thread Alan Previn
node's register list. v7: - Fix function name to respect "xe_hw_engine" name space. (Rodrigo) - Remove additional newline in engine dump (Jose Souza) + ensure changes didn't break mesa's aubinator tool (Rodrigo) Signed-off-by: Alan Previn

[PATCH v7 6/6] drm/xe/guc: Update comments on GuC-Err-Capture flows

2025-02-10 Thread Alan Previn
Update the comments on GuC-Err-Capture flows with the updated function names. Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_guc_capture.c | 19 +++ 1 file changed, 11 insertions(+), 8 deletions(-) diff --git a/drivers/gpu/drm/xe/xe_guc_capture.c b/drivers/gpu/drm/xe

[PATCH v7 2/6] drm/xe/guc: Don't store capture nodes in xe_devcoredump_snapshot

2025-02-10 Thread Alan Previn
igned-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_devcoredump.c | 3 - drivers/gpu/drm/xe/xe_devcoredump_types.h | 6 - drivers/gpu/drm/xe/xe_guc_capture.c | 153 ++ drivers/gpu/drm/xe/xe_guc_capture.h | 9 +- drivers/gpu/drm/xe/xe_guc_submit.c

[PATCH v7 1/6] drm/xe/guc: Rename __guc_capture_parsed_output

2025-02-10 Thread Alan Previn
yright header fix in xe_guc_capture_snapshot_types.h (Rodrigo) Signed-off-by: Alan Previn Reviewed-by: Rodrigo Vivi Reviewed-by: Zhanjun Dong --- drivers/gpu/drm/xe/xe_devcoredump_types.h | 2 +- drivers/gpu/drm/xe/xe_guc_capture.c | 83 ++- drivers/gpu/drm/xe/xe_guc_capt

Re: [PATCH v6 3/6] drm/xe/guc: Split engine state print between xe_hw_engine vs xe_guc_capture

2025-02-10 Thread Teres Alexis, Alan Previn
On Fri, 2025-01-31 at 10:55 -0800, Teres Alexis, Alan Previn wrote: > On Thu, 2025-01-30 at 17:42 -0500, Vivi, Rodrigo wrote: > > On Tue, Jan 28, 2025 at 10:36:49AM -0800, Alan Previn wrote: > > > > > alan:snip > > > -   if (!snapshot->matched_no

Re: [PATCH v6 3/6] drm/xe/guc: Split engine state print between xe_hw_engine vs xe_guc_capture

2025-01-31 Thread Teres Alexis, Alan Previn
On Thu, 2025-01-30 at 17:42 -0500, Vivi, Rodrigo wrote: > On Tue, Jan 28, 2025 at 10:36:49AM -0800, Alan Previn wrote: > > Relocate the xe_engine_snapshot_print function from xe_guc_capture.c > > into xe_hw_engine.c but split out the GuC-Err-Capture register printing > >

Re: [PATCH v6 4/6] drm/xe/guc: Move xe_hw_engine_snapshot creation back to xe_hw_engine.c

2025-01-31 Thread Teres Alexis, Alan Previn
On Thu, 2025-01-30 at 17:43 -0500, Vivi, Rodrigo wrote: > On Tue, Jan 28, 2025 at 10:36:50AM -0800, Alan Previn wrote: alan:snip > > @@ -55,8 +55,7 @@ void xe_hw_engine_handle_irq(struct xe_hw_engine *hwe, > > u16 intr_vec); > >  void xe_hw_engine_enable_ring(struct xe_hw_e

Re: [PATCH v6 1/6] drm/xe/guc: Rename __guc_capture_parsed_output

2025-01-31 Thread Teres Alexis, Alan Previn
> > +++ b/drivers/gpu/drm/xe/xe_guc_capture_snapshot_types.h > > @@ -0,0 +1,53 @@ > > +/* SPDX-License-Identifier: MIT */ > > +/* > > + * Copyright © 2021-2024 Intel Corporation > > 2025 > > then > > Reviewed-by: Rodrigo Vivi > will do - thanks

Re: [PATCH v6 2/6] drm/xe/guc: Don't store capture nodes in xe_devcoredump_snapshot

2025-01-30 Thread Teres Alexis, Alan Previn
On Tue, 2025-01-28 at 10:36 -0800, Teres Alexis, Alan Previn wrote: > GuC-Err-Capture should not be storing register snapshot > nodes directly inside of the top level xe_devcoredump_snapshot > structure that it doesn't control. Furthermore, that is > not right from a driver s

Re: [PATCH v4 0/1] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2025-01-28 Thread Teres Alexis, Alan Previn
Update: new URL with next rev that includes the split -> https://patchwork.freedesktop.org/series/144050/ basically the same squash but i decided to drop some trivial things like comments and name of function/variables. ...alan On Thu, 2025-01-23 at 18:01 +, Teres Alexis, Alan Pre

[PATCH v6 2/6] drm/xe/guc: Don't store capture nodes in xe_devcoredump_snapshot

2025-01-28 Thread Alan Previn
ister snapshot within its internal outlist). Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_devcoredump.c | 3 - drivers/gpu/drm/xe/xe_devcoredump_types.h | 6 - drivers/gpu/drm/xe/xe_guc_capture.c | 153 ++ drivers/gpu/drm/xe/xe_guc_capture.h

[PATCH v6 4/6] drm/xe/guc: Move xe_hw_engine_snapshot creation back to xe_hw_engine.c

2025-01-28 Thread Alan Previn
xe_devcoredump calls xe_engine_snapshot_capture_for_queue() to allocate and populate the xe_hw_engine_snapshot structure. Move that function back into xe_hw_engine.c since it doesn't make sense for GuC-Err-Capture to allocate a structure it doesn't own. Signed-off-by: Alan Previn --

[PATCH v6 1/6] drm/xe/guc: Rename __guc_capture_parsed_output

2025-01-28 Thread Alan Previn
Since '__guc_capture_parsed_output *' is a handle that is retrieved, stored and relinquished by an entity external to GuC (i.e. xe_devcoredump), let's rename it to something formal without the '__' prefix and export it via a header file. Signed-off-by: Alan Previn ---

[PATCH v6 6/6] drm/xe/guc: Update comments on GuC-Err-Capture flows

2025-01-28 Thread Alan Previn
Update the comments on GuC-Err-Capture flows with the updated function names. Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_guc_capture.c | 19 +++ 1 file changed, 11 insertions(+), 8 deletions(-) diff --git a/drivers/gpu/drm/xe/xe_guc_capture.c b/drivers/gpu/drm/xe

[PATCH v6 0/6] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2025-01-28 Thread Alan Previn
ensures 'xe_devcoredump_snapshot' owns 'xe_hw_engine_snapshot generation' and the latter owns 'xe_guc_capture_snapshot' retrieval (with GuC-Error-Capture as its helper). Alan Previn (6): drm/xe/guc: Rename __guc_capture_parsed_output drm/xe/guc: Don't store cap

[PATCH v6 5/6] drm/xe/xe_hw_engine: Update hw_engine_snapshot_capture for debugfs

2025-01-28 Thread Alan Previn
can also invoke GuC-Err-Capture for an immediate jobless snapshot. Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_guc_capture.c | 26 ++ drivers/gpu/drm/xe/xe_guc_capture.h | 2 ++ drivers/gpu/drm/xe/xe_hw_engine.c | 15 +-- 3 files changed, 41

[PATCH v6 3/6] drm/xe/guc: Split engine state print between xe_hw_engine vs xe_guc_capture

2025-01-28 Thread Alan Previn
node's register list. Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_guc_capture.c | 72 + drivers/gpu/drm/xe/xe_guc_capture.h | 4 +- drivers/gpu/drm/xe/xe_hw_engine.c | 28 +++ drivers/gpu/drm/xe/xe_hw_engine.h | 1 + 4 files change

Re: [PATCH v4 0/1] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2025-01-23 Thread Teres Alexis, Alan Previn
On Tue, 2025-01-21 at 18:15 -0500, Vivi, Rodrigo wrote: > On Tue, Jan 21, 2025 at 11:09:34AM -0800, Alan Previn wrote: > > > A 'series' of 1 patch is not a series. Cover letter is not needed. > > However, this patch is the size of a series and it should be > s

[PATCH v4 0/1] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2025-01-21 Thread Alan Previn
, rearranging the layering back to what it should be (xe_devcoredump_snapshot owns xe_hw_engine_snapshot owns xe_guc_capture_snapshot). Alan Previn (1): drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing drivers/gpu/drm/xe/xe_devcoredump.c | 3 - d

[PATCH v4 1/1] drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2025-01-21 Thread Alan Previn
ting manual capture (CI-test) v2: Bail on manual capture when running on a VF (Zhanjun) Signed-off-by: Alan Previn Reviewed-by: Zhanjun Dong --- drivers/gpu/drm/xe/xe_devcoredump.c | 3 - drivers/gpu/drm/xe/xe_devcoredump_types.h | 6 - drivers/gpu/drm/xe/xe_guc_capture.c

[PATCH v3 1/1] drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2024-12-03 Thread Alan Previn
-test) v1 -> v2: - Bail on manual capture when running on a VF (Zhanjun) Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_devcoredump.c | 3 - drivers/gpu/drm/xe/xe_devcoredump_types.h | 6 - drivers/gpu/drm/xe/xe_guc_capture.c | 406 -

[PATCH v3 0/1] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2024-12-03 Thread Alan Previn
, rearranging the layering back to what it should be (xe_devcoredump_snapshot owns xe_hw_engine_snapshot owns xe_guc_capture_snapshot). Alan Previn (1): drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing drivers/gpu/drm/xe/xe_devcoredump.c | 3 - d

[PATCH v2 1/1] drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2024-12-02 Thread Alan Previn
e header) for xe_hw_engine_snapshot to use. 6. Since xe_hw_engine_snapshot can also be called via debugfs without a job, create a new function that does a manual capture of engine registers without any associated job. v1 -> v2: - Bail on manual capture when running on a VF (Zhanju

[PATCH v2 0/1] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2024-12-02 Thread Alan Previn
, rearranging the layering back to what it should be (xe_devcoredump_snapshot owns xe_hw_engine_snapshot owns xe_guc_capture_snapshot). Alan Previn (1): drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing drivers/gpu/drm/xe/xe_devcoredump.c | 3 - d

Re: [PATCH 1/1] drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2024-12-02 Thread Teres Alexis, Alan Previn
On Tue, 2024-11-26 at 12:09 -0500, Dong, Zhanjun wrote: > See my comments inline below: > > Regards, > Zhanjun > > On 2024-11-17 1:44 p.m., Alan Previn wrote: > > The order of the devcoredump event flow is: > > drm-scheduler -> guc-submission-execq-timed-out-j

[PATCH 1/1] drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2024-11-17 Thread Alan Previn
e header) for xe_hw_engine_snapshot to use. 6. Since xe_hw_engine_snapshot can also be called via debugfs without a job, create a new function that does a manual capture of engine registers without any associated job. Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_devcoredump.c

[PATCH 0/1] Maintenence of devcoredump <-> GuC-Err-Capture plumbing

2024-11-17 Thread Alan Previn
, rearranging the layering back to what it should be (xe_devcoredump_snapshot owns xe_hw_engine_snapshot owns xe_guc_capture_snapshot). Alan Previn (1): drm/xe/guc/capture: Maintenence of devcoredump <-> GuC-Err-Capture plumbing drivers/gpu/drm/xe/xe_devcoredump.c | 3 - d

Re: [PATCH v1] drm/i915/guc: Flush ct receive tasklet during reset preparation

2024-11-04 Thread Teres Alexis, Alan Previn
Just some minor nits on header. Otherwise, LGTM: Reviewed-by: Alan Previn On Wed, 2024-10-30 at 15:38 -0700, Zhanjun Dong wrote: > GuC to host communication is interrupt driven, the handling has 3 > parts: interrupt context, tasklet and request queue worker. > During GuC rese

[PATCH v5 1/1] drm/i915/pxp: Add missing tag for Wa_14019159160

2024-10-15 Thread Alan Previn
ude IP_VER 12.71. (Matt Roper) v3: - Check targeted platforms using IP_VAL. (John Harrison) v2: - Fix WA id number (John Harrison). - Improve comments and code to be specific for the targeted platforms (John Harrison) Signed-off-by: Alan Previn --- drivers/gpu/drm/i915/gt/intel

Re: [PATCH v2] drm/i915/guc: Enable PXP GuC autoteardown flow

2024-09-06 Thread Teres Alexis, Alan Previn
LGTM: Reviewed-by: Alan Previn On Fri, 2024-09-06 at 10:40 -0700, john.c.harri...@intel.com wrote: > From: Juston Li > > This feature flag enables GuC autoteardown which allows for a grace > period before session teardown. > > Also add a HAS_PXP() helper to share with th

[PATCH v1 1/1] drm/xe/bo: Fix fixed placement ggtt pinning code

2024-06-21 Thread Alan Previn
When calling xe_bo_create_pin_map_at, use the correct starting offset provided by caller at xe_ggtt_insert_bo_at. Fixes: 44e694958b95 ("drm/xe/display: Implement display support") Signed-off-by: Alan Previn --- drivers/gpu/drm/xe/xe_bo.c | 2 +- 1 file changed, 1 insertion(+),

Re: [PATCH] drm/i915/guc: Correct capture of EIR register on hang

2024-02-27 Thread Teres Alexis, Alan Previn
" }, \ > { GEN8_RING_PDP_UDW(0, 3),  0,  0, "PDP3_UDW" } >   > -#define COMMON_BASE_HAS_EU \ > -   { EIR,  0,  0, "EIR" } > - alan:snip alan: Thanks for catching this one. Reviewed-by: Alan Previn

Re: [PATCH v9 0/2] Resolve suspend-resume racing with GuC destroy-context-worker

2024-01-02 Thread Teres Alexis, Alan Previn
On Wed, 2023-12-27 at 20:55 -0800, Teres Alexis, Alan Previn wrote: > This series is the result of debugging issues root caused to > races between the GuC's destroyed_worker_func being triggered > vs repeating suspend-resume cycles with concurrent delayed > fence signals f

[PATCH v9 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-12-29 Thread Alan Previn
so it can get picked up on the next destroy worker invocation (if suspend aborted) or get fully purged as part of a GuC sanitization (end of suspend) or a reset flow. Signed-off-by: Alan Previn Signed-off-by: Anshuman Gupta Tested-by: Mousumi Jana Acked-by: Daniele Ceraolo Spurio Reviewed-by

[PATCH v9 0/2] Resolve suspend-resume racing with GuC destroy-context-worker

2023-12-27 Thread Alan Previn
_busy_loop, we need to undo i.e. decrement the outstanding_submission_g2h. - Patch #3 In wait_for_suspend, fix checking of return from intel_gt_pm_wait_timeout_for_idle to now use -ETIMEDOUT and add documentation for intel_wakeref_wait_for_idle. (Rodrigo). Ala

[PATCH v9 1/2] drm/i915/guc: Flush context destruction worker at suspend

2023-12-27 Thread Alan Previn
ff-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c | 5 + drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h | 2 ++ drivers/gpu/drm/i915/gt/uc/intel_uc.c | 2 ++ 3 files changed, 9 insertions(+) diff --git a/dr

[PATCH v9 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-12-27 Thread Alan Previn
so it can get picked up on the next destroy worker invocation (if suspend aborted) or get fully purged as part of a GuC sanitization (end of suspend) or a reset flow. Signed-off-by: Alan Previn Signed-off-by: Anshuman Gupta Tested-by: Mousumi Jana Acked-by: Daniele Ceraolo Spurio --- dr

Re: [PATCH v8 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-12-27 Thread Teres Alexis, Alan Previn
On Tue, 2023-12-26 at 10:11 -0500, Vivi, Rodrigo wrote: > On Wed, Dec 20, 2023 at 11:08:59PM +, Teres Alexis, Alan Previn wrote: > > On Wed, 2023-12-13 at 16:23 -0500, Vivi, Rodrigo wrote: alan:snip > > > > > > alan: Thanks Rodrigo for the RB last week, just quick

Re: [PATCH v8 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-12-20 Thread Teres Alexis, Alan Previn
On Wed, 2023-12-13 at 16:23 -0500, Vivi, Rodrigo wrote: > On Tue, Dec 12, 2023 at 08:57:16AM -0800, Alan Previn wrote: > > If we are at the end of suspend or very early in resume > > it's possible an async fence signal (via rcu_call) is triggered > > to free_engines which

[PATCH v8 1/2] drm/i915/guc: Flush context destruction worker at suspend

2023-12-12 Thread Alan Previn
ff-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c | 5 + drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h | 2 ++ drivers/gpu/drm/i915/gt/uc/intel_uc.c | 2 ++ 3 files changed, 9 insertions(+) diff --git a/dr

[PATCH v8 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-12-12 Thread Alan Previn
so it can get picked up on the next destroy worker invocation (if suspend aborted) or get fully purged as part of a GuC sanitization (end of suspend) or a reset flow. Signed-off-by: Alan Previn Signed-off-by: Anshuman Gupta Tested-by: Mousumi Jana Acked-by: Daniele Ceraolo Spurio --- dr

[PATCH v8 0/2] Resolve suspend-resume racing with GuC destroy-context-worker

2023-12-12 Thread Alan Previn
nt the outstanding_submission_g2h. - Patch #3 In wait_for_suspend, fix checking of return from intel_gt_pm_wait_timeout_for_idle to now use -ETIMEDOUT and add documentation for intel_wakeref_wait_for_idle. (Rodrigo). Alan Previn (2): drm/i915/guc: Flush context destruc

Re: [PATCH v7 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-11-30 Thread Teres Alexis, Alan Previn
> As far as i can tell, its only if we started resetting / wedging right after > this > queued worker got started. alan: hope Daniele can proofread my tracing and confirm if I got it right.

Re: [PATCH v7 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-11-30 Thread Teres Alexis, Alan Previn
On Thu, 2023-11-30 at 16:18 -0500, Vivi, Rodrigo wrote: > On Wed, Nov 29, 2023 at 04:20:13PM -0800, Alan Previn wrote: alan:snip > > + > > if (unlikely(disabled)) { > > release_guc_id(guc, ce); > > __guc_context_destroy(

Re: [PATCH v2 1/1] drm/i915/pxp: Add missing tag for Wa_14019159160

2023-11-29 Thread Teres Alexis, Alan Previn
On Wed, 2023-11-29 at 13:13 -0800, Teres Alexis, Alan Previn wrote: > On Mon, 2023-11-27 at 15:24 -0500, Vivi, Rodrigo wrote: > > On Wed, Nov 22, 2023 at 12:30:03PM -0800, Alan Previn wrote: > alan:snip > alan: thanks for reviewing and apologize for replyi

[PATCH v7 0/2] Resolve suspend-resume racing with GuC destroy-context-worker

2023-11-29 Thread Alan Previn
- Patch #3 In wait_for_suspend, fix checking of return from intel_gt_pm_wait_timeout_for_idle to now use -ETIMEDOUT and add documentation for intel_wakeref_wait_for_idle. (Rodrigo). Alan Previn (2): drm/i915/guc: Flush context destruction worker at suspend drm/i915/guc:

[PATCH v7 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-11-29 Thread Alan Previn
so it can get picked up on the next destroy worker invocation (if suspend aborted) or get fully purged as part of a GuC sanitization (end of suspend) or a reset flow. Signed-off-by: Alan Previn Signed-off-by: Anshuman Gupta Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gem/i915_gem_pm.c

[PATCH v7 1/2] drm/i915/guc: Flush context destruction worker at suspend

2023-11-29 Thread Alan Previn
ff-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c | 5 + drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h | 2 ++ drivers/gpu/drm/i915/gt/uc/intel_uc.c | 2 ++ 3 files changed, 9 insertions(+) diff --git a/dr

Re: [PATCH v2 1/1] drm/i915/pxp: Add missing tag for Wa_14019159160

2023-11-29 Thread Teres Alexis, Alan Previn
On Mon, 2023-11-27 at 15:24 -0500, Vivi, Rodrigo wrote: > On Wed, Nov 22, 2023 at 12:30:03PM -0800, Alan Previn wrote: alan:snip alan: thanks for reviewing and apologize for replying to this late. > > /* > > -* On MTL and newer platforms, protected contexts require setting

Re: [Intel-gfx] [PATCH v5] drm/i915/pxp: Add drm_dbgs for critical PXP events.

2023-11-29 Thread Teres Alexis, Alan Previn
On Fri, 2023-11-24 at 08:30 +, Tvrtko Ursulin wrote: > On 22/11/2023 19:15, Alan Previn wrote: alan:snip alan: thanks for reviewing. > > if (iir & GEN12_DISPLAY_STATE_RESET_COMPLETE_INTERRUPT) > > - pxp->session_events |= PXP_TERMINATION_COMPLET

Re: [PATCH v6 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-11-29 Thread Teres Alexis, Alan Previn
On Mon, 2023-11-27 at 16:51 -0500, Vivi, Rodrigo wrote: alan: Firstly, thanks for taking the time to review this, knowing you have a lot on your plate right now. > alan:snip > > @@ -3301,19 +3315,38 @@ static inline void guc_lrc_desc_unpin(struct > > intel_context *ce) > > /* Seal race with

[PATCH v4 1/1] drm/i915/pxp: Add missing tag for Wa_14019159160

2023-11-29 Thread Alan Previn
- Improve comments and code to be specific for the targeted platforms (John Harrison) Signed-off-by: Alan Previn --- drivers/gpu/drm/i915/gt/intel_lrc.c | 8 +--- 1 file changed, 5 insertions(+), 3 deletions(-) diff --git a/drivers/gpu/drm/i915/gt/intel_lrc.c b/drivers/gpu/drm/i915/gt/i

Re: [Intel-gfx] [PATCH v3 1/1] drm/i915/pxp: Add missing tag for Wa_14019159160

2023-11-29 Thread Teres Alexis, Alan Previn
On Tue, 2023-11-28 at 10:03 -0800, Roper, Matthew D wrote: > On Mon, Nov 27, 2023 at 12:11:50PM -0800, Alan Previn wrote: > > Add missing tag for "Wa_14019159160 - Case 2" (for existing > > PXP code that ensures run alone mode bit is set to allow > > PxP-decrypti

[PATCH v3 1/1] drm/i915/pxp: Add missing tag for Wa_14019159160

2023-11-27 Thread Alan Previn
for the targeted platforms (John Harrison) Signed-off-by: Alan Previn --- drivers/gpu/drm/i915/gt/intel_lrc.c | 8 +--- 1 file changed, 5 insertions(+), 3 deletions(-) diff --git a/drivers/gpu/drm/i915/gt/intel_lrc.c b/drivers/gpu/drm/i915/gt/intel_lrc.c index 7c367ba8d9dc..1152cf25d

[PATCH v2 1/1] drm/i915/pxp: Add missing tag for Wa_14019159160

2023-11-22 Thread Alan Previn
Add missing tag for "Wa_14019159160 - Case 2" (for existing PXP code that ensures run alone mode bit is set to allow PxP-decryption). v2: - Fix WA id number (John Harrison). - Improve comments and code to be specific for the targeted platforms (John Harrison) Signed-of

[PATCH v5] drm/i915/pxp: Add drm_dbgs for critical PXP events.

2023-11-22 Thread Alan Previn
emove __func__ since drm_dbg covers that (Jani). - add timeout dbg of the restart from front-end (Alan). Signed-off-by: Alan Previn Reviewed-by: Vivaik Balasubrawmanian --- drivers/gpu/drm/i915/gt/uc/intel_gsc_proxy.c | 2 ++ drivers/gpu/drm/i915/pxp/intel_pxp.c

[PATCH 1/1] drm/i915/pxp: Add missing tag for Wa_14019159160

2023-11-21 Thread Alan Previn
Add missing tag for "Wa_14019159160 - Case 2" (for existing PXP code that ensures run alone mode bit is set to allow PxP-decryption). Signed-off-by: Alan Previn --- drivers/gpu/drm/i915/gt/intel_lrc.c | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/d

Re: [PATCH v1 1/1] drm/i915/pxp: Bail early in pxp tee backend on first teardown error

2023-11-16 Thread Teres Alexis, Alan Previn
On Thu, 2023-11-16 at 15:20 -0800, Teres Alexis, Alan Previn wrote: > For Gen12 when using mei-pxp tee backend transport, if we are coming > up from a cold boot or from a resume (not runtime resume), we can > optionally quicken the very first session cleanup that would occur > as part

[PATCH v1 1/1] drm/i915/pxp: Bail early in pxp tee backend on first teardown error

2023-11-16 Thread Alan Previn
e that to the helper's caller. Signed-off-by: Alan Previn --- drivers/gpu/drm/i915/pxp/intel_pxp.c | 1 + drivers/gpu/drm/i915/pxp/intel_pxp_gsccs.c | 3 ++- drivers/gpu/drm/i915/pxp/intel_pxp_pm.c | 1 + drivers/gpu/drm/i915/pxp/intel_pxp_session.c | 1 + driver

Re: [PATCH v1 1/1] drm/i915/gt: Dont wait forever when idling in suspend

2023-11-14 Thread Teres Alexis, Alan Previn
On Tue, 2023-11-14 at 08:22 -0800, Teres Alexis, Alan Previn wrote: > When suspending, add a timeout when calling > intel_gt_pm_wait_for_idle else if we have a leaked > wakeref (which would be indicative of a bug elsewhere > in the driver), driver will at exit the suspend-resume > c

Re: [Intel-gfx] [PATCH v4 3/3] drm/i915/gt: Timeout when waiting for idle in suspending

2023-11-14 Thread Teres Alexis, Alan Previn
On Tue, 2023-11-14 at 17:52 +, Tvrtko Ursulin wrote: > On 14/11/2023 17:37, Teres Alexis, Alan Previn wrote: > > On Tue, 2023-11-14 at 17:27 +, Tvrtko Ursulin wrote: > > > On 13/11/2023 17:57, Teres Alexis, Alan Previn wrote: > > > > On Wed, 2023-10-25 at 13:

Re: [Intel-gfx] [PATCH v4 3/3] drm/i915/gt: Timeout when waiting for idle in suspending

2023-11-14 Thread Teres Alexis, Alan Previn
On Tue, 2023-11-14 at 12:36 -0500, Vivi, Rodrigo wrote: > On Tue, Nov 14, 2023 at 05:27:18PM +, Tvrtko Ursulin wrote: > > > > On 13/11/2023 17:57, Teres Alexis, Alan Previn wrote: > > > On Wed, 2023-10-25 at 13:58 +0100, Tvrtko Ursulin wrote: > > > > On

Re: [Intel-gfx] [PATCH v4 3/3] drm/i915/gt: Timeout when waiting for idle in suspending

2023-11-14 Thread Teres Alexis, Alan Previn
On Tue, 2023-11-14 at 17:27 +, Tvrtko Ursulin wrote: > On 13/11/2023 17:57, Teres Alexis, Alan Previn wrote: > > On Wed, 2023-10-25 at 13:58 +0100, Tvrtko Ursulin wrote: > > > On 04/10/2023 18:59, Teres Alexis, Alan Previn wrote: > > > > On Thu, 2023-09-28 at 13:

[PATCH v1 1/1] drm/i915/gt: Dont wait forever when idling in suspend

2023-11-14 Thread Alan Previn
instead of hanging in the kernel forever which then requires serial connection or ramoops dump to debug further. Signed-off-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/intel_engine_cs.c | 2 +- drivers/gpu/drm/i915/gt/intel_gt_pm.c | 7

[PATCH v6 2/2] drm/i915/guc: Close deregister-context race against CT-loss

2023-11-14 Thread Alan Previn
so it can get picked up on the next destroy worker invocation (if suspend aborted) or get fully purged as part of a GuC sanitization (end of suspend) or a reset flow. Signed-off-by: Alan Previn Signed-off-by: Anshuman Gupta Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gem/i915_gem_pm.c

[PATCH v6 1/2] drm/i915/guc: Flush context destruction worker at suspend

2023-11-14 Thread Alan Previn
ff-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c | 5 + drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h | 2 ++ drivers/gpu/drm/i915/gt/uc/intel_uc.c | 2 ++ 3 files changed, 9 insertions(+) diff --git a/dr

[PATCH v6 0/2] Resolve suspend-resume racing with GuC destroy-context-worker

2023-11-14 Thread Alan Previn
sion_g2h. - Patch #3 In wait_for_suspend, fix checking of return from intel_gt_pm_wait_timeout_for_idle to now use -ETIMEDOUT and add documentation for intel_wakeref_wait_for_idle. (Rodrigo). Alan Previn (2): drm/i915/guc: Flush context destruction worker at suspend dr

Re: [Intel-gfx] [PATCH v3] drm/i915: Skip pxp init if gt is wedged

2023-11-13 Thread Teres Alexis, Alan Previn
s not -ENODEV (since -ENODEV can happen on the majority of cases with legacy products or with non-PXP kernel configs): Reviewed-by: Alan Previn

Re: [PATCH] drm/i915: Initialize residency registers earlier

2023-11-13 Thread Teres Alexis, Alan Previn
ddress this "lack of a clean mirrored-function design of rpm_get/put across the rc6 startup sequences", here is my rb: Reviewed-by: Alan Previn

Re: [Intel-gfx] [PATCH v4 3/3] drm/i915/gt: Timeout when waiting for idle in suspending

2023-11-13 Thread Teres Alexis, Alan Previn
On Wed, 2023-10-25 at 13:58 +0100, Tvrtko Ursulin wrote: > On 04/10/2023 18:59, Teres Alexis, Alan Previn wrote: > > On Thu, 2023-09-28 at 13:46 +0100, Tvrtko Ursulin wrote: > > > On 27/09/2023 17:36, Teres Alexis, Alan Previn wrote: alan:snip > > > It is not possi

Re: [Intel-gfx] [PATCH] drm/i915: Skip pxp init if gt is wedged

2023-10-31 Thread Teres Alexis, Alan Previn
On Fri, 2023-10-27 at 10:13 +0300, Jani Nikula wrote: > On Thu, 26 Oct 2023, Zhanjun Dong wrote: > alan:snip > I'll note that nobody checks intel_pxp_init() return status, so this > silently skips PXP. > > BR, > Jani. alan:snip > > + if (intel_gt_is_wedged(gt)) > > + return -ENODEV;

[PATCH v5 3/3] drm/i915/gt: Timeout when waiting for idle in suspending

2023-10-13 Thread Alan Previn
counters), instead of hanging in the kernel. Signed-off-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/intel_engine_cs.c | 2 +- drivers/gpu/drm/i915/gt/intel_gt_pm.c | 7 ++- drivers/gpu/drm/i915/gt/intel_gt_pm.h | 7 ++- drivers

[PATCH v5 0/3] Resolve suspend-resume racing with GuC destroy-context-worker

2023-10-13 Thread Alan Previn
t_timeout_for_idle to now use -ETIMEDOUT and add documentation for intel_wakeref_wait_for_idle. (Rodrigo). Alan Previn (3): drm/i915/guc: Flush context destruction worker at suspend drm/i915/guc: Close deregister-context race against CT-loss drm/i915/gt: Timeout when

[PATCH v5 1/3] drm/i915/guc: Flush context destruction worker at suspend

2023-10-13 Thread Alan Previn
ff-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c | 5 + drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h | 2 ++ drivers/gpu/drm/i915/gt/uc/intel_uc.c | 2 ++ 3 files changed, 9 insertions(+) diff --git a/dr

[PATCH v5 2/3] drm/i915/guc: Close deregister-context race against CT-loss

2023-10-13 Thread Alan Previn
so it can get picked up on the next destroy worker invocation (if suspend aborted) or get fully purged as part of a GuC sanitization (end of suspend) or a reset flow. Signed-off-by: Alan Previn Signed-off-by: Anshuman Gupta Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gem/i915_gem_pm.c

Re: [Intel-gfx] [PATCH v4 3/3] drm/i915/gt: Timeout when waiting for idle in suspending

2023-10-04 Thread Teres Alexis, Alan Previn
On Thu, 2023-09-28 at 13:46 +0100, Tvrtko Ursulin wrote: > On 27/09/2023 17:36, Teres Alexis, Alan Previn wrote: > > Thanks for taking the time to review this Tvrtko, replies inline below. alan:snip > > > > > > Main concern is that we need to be sure there are no possi

Re: [PATCH v4 2/3] drm/i915/guc: Close deregister-context race against CT-loss

2023-10-04 Thread Teres Alexis, Alan Previn
On Wed, 2023-10-04 at 06:34 +, Gupta, Anshuman wrote: > > > -Original Message- > > From: Teres Alexis, Alan Previn > @@ -289,6 +289,13 @@ int intel_gt_resume(struct intel_gt *gt) > > > > static void wait_for_suspend(struct intel_gt *gt) { > >

Re: [Intel-gfx] [PATCH v4 3/3] drm/i915/gt: Timeout when waiting for idle in suspending

2023-09-27 Thread Teres Alexis, Alan Previn
Thanks for taking the time to review this Tvrtko, replies inline below. On Wed, 2023-09-27 at 10:02 +0100, Tvrtko Ursulin wrote: > On 26/09/2023 20:05, Alan Previn wrote: > > When suspending, add a timeout when calling > > intel_gt_pm_wait_for_idle else if we have a lost > >

[PATCH v4 3/3] drm/i915/gt: Timeout when waiting for idle in suspending

2023-09-26 Thread Alan Previn
counters), instead of hanging in the kernel. Signed-off-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/intel_engine_cs.c | 2 +- drivers/gpu/drm/i915/gt/intel_gt_pm.c | 6 +- drivers/gpu/drm/i915/gt/intel_gt_pm.h | 7 ++- drivers

[PATCH v4 0/3] Resolve suspend-resume racing with GuC destroy-context-worker

2023-09-26 Thread Alan Previn
out_for_idle to now use -ETIMEDOUT and add documentation for intel_wakeref_wait_for_idle. (Rodrigo). Alan Previn (3): drm/i915/guc: Flush context destruction worker at suspend drm/i915/guc: Close deregister-context race against CT-loss drm/i915/gt: Timeout when waiting f

[PATCH v4 2/3] drm/i915/guc: Close deregister-context race against CT-loss

2023-09-26 Thread Alan Previn
aborted) or get fully purged as part of a GuC sanitization (end of suspend) or a reset flow. Signed-off-by: Alan Previn Signed-off-by: Anshuman Gupta Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/intel_gt_pm.c | 7 ++ .../gpu/drm/i915/gt/uc/intel_guc_submission.c | 81 +

[PATCH v4 1/3] drm/i915/guc: Flush context destruction worker at suspend

2023-09-26 Thread Alan Previn
ff-by: Alan Previn Reviewed-by: Rodrigo Vivi Tested-by: Mousumi Jana --- drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c | 5 + drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h | 2 ++ drivers/gpu/drm/i915/gt/uc/intel_uc.c | 2 ++ 3 files changed, 9 insertions(+) diff --git a/dr

[PATCH v4] drm/i915/pxp: Add drm_dbgs for critical PXP events.

2023-09-19 Thread Alan Previn
overs that (Jani). - add timeout dbg of the restart from front-end (Alan). Signed-off-by: Alan Previn Reviewed-by: Vivaik Balasubrawmanian --- drivers/gpu/drm/i915/gt/uc/intel_gsc_proxy.c | 2 ++ drivers/gpu/drm/i915/pxp/intel_pxp.c | 15 --- drivers/gpu/drm/i91

[PATCH v7 1/3] drm/i915/pxp/mtl: Update pxp-firmware response timeout

2023-09-17 Thread Alan Previn
ts the GSC command streamer. Also, move GSC_REPLY_LATENCY_MS definition from pxp header to intel_gsc_uc_heci_cmd_submit.h since its for any GSC HECI packet. Signed-off-by: Alan Previn Reviewed-by: Vivaik Balasubrawmanian --- .../i915/gt/uc/intel_gsc_uc_heci_cmd_submit.
