[AMD Official Use Only - AMD Internal Distribution Only]

Hi Alex,
The call trace is generated when the gdm is launched, as below.
I tried running on a standalone workqueue but still see the workqueue is 
flushed.
Thanks.

[   21.558439] ------------[ cut here ]------------
[   21.558443] workqueue: WQ_MEM_RECLAIM gfx_0.0.0:drm_sched_run_job_work 
[amd_sched] is flushing !WQ_MEM_RECLAIM 
events:amdgpu_gfx_profile_idle_work_handler [amdgpu]
[   21.558716] WARNING: CPU: 0 PID: 115 at kernel/workqueue.c:3706 
check_flush_dependency+0x151/0x180
[   21.558724] Modules linked in: snd_seq_dummy snd_hrtimer qrtr sunrpc amd_atl 
intel_rapl_msr intel_rapl_common snd_hda_codec_hdmi snd_hda_intel 
snd_intel_dspcfg edac_mce_amd snd_intel_sdw_acpi snd_usb_audio snd_hda_codec 
kvm_amd snd_usbmidi_lib snd_hda_core snd_ump mc snd_hwdep snd_pcm kvm 
snd_seq_midi snd_seq_midi_event crct10dif_pclmul snd_rawmidi polyval_clmulni 
polyval_generic ghash_clmulni_intel spd5118 sha256_ssse3 sha1_ssse3 snd_seq 
aesni_intel crypto_simd cryptd snd_seq_device snd_timer rapl wmi_bmof ccp snd 
i2c_piix4 k10temp i2c_smbus soundcore input_leds joydev gpio_amdpt mac_hid 
binfmt_misc sch_fq_codel msr parport_pc ppdev lp parport efi_pstore nfnetlink 
dmi_sysfs ip_tables x_tables autofs4 hid_generic usbhid hid amdgpu(OE) 
amddrm_ttm_helper(OE) amdttm(OE) amddrm_buddy(OE) amdxcp(OE) drm_exec 
drm_suballoc_helper amd_sched(OE) amdkcl(OE) drm_display_helper cec rc_core 
nvme i2c_algo_bit drm_ttm_helper crc32_pclmul r8169 xhci_pci nvme_core ahci ttm 
xhci_pci_renesas libahci realtek nvme_auth video wmi
[   21.558817] CPU: 0 UID: 0 PID: 115 Comm: kworker/u64:1 Tainted: G           
OE      6.11.0-17-generic #17~24.04.2-Ubuntu
[   21.558822] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
[   21.558823] Hardware name: Micro-Star International Co., Ltd. MS-7D76/MAG 
B650M MORTAR WIFI (MS-7D76), BIOS A.J0 12/17/2024
[   21.558825] Workqueue: gfx_0.0.0 drm_sched_run_job_work [amd_sched]
[   21.558830] RIP: 0010:check_flush_dependency+0x151/0x180
[   21.558833] Code: 56 18 4d 89 e0 48 8d 8b c0 00 00 00 48 c7 c7 e8 88 09 a1 
c6 05 e8 4d 8d 02 01 48 8b 70 08 48 81 c6 c0 00 00 00 e8 6f 54 fd ff <0f> 0b e9 
d2 fe ff ff 44 0f b6 3d ca 4d 8d 02 41 80 ff 01 77 0f 41
[   21.558836] RSP: 0018:ffffae930051fbe8 EFLAGS: 00010046
[   21.558838] RAX: 0000000000000000 RBX: ffff9abf80201400 RCX: 0000000000000000
[   21.558840] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[   21.558842] RBP: ffffae930051fc10 R08: 0000000000000000 R09: 0000000000000000
[   21.558843] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffc0992ad0
[   21.558844] R13: 0000000000000000 R14: ffff9abf8030d440 R15: ffffae930051fc40
[   21.558846] FS:  0000000000000000(0000) GS:ffff9ace9d800000(0000) 
knlGS:0000000000000000
[   21.558848] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   21.558850] CR2: 0000073bf2b6c000 CR3: 000000004623e000 CR4: 0000000000f50ef0
[   21.558852] PKRU: 55555554
[   21.558853] Call Trace:
[   21.558855]  <TASK>
[   21.558859]  ? show_regs+0x6c/0x80
[   21.558864]  ? __warn+0x88/0x140
[   21.558867]  ? check_flush_dependency+0x151/0x180
[   21.558870]  ? report_bug+0x182/0x1b0
[   21.558875]  ? handle_bug+0x6e/0xb0
[   21.558880]  ? exc_invalid_op+0x18/0x80
[   21.558883]  ? asm_exc_invalid_op+0x1b/0x20
[   21.558888]  ? __pfx_amdgpu_gfx_profile_idle_work_handler+0x10/0x10 [amdgpu]
[   21.559113]  ? check_flush_dependency+0x151/0x180
[   21.559116]  ? check_flush_dependency+0x151/0x180
[   21.559120]  __flush_work+0x238/0x310
[   21.559124]  ? __mod_timer+0x122/0x340
[   21.559129]  cancel_delayed_work_sync+0x76/0x80
[   21.559133]  amdgpu_gfx_profile_ring_begin_use+0x34/0xa0 [amdgpu]
[   21.559341]  gfx_v12_0_ring_begin_use+0x12/0x30 [amdgpu]
[   21.559531]  amdgpu_ring_alloc+0x40/0x70 [amdgpu]
[   21.559675]  amdgpu_ib_schedule+0x172/0x830 [amdgpu]
[   21.559821]  amdgpu_job_run+0x8d/0x200 [amdgpu]
[   21.559994]  drm_sched_run_job_work+0x2bb/0x450 [amd_sched]
[   21.559997]  process_one_work+0x178/0x3d0
[   21.560000]  worker_thread+0x2de/0x410
[   21.560002]  ? __pfx_worker_thread+0x10/0x10
[   21.560004]  kthread+0xe1/0x110
[   21.560006]  ? __pfx_kthread+0x10/0x10
[   21.560008]  ret_from_fork+0x44/0x70
[   21.560010]  ? __pfx_kthread+0x10/0x10
[   21.560012]  ret_from_fork_asm+0x1a/0x30
[   21.560017]  </TASK>
[   21.560017] ---[ end trace 0000000000000000 ]---


-----Original Message-----
From: Alex Deucher <alexdeuc...@gmail.com>
Sent: Wednesday, March 19, 2025 8:54 PM
To: Feng, Kenneth <kenneth.f...@amd.com>
Cc: amd-gfx@lists.freedesktop.org; Wang, Yang(Kevin) <kevinyang.w...@amd.com>
Subject: Re: [PATCH] drm/amd/amdgpu: Revert "drm/amd/amdgpu: shorten the gfx 
idle worker timeout"

Caution: This message originated from an External Source. Use proper caution 
when opening attachments, clicking links, or responding.


On Wed, Mar 19, 2025 at 2:38 AM Kenneth Feng <kenneth.f...@amd.com> wrote:
>
> This reverts commit b00fb9765ea4b05198d67256118445c6f13f9ddf.
>
> Reason for revert: this causes some tests fail with call trace.

Do you have a copy of the call trace?  I can't see how this would be an issue?

Alex

>
> Signed-off-by: Kenneth Feng <kenneth.f...@amd.com>
> ---
>  drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h | 4 ++--
>  1 file changed, 2 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h
> b/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h
> index a6d3a4554caa..75af4f25a133 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h
> @@ -57,8 +57,8 @@ enum amdgpu_gfx_pipe_priority {  #define
> AMDGPU_GFX_QUEUE_PRIORITY_MINIMUM  0  #define
> AMDGPU_GFX_QUEUE_PRIORITY_MAXIMUM  15
>
> -/* 10 millisecond timeout */
> -#define GFX_PROFILE_IDLE_TIMEOUT       msecs_to_jiffies(10)
> +/* 1 second timeout */
> +#define GFX_PROFILE_IDLE_TIMEOUT       msecs_to_jiffies(1000)
>
>  enum amdgpu_gfx_partition {
>         AMDGPU_SPX_PARTITION_MODE = 0,
> --
> 2.34.1
>

Reply via email to