I think Srikant's other reply addressed this?

Matt
________________________________
From: David Fong <da...@chronostech.com>
Sent: Monday, March 7, 2022 11:12 AM
To: Poremba, Matthew <matthew.pore...@amd.com>; David Fong via gem5-users 
<gem5-users@gem5.org>; Bharadwaj, Srikant <srikant.bharad...@amd.com>
Cc: Bobby Bruce <bbr...@ucdavis.edu>; Matt Sinclair <sincl...@cs.wisc.edu>
Subject: gem5 + APU latency numbers


Hi Matt P.,



I noticed the following in the overall stats for cpu3 (APU).

With mem_req_latency and mem_resp_latency set to 40 instead of 50, the overall 
cpu3 (APU) latency numbers are reduced, but shaderActiveTicks increased.

Do these numbers make sense?



David



Modified:

gem5/build/GCN3_X86/gpu-compute/GPU.py



    mem_req_latency = Param.Int(40, "Latency for request from the cu to ruby. "\
                                "Represents the pipeline to reach the TCP "\
                                "and specified in GPU clock cycles")
    mem_resp_latency = Param.Int(40, "Latency for responses from ruby to the "\
                                 "cu. Represents the pipeline between the "\
                                 "TCP and cu as well as TCP data array "\
                                 "access. Specified in GPU clock cycles")



m5out/stats.txt



40 (mem_req_latency, mem_resp_latency) (smaller is better)
system.cpu3.allLatencyDist::mean         458572.656250                       # delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::stdev        429452.145064                       # delay distribution for all (Unspecified)

50 (mem_req_latency, mem_resp_latency)
system.cpu3.allLatencyDist::mean         491744.531250                       # delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::stdev        439992.936927                       # delay distribution for all (Unspecified)



Both the mean and the stdev of the latency are lower with 40 than with 50.



40 (mem_req_latency, mem_resp_latency) (smaller is better)
system.cpu3.allLatencyDist::overflows              97      1.52%    100.00% # delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::min_value           84000                       # delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::max_value         3796000                       # delay distribution for all (Unspecified)

50 (mem_req_latency, mem_resp_latency)
system.cpu3.allLatencyDist::overflows              125      1.95%    100.00% # delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::min_value          104000                       # delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::max_value         2651000                       # delay distribution for all (Unspecified)



40 (mem_req_latency, mem_resp_latency) (larger is better?)
system.cpu3.shaderActiveTicks               172369999                       # Total ticks that any CU attached to this shader is active (Unspecified)

50 (mem_req_latency, mem_resp_latency)
system.cpu3.shaderActiveTicks               171038999                       # Total ticks that any CU attached to this shader is active (Unspecified)
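
To put the two runs side by side, the differences can be expressed as percentages relative to the 50-cycle run. This is only a minimal sketch: the values are copied by hand from the stats.txt excerpts above, and the names STATS, percent_change and compare are made up for illustration and are not part of gem5.

```python
# Values copied by hand from the two m5out/stats.txt excerpts above.
# Keys are the mem_req_latency / mem_resp_latency setting (GPU clock cycles).
STATS = {
    40: {
        "allLatencyDist::mean": 458572.656250,
        "allLatencyDist::stdev": 429452.145064,
        "allLatencyDist::min_value": 84000,
        "allLatencyDist::max_value": 3796000,
        "shaderActiveTicks": 172369999,
    },
    50: {
        "allLatencyDist::mean": 491744.531250,
        "allLatencyDist::stdev": 439992.936927,
        "allLatencyDist::min_value": 104000,
        "allLatencyDist::max_value": 2651000,
        "shaderActiveTicks": 171038999,
    },
}


def percent_change(new, old):
    """Signed change of `new` relative to `old`, in percent."""
    return (new - old) / old * 100.0


def compare(stats, new_key=40, base_key=50):
    """Percent change of every stat in the `new_key` run vs. the `base_key` run."""
    return {
        name: percent_change(stats[new_key][name], base_value)
        for name, base_value in stats[base_key].items()
    }


if __name__ == "__main__":
    for name, pct in compare(STATS).items():
        # A negative value means the 40-cycle run is lower than the 50-cycle run.
        print(f"{name:28s} {pct:+8.2f}%")
```

Run this way, the mean, stdev and min_value come out negative (lower with 40), while max_value and shaderActiveTicks come out positive. The shaderActiveTicks difference is under 1%.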






_______________________________________________
gem5-users mailing list -- gem5-users@gem5.org
To unsubscribe send an email to gem5-users-le...@gem5.org