Try with --inj-vnet=0 (ie single flit pavkets) and vcs-per-vnet=8 first (such 
that sufficient number of VCs) ..
Before going to multi-flit packets (inj-vnet=2)

And then with multi-flit packets also try more VCs .. that may be the reason 
for lower throughput than you anticipate.
From my recollection of garnet stats, you get about 0.4 flits/node/cycle of 
saturation throughput for 8x8 mesh (recall that theoretical max for 8x8 mesh is 
0.5 flits/node/cycle) with single flit packets.

Cheers,
Tushar
On Oct 11, 2024 at 1:56 PM -0400, Άγγελος Καβαλέρος via gem5-users 
<gem5-users@gem5.org>, wrote:
Greetings dear gem5 Community,

The problem I encounter is related with my inability to explain the
output of the metrics that heterogarnet provided.

The environment that I work in WSL(Windows Subsystem for Linux) from VSCode.

step_1)
As written clearly inside the:
https://www.gem5.org/documentation/learning_gem5/part1/building/
using the command :
python3 which scons build/NULL/gem5.opt -j3

step_2)
and after that from :
https://www.gem5.org/documentation/general_docs/ruby/garnet_synthetic_traffic/
using the following:
scons defconfig build/NULL build_opts/NULL
scons setconfig build/NULL RUBY_PROTOCOL_GARNET_STANDALONE=y
scons build/NULL/gem5.debug

step_3)
and I also made some changes inside
"/gem5/src/mem/ruby/network/garnet/GarnetNetwork.py"
altering (the lines 47 and 48) buffers_per_data_vc and
buffers_per_ctrl_vc to 8 before step_2, to change the data and ctrl
buffer size.

The environment is ready.

Then with an automated script I run the command below with injection
rate form 0.01 to 0.15 with packet length 1024 (the script stops when
latency is above 100 cycles):

./build/NULL/gem5.debug configs/example/garnet_synth_traffic.py
--num-cpus=64
--num-dirs=64
--network=garnet
--topology=Mesh_XY
--mesh-rows=8
--sim-cycles=100000000
--synthetic=uniform_random
--inj-vne=2
--injectionrate=$injectionrate
--router-latency=1
--link-latency=1
--vcs-per-vnet=1
--routing-algorithm=1
--sys-clock=1GHz
--ruby-clock=1GHz
--link-width-bits=1024

The problem is that for uniform random traffic in an 8x8 2D mesh with
routing (XY) algorithm, the saturation throughput flits/node/cycle that
i get is much less than the expected one from related papers ( my
network has a congestion too early!). For example i have attached some
pictures:

Any commend would be quite helpful. I will also provide the metrics in
excel for better understanding

Thank you in advance!

<Screenshot 2024-10-11 204829.png><Screenshot 2024-10-11 204757.png><Screenshot 
2024-09-29 at 5.02.12 PM.png>
_______________________________________________
gem5-users mailing list -- gem5-users@gem5.org
To unsubscribe send an email to gem5-users-le...@gem5.org

Reply via email to