[AMD Official Use Only - General]

I see, thanks again.  The verification should have passed in that case...

The Docker image is *only* for building applications, so that the user does not 
need to install ROCm locally to build them.  You do *not* run gem5 
simulations inside Docker for full-system GPU.


-Matt

From: Rajesh Shashi Kumar <reachrajesh...@gmail.com>
Sent: Wednesday, December 7, 2022 11:32 AM
To: Poremba, Matthew <matthew.pore...@amd.com>
Cc: The gem5 Users mailing list <gem5-users@gem5.org>
Subject: Re: [gem5-users] GPU-FS simulation progress

Caution: This message originated from an External Source. Use proper caution 
when opening attachments, clicking links, or responding.

By older ROCm I was referring to v4.2, which is mentioned in the 
gem5-resources documentation. I used the following command to build square. Not 
sure if this makes a difference, but do you run the simulation inside the 
Docker image or on the host machine?


docker run --rm -v ${PWD}:${PWD} -w ${PWD} gcr.io/gem5-test/gpu-fs:latest bash -c 'make clean; HCC_AMDGPU_TARGET=gfx900 make'

On Wed, Dec 7, 2022 at 11:03 AM Poremba, Matthew <matthew.pore...@amd.com> wrote:

Thanks Rajesh,

That is good to know.  I don't think there is a list anywhere of which CPUs 
work with KVM.

Which older ROCm do you mean here? Was square compiled with an older version?  
Ideally the verification should be passing as well.  At least, it does on my 
local setup so it would be difficult for me to debug why it does not work for 
other folks.


-Matt

From: Rajesh Shashi Kumar <reachrajesh...@gmail.com>
Sent: Tuesday, December 6, 2022 5:49 PM
To: Poremba, Matthew <matthew.pore...@amd.com>
Cc: The gem5 Users mailing list <gem5-users@gem5.org>
Subject: Re: [gem5-users] GPU-FS simulation progress

Okay, it turns out the issue was indeed a slow local machine (AMD 
Ryzen 7 5800H).

I ran the same thing on an "AMD EPYC 7451 24-Core Processor x2" and I am now able 
to run square within 10 minutes or so. I guess the last two lines could be 
because of using an older ROCm version.

Running ../../gpu/square/bin/square
info: running on device Vega 10 XTX [Radeon Vega Frontier Edition]
info: architecture on AMD GPU device is: 900
info: allocate host and device mem (  7.63 MB)
info: launch 'vector_square' kernel
info: check result
error: 'hipErrorUnknown'(999) at square.cpp:82
./script.sh: line 13:   548 Segmentation fault      ./myapp

Thank you again for your time on this!

--
Rajesh Shashi Kumar

On Tue, Dec 6, 2022 at 7:39 PM Rajesh Shashi Kumar <reachrajesh...@gmail.com> wrote:
Thank you for your time. I tried using the provided example for booting Ubuntu 
from a disk-image.

./build/X86/gem5.opt configs/example/gem5_library/x86-ubuntu-run.py

With this, I see that the boot did complete with this example and kvm-ok 
returns as expected on my machine.
Also, I should mention that I'm using the pre-compiled image/kernel for GPU-FS 
to rule out any uncertainty there.

Term output:
Welcome to Ubuntu 18.04.2 LTS!

systemd[1]: Set hostname to <gem5-host>.
systemd[1]: File /lib/systemd/system/systemd-journald.service:36 configures an IP firewall (IPAddressDeny=any), but the local system does not support BPF/cgroup based firewalling.
systemd[1]: Proceeding WITHOUT firewalling in effect! (This warning is only shown for the first loaded unit using IP firewalling.)
random: systemd: uninitialized urandom read (16 bytes read)
systemd[1]: Reached target Remote File Systems.
[  OK  ] Reached target Remote File Systems.
random: systemd: uninitialized urandom read (16 bytes read)
systemd[1]: Created slice System Slice.
[  OK  ] Created slice System Slice.
...

On Tue, Dec 6, 2022 at 6:30 PM Poremba, Matthew <matthew.pore...@amd.com> wrote:

At this point I would check if the other KVM scripts are working for you (there 
are some simple tests somewhere like boot Ubuntu and exit).  KVM works on some 
CPUs better than others, I believe, or at least this was true in the past.  I 
have a few other ideas to try, but I would like to see if any other scripts are 
working first and understand your setup to see if other folks might run into 
the same issue in the future.
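
Before trying the other KVM scripts, it may be worth confirming that the host exposes a usable KVM device at all. The snippet below is only a minimal sketch: the helper name kvm_usable is hypothetical, and it assumes the usual /dev/kvm device node on a Linux host. It checks permissions only and says nothing about how well KVM performs on a given CPU.

```python
import os


def kvm_usable(dev_path: str = "/dev/kvm") -> bool:
    """Return True if the KVM device node exists and is readable and writable.

    Assumption: the host uses the conventional /dev/kvm path. This is a
    permission check only; it does not exercise KVM itself.
    """
    return os.path.exists(dev_path) and os.access(dev_path, os.R_OK | os.W_OK)


if __name__ == "__main__":
    print("KVM device usable:", kvm_usable())
```

If this prints False, the current user probably lacks access to the device (or KVM is not enabled), which would need fixing before any KVM-based gem5 script can run.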


-Matt

From: Rajesh Shashi Kumar <reachrajesh...@gmail.com>
Sent: Tuesday, December 6, 2022 4:09 PM
To: Poremba, Matthew <matthew.pore...@amd.com>
Cc: The gem5 Users mailing list <gem5-users@gem5.org>
Subject: Re: [gem5-users] GPU-FS simulation progress

Thank you for your response.

I double-checked my image and kernel. I don't think KVM is hanging, but the 
only progress is a character printed on the terminal every once in a while. I 
assume this is even before it finishes booting. Not sure if fast-forwarding 
could help here.

My term output:
==== m5 terminal: Terminal 0 ====
[    0.000000] Linux version


Thanks,
Rajesh

On Tue, Dec 6, 2022 at 4:20 PM Poremba, Matthew <matthew.pore...@amd.com> wrote:

Hi Rajesh,


It looks like no progress has been made since a very early tick number (the 
timestamp printed by Linux is equal to the current simulation tick / 1 trillion). 
For reference, it should take no more than 1-3 wall-clock minutes to fully boot 
Linux and begin running the application with the KVM CPU.  I have seen, fairly 
rarely, cases where KVM simply hangs and makes no progress, but simply running 
again fixed this.  Your command looks correct, though.
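
As a small illustration of the tick-to-timestamp relationship above, here is a sketch of the conversion. It assumes 10^12 ticks per simulated second, as stated above; the function name and the sample tick value are hypothetical and not taken from any log.

```python
# Assumption: 1 trillion ticks per simulated second, per the rule of thumb above.
TICKS_PER_SECOND = 10**12


def ticks_to_seconds(ticks: int) -> float:
    """Convert a simulation tick count to simulated seconds."""
    return ticks / TICKS_PER_SECOND


if __name__ == "__main__":
    # Hypothetical tick value, chosen only to show the arithmetic.
    sample_tick = 2_500_000_000_000
    print(f"{ticks_to_seconds(sample_tick):.6f}")  # prints 2.500000
```

A Linux timestamp that stays near 0.000000 while wall-clock time keeps passing therefore suggests the guest is barely advancing.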

Maybe someone who knows more about debugging KVM can comment on how to see what 
the KVM CPU is doing.


-Matt

From: Rajesh Shashi Kumar via gem5-users <gem5-users@gem5.org>
Sent: Tuesday, December 6, 2022 2:06 PM
To: gem5 users mailing list <gem5-users@gem5.org>
Cc: Rajesh Shashi Kumar <reachrajesh...@gmail.com>
Subject: [gem5-users] GPU-FS simulation progress

Hi,

I followed the instructions for running GPU-FS square using the gem5-resources 
repository. My simulation has been stuck here for a while:

...
build/VEGA_X86/arch/x86/kvm/x86_cpu.cc:1561: warn: kvm-x86: MSR (0xc0010015) unsupported by gem5. Skipping.
build/VEGA_X86/arch/x86/kvm/x86_cpu.cc:1561: warn: kvm-x86: MSR (0x4b564d05) unsupported by gem5. Skipping.
build/VEGA_X86/dev/x86/pc.cc:117: warn: Don't know what interrupt to clear for console.
16964000000000: system.pc.com_1.device: attach terminal 0

I tried attaching a terminal in a different tab using the following, but I'm not 
sure if my image has booted or if the application is running:
$ util/term/m5term localhost 3456
==== m5 terminal: Terminal 0 ====
[    0.00000

Any advice is appreciated!

My run command:
build/VEGA_X86/gem5.opt configs/example/gpufs/vega10_kvm.py --disk-image ../disk-image/rocm42/rocm42-image/rocm42 --kernel ../vmlinux-5.4.0-105-generic --gpu-mmio-trace ../vega_mmio.log --app ../../gpu/square/bin/square

Thanks,
Rajesh
_______________________________________________
gem5-users mailing list -- gem5-users@gem5.org
To unsubscribe send an email to gem5-users-le...@gem5.org