By older ROCm I was referring to this v4.2 which is mentioned in the gem5-resources documentation. I used the following command to build square. Not sure if this makes a difference, but do you run the simulation inside the docker image or on the host machine?
docker run --rm -v ${PWD}:${PWD} -w ${PWD} gcr.io/gem5-test/gpu-fs:latest bash -c 'make clean; HCC_AMDGPU_TARGET=gfx900 make' On Wed, Dec 7, 2022 at 11:03 AM Poremba, Matthew <matthew.pore...@amd.com> wrote: > [AMD Official Use Only - General] > > > > Thanks Rajesh, > > > > That is good to know. I don’t think there is an list anywhere of which > CPUs work with KVM. > > > > Which older ROCm do you mean here? Was square compiled with an older > version? Ideally the verification should be passing as well. At least, it > does on my local setup so it would be difficult for me to debug why it does > not work for other folks. > > > > > > -Matt > > > > *From:* Rajesh Shashi Kumar <reachrajesh...@gmail.com> > *Sent:* Tuesday, December 6, 2022 5:49 PM > *To:* Poremba, Matthew <matthew.pore...@amd.com> > *Cc:* The gem5 Users mailing list <gem5-users@gem5.org> > *Subject:* Re: [gem5-users] GPU-FS simulation progress > > > > *Caution:* This message originated from an External Source. Use proper > caution when opening attachments, clicking links, or responding. > > > > Okay turns out the issue was indeed using a slow local machine somehow > (AMD Ryzen 7 5800H) > > I ran the same thing on a "AMD EPYC 7451 24-Core Processor x2" and I am > able to run square now within 10minutes or so. I guess the last two lines > could be because of using an older rocm version. > > Running ../../gpu/square/bin/square > info: running on device Vega 10 XTX [Radeon Vega Frontier Edition] > info: architecture on AMD GPU device is: 900 > info: allocate host and device mem ( 7.63 MB) > info: launch 'vector_square' kernel > info: check result > > > *error: 'hipErrorUnknown'(999) at square.cpp:82 ./script.sh: line 13: > 548 Segmentation fault ./myapp * > Thank you again for your time on this! > > -- > Rajesh Shashi Kumar > > > > On Tue, Dec 6, 2022 at 7:39 PM Rajesh Shashi Kumar < > reachrajesh...@gmail.com> wrote: > > Thank you for your time. I tried using the provided example for booting > Ubuntu from a disk-image. > > *./build/X86/gem5.opt configs/example/gem5_library/x86-ubuntu-run.py* > > With this, I see that the boot did complete with this example and *kvm-ok* > returns as expected on my machine. > Also, I should mention that I'm using the pre-compiled image/kernel for > GPU-FS to rule out any uncertainty there. > > Term output: > Welcome to Ubuntu 18.04.2 LTS! > > systemd[1]: Set hostname to <gem5-host>. > systemd[1]: File /lib/systemd/system/systemd-journald.service:36 > configures an IP firewall (IPAddressDeny=any), but the local system does > not support BPF/cgroup based firewalling. > systemd[1]: Proceeding WITHOUT firewalling in effect! (This warning is > only shown for the first loaded unit using IP firewalling.) > random: systemd: uninitialized urandom read (16 bytes read) > systemd[1]: Reached target Remote File Systems. > [ OK ] Reached target Remote File Systems. > random: systemd: uninitialized urandom read (16 bytes read) > systemd[1]: Created slice System Slice. > [ OK ] Created slice System Slice. > ... > > > > On Tue, Dec 6, 2022 at 6:30 PM Poremba, Matthew <matthew.pore...@amd.com> > wrote: > > [AMD Official Use Only - General] > > > > At this point I would check if the other KVM scripts are working for you > (there are some simple tests somewhere like boot Ubuntu and exit). KVM > works on some CPUs better than others, I believe, or at least this was true > in the past. I have a few other ideas to try, but I would like to see if > any other scripts are working first and understand your setup to see if > other folks might run into the same issue in the future. > > > > > > -Matt > > > > *From:* Rajesh Shashi Kumar <reachrajesh...@gmail.com> > *Sent:* Tuesday, December 6, 2022 4:09 PM > *To:* Poremba, Matthew <matthew.pore...@amd.com> > *Cc:* The gem5 Users mailing list <gem5-users@gem5.org> > *Subject:* Re: [gem5-users] GPU-FS simulation progress > > > > *Caution:* This message originated from an External Source. Use proper > caution when opening attachments, clicking links, or responding. > > > > Thank you for your response. > > I double checked my image and kernel, I don't think KVM is hanging but the > progress seems to be a character printed on the term every once in a while. > I assume this is even before it could finish booting. Not sure if > fastfoward could help here > > My term output: > ==== m5 terminal: Terminal 0 ==== > [ 0.000000] Linux version > > > Thanks, > Rajesh > > > > On Tue, Dec 6, 2022 at 4:20 PM Poremba, Matthew <matthew.pore...@amd.com> > wrote: > > [AMD Official Use Only - General] > > Hi Rajesh, > > > I looks like no progress has been made since a very early tick number (the > timestamp print by Linux is equal to the current simulation tick / 1 > trillion). For reference it should take no more than 1-3 wall clock minutes > to full boot Linux and begin running the application with the KVM CPU. I > have seen fairly rarely where the KVM simply hangs and makes no progress > but simply running again fixed this. Your command looks correct though. > > Maybe someone who knows more about debugging KVM can comment how to see > what the KVM CPU is doing. > > > -Matt > > From: Rajesh Shashi Kumar via gem5-users <gem5-users@gem5.org> > Sent: Tuesday, December 6, 2022 2:06 PM > To: gem5 users mailing list <gem5-users@gem5.org> > Cc: Rajesh Shashi Kumar <reachrajesh...@gmail.com> > Subject: [gem5-users] GPU-FS simulation progress > > Caution: This message originated from an External Source. Use proper > caution when opening attachments, clicking links, or responding. > > Hi, > > I followed the instructions on running gpu-fs square using the > gem5-resources repository. My simulation has been stuck here for a while > > ... > build/VEGA_X86/arch/x86/kvm/x86_cpu.cc:1561: warn: kvm-x86: MSR > (0xc0010015) unsupported by gem5. Skipping. > build/VEGA_X86/arch/x86/kvm/x86_cpu.cc:1561: warn: kvm-x86: MSR > (0x4b564d05) unsupported by gem5. Skipping. > build/VEGA_X86/dev/x86/pc.cc:117: warn: Don't know what interrupt to clear > for console. > 16964000000000: system.pc.com_1.device: attach terminal 0 > > I tried attaching a terminal on a different tab using the following but > I'm not sure if my image has booted or if the application is running: > $ util/term/m5term localhost 3456 > ==== m5 terminal: Terminal 0 ==== > [ 0.00000 > > Any advice is appreciated! > > My run command: > build/VEGA_X86/gem5.opt configs/example/gpufs/vega10_kvm.py --disk-image > ../disk-image/rocm42/rocm42-image/rocm42 --kernel > ../vmlinux-5.4.0-105-generic --gpu-mmio-trace ../vega_mmio.log --app > ../../gpu/square/bin/square > > Thanks, > Rajesh > >
_______________________________________________ gem5-users mailing list -- gem5-users@gem5.org To unsubscribe send an email to gem5-users-le...@gem5.org