Hello,
I am running a benchmark binary compiled with clang with  armv8a+fp+simd+crypto 
options.
All the workloads of this compiled benchmark have similar performance in gem5 
compared to actual mobile device except this one workload (quite simple 
workload, running Convolutional Neural Network by using C++ code from scratch 
without using any external library).
I generated tarmac tracing for first few thousand instructions starting from 
the ROI.
I see that, there are SVC instructions and MSR, MRS instructions at EL2 level. 
I am failing to understand, why there is no HVC instructions in tarmac tracing 
log but I am seeing so many MSR and MRS instructions executed at EL2! I do 
think, this is causing the particular workload to perform poorly. I don’t see 
any such EL2 instructions for other workloads of the same benchmark,
I am using gem5’s fs_bigLIttle.py script (O3 ARM CPU tuned) for FS simulation.

Any insight on this topic would be very helpful. Thanks.
_______________________________________________
gem5-users mailing list -- gem5-users@gem5.org
To unsubscribe send an email to gem5-users-le...@gem5.org

Reply via email to