Hi,

After updating to changeset 10338 (condition code registers for ARM), I
observed some (relatively significant) performance improvement. On a
particular workload which I usually check when I update gem5, the execution
time improves by 9%. It seems like the perf improvement is caused by having
more registers as FullRegisterEvents goes down by 25%, which makes sense.
While I understand that introducing CC registers could improve performance,
I did not expect adding half a dozen CC registers could have such a huge
perf impact. Any thoughts?

ARM, SE, O3 4-way, 128 int registers

Thanks,
Amin
_______________________________________________
gem5-users mailing list
gem5-users@gem5.org
http://m5sim.org/cgi-bin/mailman/listinfo/gem5-users

Reply via email to