https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
Richard Earnshaw changed:
What|Removed |Added
Status|WAITING |UNCONFIRMED
Ever confirmed|1
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #10 from Chen Baozi ---
I have attached the testcase I used to benchmark synchronization of OpenMP on
AArch64, which is extracted from EPCC OpenMP micro-benchmark suite.
The operating system I use is ubuntu 16.04 with 4.4.0 kernel. T
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #9 from Jakub Jelinek ---
Or of course aarch64 could replace not just futex.h, but wait.h if it has
something smarter. This needs to be done by somebody who knows the ISA though
(i.e. not me).
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #8 from Andrew Pinski ---
Wfe kinda works. But yield might be better. Though one most implementions
yield is just a nop. Even on the (4thread/core) hyperthread CN99xx, it is a
nop.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #7 from Chen Baozi ---
Created attachment 40867
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=40867&action=edit
synchronization test case
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #6 from Jakub Jelinek ---
BTW, aarch64 doesn't override cpu_relax, does the HW have any instruction
similar to __builtin_ia32_pause () that could be used for spinning?
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #5 from Jakub Jelinek ---
(In reply to Andrew Pinski from comment #4)
> Or the case the futex syscall is returning right away ...
The spinning is completely configurable, see the documented OMP_WAIT_POLICY
(standard) and GOMP_SPINCOU
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #4 from Andrew Pinski ---
Or the case the futex syscall is returning right away ...
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #3 from Andrew Pinski ---
I wonder if the case we are spinning too much in user space before hitting the
futex syscall.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
--- Comment #2 from Andrew Pinski ---
I should note I have seen some scalability issues with GOMP compared to LLVM's
openmp implementation on ThunderX 2 CN99xx (and on ThunderX 1 CN88xx). I don't
know if this is related to this or not because I
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79784
Richard Earnshaw changed:
What|Removed |Added
Status|UNCONFIRMED |WAITING
Last reconfirmed|
11 matches
Mail list logo