On 26/04/2021 14:49, Iain Sandoe via Gcc wrote:
Alexander Monakov <amona...@ispras.ru> wrote:

On Fri, 23 Apr 2021, Josh Haberman via Gcc wrote:

On Fri, Apr 23, 2021 at 1:10 PM Iain Sandoe <idsan...@googlemail.com> wrote:
I did try to use it this ^ for GCC coroutines (where such a guarantee is
pretty important)

However, the issue there is that not all targets support indirect tailcalls.

I'm not familiar with this limitation. What targets do not support
indirect tail calls?

On Power64 tailcalls to anything that is not a static function are tricky, and current Clang ICEs for musttail indirect calls (or direct calls to a function which is not static) on that target. GCC also doesn't do indirect tailcalls on
Power64.

Also, NVPTX (being an intermediate instruction set) does not define a "tailcall instruction", so Clang also ICEs there. Just mentioning it for completeness.

Also aarch64-linux if I remember correctly.

Tailcalling is possible on aarch64 using the normal branch instructions (B & BR).

There however be reasons why those can't be used in a specific compilation context.


One could use a trampoline to deal with the loading of additional state, but  I do not want to go down that route for coroutines, if possible - since we already have a state frame, it seems better to me to extend the ABI for that to include space
for target-specific data at a known offset from the frame pointer.

Iain


What about tailcalls between DSOs?

What issues are there around DSOs? Clang/LLVM don't treat these
specially AFAIK, and it seems that tail calls through a PLT should
work okay?

No, some targets have additional requirements for calls that potentially go via
a (PIC) PLT. The most well-known example is probably 32-bit x86, which
requries the %ebx register to hold the address of GOT when entering a PLT
trampoline. Since %ebx is callee-saved, this makes tailcalling impossible.

LLVM solves this by transforming such calls to "no-plt" GOT-indirect calls under
'musttail', so evidently it does treat them specially.

Another example is mips64, where even non-PIC PLT is special (but looks like
LLVM does not do any tailcalls on mips64 at all).

Alexander


Reply via email to