Re: [Mingw-w64-public] [PATCH 3/3] crt: Use naked functions for ARM64 assembly functions.

Jacek Caban Wed, 02 Apr 2025 08:11:49 -0700

On 1.04.2025 22:55, Martin Storsjö wrote:

On Tue, 1 Apr 2025, Jacek Caban wrote:
On ARM64EC, function declarations have additional nuances:
- Function names are mangled by prefixing them with "#"
- An unmangled symbol is defined as a weak anti-dependency alias tothe mangled
 symbol
- An entry thunk is generated to convert from the x86_64 callingconvention to
 the ARM64EC calling convention, used by the emulator
- A .hybmp section entry is generated to associate the function withits entry
 thunk
The compiler can handle all of this if provided with the necessaryinformation.
Naked functions are the most convenient way to achieve this.
Use naked functions only on Clang. GCC doesn’t support them on ARMtargets
Does this hold for GCC on e.g. aarch64 linux as well, or do you meanthe in-progress aarch64-mingw target?

It’s not supported on aarch64 at all, seems to be blocked by someideological concerns, see:


https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77882

and has broken behavior on x86_64 by emitting .seh_endprologue.
Right, that probably disqualifies using it overall.
Regarding me not liking global scope asm function definitions; I'mambivalent about whether I like naked functions more or less thanglobal scope asm. Using naked functions is better in the sense that itis clear on a C level what the functions are. But it's also an evenmore obscure feature which is even more of a tricky case to use - asnoted by your observations about which compilers support it above.

I mostly agree. I like the idea of naked functions, they provide a muchcleaner and more flexible approach (e.g., allowing static assemblyfunctions). However, compatibility concerns limit their usefulness.

On ARM64EC in particular, I think the benefits outweigh the drawbacks.For entry thunks, we could also use _Arm64XGenerateThunk. It’s not yetsupported by Clang, but implementing it shouldn’t be too difficult. Ieven started looking into it at one point but never finished the patch.

That said, I find its design unappealing. The idea of inserting abuilt-in function in what otherwise looks like an implementation just toprevent the compiler from emitting that implementation and doingsomething else instead feels ugly. And, as I recall, I could crash theMSVC compiler with my test cases (which isn’t uncommon with MSVC’sARM64EC behavior...).

Using naked functions instead provides entry thunks without needing todeal with _Arm64XGenerateThunk while also handling mangling and aliases.

Anyway, again this was just a note for the record; I do agree thatit's reasonable to use the feature here for arm64ec.
Before proceeding with this direction, I'd like to get a grasp of howmany functions this change will cover. I'd guess that essentially anyarm64 function in a .S file will need to be wrapped in this form, atleast if we want to be able to call them from x86_64 code? Is this thecase for only these couple of fucntions covered in this series, orwill there be dozens of similar functions converted afterwards?

There isn’t much more than this, which is why I eliminated a bunch ofassembly files first. Aside from this series, the main remaining part issetjmp/longjmp. I haven’t tackled that yet, I was waiting to see howthis series goes first. We could either place only ARM64EC versions in aC file or move them for all architectures.

There’s also __argtos, but since it’s internal, I think it’s fine toignore x86_64 callers and just mangle the name in the .S file instead.

---
mingw-w64-crt/include/internal.h      | 8 ++++++++
mingw-w64-crt/math/arm64/nearbyint.c  | 7 +++++++
mingw-w64-crt/math/arm64/nearbyintf.c | 7 +++++++
mingw-w64-crt/math/arm64/nearbyintl.c | 7 +++++++
mingw-w64-crt/math/arm64/trunc.c      | 7 +++++++
mingw-w64-crt/math/arm64/truncf.c     | 7 +++++++
6 files changed, 43 insertions(+)
diff --git a/mingw-w64-crt/include/internal.hb/mingw-w64-crt/include/internal.h
index b30ae0e5f..445928045 100644
--- a/mingw-w64-crt/include/internal.h
+++ b/mingw-w64-crt/include/internal.h
@@ -287,6 +287,8 @@ static inline unsigned int __mingw_statusfp(void)
    return flags;
}

+#ifndef __clang__
+
#define __ASM_FUNC_CODE(name,code)  \
    asm(".text\n\t" \
        ".p2align 2\n\t" \
@@ -295,6 +297,12 @@ static inline unsigned int __mingw_statusfp(void)
__MINGW64_STRINGIFY(__MINGW_USYMBOL(trunc))":\n\t" \
        code "\n\t");

+#else
+
+#define __ASM_FUNC_CODE(name,code) asm(code "\n\t");
+
+#endif
+
#ifdef __cplusplus
}
#endif
diff --git a/mingw-w64-crt/math/arm64/nearbyint.cb/mingw-w64-crt/math/arm64/nearbyint.c
index 64ade2750..02e433722 100644
--- a/mingw-w64-crt/math/arm64/nearbyint.c
+++ b/mingw-w64-crt/math/arm64/nearbyint.c
@@ -7,8 +7,15 @@
#include <math.h>
#include <internal.h>

+#ifdef __clang__
+double __attribute__((naked)) nearbyint(double x)
+{
+#endif
__ASM_FUNC_CODE(nearbyint,
This ifdeffery feels quite ugly TBH. Ideally I wouldn't want to haveany such ifdefs within the implementation files. Is there any way thatwe hide these details inside the macro?
Then we'd need to pass the function signature through the macro, whichis tricky, especially for the function arguments.
But I have a faint memory that it may be possible to pass such thingswithin parentheses, e.g. we could do __ASM_FUNC_CODE(pow, double,(double x, double y), "<code>").
If I'm mistaken, then it's indeed more tricky, but perhaps we could atleast make a version for functions that take one argument and returnsthe same type?

I think that would work. I do like the idea of having an explicit Csignature spelled out rather than buried in macro arguments, but I admitthe extra #ifdef usage isn’t great. I’ll make the change.



Thanks,

Jacek



_______________________________________________
Mingw-w64-public mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/mingw-w64-public

Re: [Mingw-w64-public] [PATCH 3/3] crt: Use naked functions for ARM64 assembly functions.

Reply via email to