Re: Byte swapping support

Jim Wilson Wed, 13 Sep 2017 15:22:25 -0700

On 09/12/2017 02:32 AM, Jürg Billeter wrote:

To support applications that assume big-endian memory layout on little-
endian systems, I'm considering adding support for reversing the
storage order to GCC. In contrast to the existing scalar storage order
support for structs, the goal is to reverse the storage order for all
memory operations to achieve maximum compatibility with the behavior on
big-endian systems, as far as observable by the application.

Intel has support for this in icc. It took about 5 years for a smallteam to make it work on a very large application. That includes boththe compiler development and application development time. There are alot of complicated issues that need to solved to make this work on realcode, both in the compiler and in the application code. There is a DrDobbs article about some of it, search for "Writing a Bi-EndianCompiler" if you are interested.

Even though they got it working, it was painful to use. Icc goes to alot of trouble to optimize away unnecessary byte-swapping to improveperformance, but that meant any variable could be big or little endiandespite how it was declared, and could be different endianness atdifferent places in the code, and could even be both endianness (storedin two locations) at the same time if the code needed both endianness.Sometimes we'd find a bug, and it would take a week to figure out if itwas a compiler bug or an application bug.

To facilitate byte swapping at endian boundaries (kernel or libraries),
I'm also considering developing a new GCC builtin that can byte-swap
whole structs in memory. There are limitations to this, e.g., unions
could not be supported in general. However, I still expect this to be
very useful.

There is a lot more stuff that will cause problems. Byte-swapping FPdoesn't make sense. You can only byte swap a variable if you know itstype, but you don't know the type of a va_list ap argument, so you can'tcall a big-endian vprintf from little-endian code and vice versa. Ifyou have a template expanded in both big and little endian code, youwill run into problems unless name mangling changes to include endianinfo, which means you lose ABI compatibility with the current namemangling scheme.

There will also be trouble with variables in shared libraries that getinitialized by the dynamic linker. You will either have to add a newset of other-endian relocations, or else you will have to add code tobyte-swap data after relocations are performed, probably via an initroutine, which will have to run before the other init routines. Thereis also the same issue with static linking, but that one is a littleeasier to handle, as you can use a post-linking pass to edit the binaryand byte swap stuff that needs to be byte swapped after relocations areperformed.

To handle endian boundaries, you willl need to force all declarations tohave an endianness, and you will need to convert when calling abig-endian function from a little-endian function, and vice versa, andyou will need to give an error if you see something you can't convert,like a va_list argument. Besides the issue of the C library notchanging endianness, you will likely also have third party librariesthat you can't change the endianness of, and that need to be linked intoyour application.

Before you start, you should give some thought to how debugging willwork. DWARF does have an endianity attribute, you will need to set itcorrectly, or debugging will be hopeless. Even if you set it correctly,if you have optimizations to remove unnecessary byte swapping, debuggingoptimized code will still be hard and people using the compiler willhave to be trained on how to deal with endianness issues.

And there are lots of other problems, I don't have time to document themall, or even remember them all. Personally, I think you are better offtrying to fix the application to make it more portable. Fixing thecompiler is not a magic solution to the problem that is any easier thanfixing the application.

Jim

Re: Byte swapping support

Reply via email to