Re: [FFmpeg-devel] [PATCH] avcodec/aarch64: Access externs via GOT with PIC

Martin Storsjö Mon, 11 Jul 2022 02:00:39 -0700

On Mon, 11 Jul 2022, Triang3l wrote:

However, static libraries barely have anything to configure overall as far as I know, so disabling exports specifically for FFmpeg may be complicated - but thankfully, we can (and even should, to reduce the file size) use -fvisibility=hidden globally in our application, if that helps fix the issue.

Yes, we could consider if we should build the libraries with -fvisibility=hidden (maybe as an option), but that's not always necessarily the best option. In particular we would want to set the default visibility for e.g. the public API symbols in that case. (Trying to export such symbols via the version script doesn't help, when they're explicitly set as hidden originally.)

Note that building your own application code with this option doesn't help much here; it's the libavcodec object files that would need to be built that way.

-Wl,-Bsymbolic should be fine too, and possibly even less intrusive.

Yes, that's quite non-intrusive, and should be easy to add as a linker option in your case.

If using __attribute__((visibility("hidden"))) for the lookup tables prevents dynamic relocations from being inserted, and if that doesn't break common usages of libavcodec, yes, it should be a way better solution than introducing an additional memory load at runtime.

I did a quick test with that and it seems like it works - I'll post a patch for that shortly.
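
For illustration, a minimal C sketch of what marking a lookup table as hidden could look like. The names `HIDDEN`, `example_lookup_table` and `example_lookup` are made up for this example and are not actual libavcodec symbols. The attribute is a GCC/Clang extension that is meaningful on ELF targets; elsewhere the macro expands to nothing or the attribute may be ignored.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper macro: only GCC-compatible compilers get the attribute. */
#if defined(__GNUC__) || defined(__clang__)
#define HIDDEN __attribute__((visibility("hidden")))
#else
#define HIDDEN
#endif

/* A hidden symbol can still be referenced from other object files within
 * the same library, but it is not exported from the shared library. On ELF
 * it therefore can't be preempted by another module, so a reference from
 * assembly can be resolved PC-relatively at link time. */
HIDDEN const uint8_t example_lookup_table[4] = { 1, 2, 4, 8 };

/* Public accessor, keeps default visibility. */
uint8_t example_lookup(size_t idx)
{
    assert(idx < sizeof(example_lookup_table));
    return example_lookup_table[idx];
}
```

With this, only `example_lookup` would be visible to users of the shared library, while the table itself stays internal.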

If we're able to avoid using the global object table this way though, maybe it could be desirable to also get rid of `movrelx` in the AArch32 code as well?

I wouldn't start touching that - if I remember correctly, movrelx is needed there for a bunch of other reasons - due to different relocation types and addressing modes there.

By the way, I'm also super confused by how the offset is applied in the local `movrel` currently, it looks very inconsistent. The `adrp` and `add` combination, as I understand its operation, should work for any 32-bit literal value, not specifically for addresses of known objects - `adrp` accepting the upper 20 bits as a literal and adding them to the PC, and then `add` just adding the lower 12 bits, the offset within the page, also taken as a literal.


Trust me, it specifically needs to be like this for a reason.

if everything `movrel` does is adding the PC to the input literal… do we even need to emit `sub` for negative offsets in it?

When the final binary is linked and run, then yes, all the adrp+add pair does is add a literal to PC.
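
As a rough model of that runtime arithmetic, here is a small C sketch. The function names are made up for this example. It assumes 4 KB pages and that adrp encodes a signed 21-bit page count relative to the page of the PC, with add supplying the low 12 bits of the target.

```c
#include <assert.h>
#include <stdint.h>

/* Page distance a linker would encode into adrp for a given
 * instruction address and target address (4 KB pages). */
int64_t page_delta_for(uint64_t pc, uint64_t target)
{
    return (int64_t)(target >> 12) - (int64_t)(pc >> 12);
}

/* Address produced at runtime by "adrp xN, <page_delta>" followed by
 * "add xN, xN, #lo12". */
uint64_t adrp_add(uint64_t pc, int64_t page_delta, uint32_t lo12)
{
    /* adrp holds a signed 21-bit page count, i.e. roughly +/- 4 GB. */
    assert(page_delta >= -(INT64_C(1) << 20) && page_delta < (INT64_C(1) << 20));
    assert(lo12 < 4096);

    /* adrp: clear the low 12 bits of the PC, then add the page distance.
     * Unsigned arithmetic wraps, so negative distances work as well. */
    uint64_t page = (pc & ~UINT64_C(0xFFF)) + (uint64_t)page_delta * 4096;

    /* add: add the offset within the page. */
    return page + lo12;
}
```

For example, with the instruction at 0x400123 and a target at 0x7ff456, the page distance is 0x3ff and the low 12 bits are 0x456.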

But before that, when an object file is assembled, the instruction opcodes can't be finalized with the actual literal value, as the distance from the adrp/add pair to the targeted symbol only is known at link time.

Therefore, the object file stores relocations that say "fix up this adrp instruction with the actual offset to 'symbol X + 42 bytes'". For ELF object files, the object file format and relocations allow a negative offset, but for MachO and COFF, they don't (or it might be limited accidentally by the tools). In either case, on MachO and COFF we can't practically express a relocation against "symbol minus some bytes" - so we produce an extra 'sub' instruction in those cases.
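
The decision described above can be sketched in C like this. It is only a model of the behaviour, not the actual assembler macro, and the type and function names (`obj_format`, `movrel_plan`, `plan_movrel`) are invented for this example.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical object file formats for this sketch. */
typedef enum { OBJ_ELF, OBJ_MACHO, OBJ_COFF } obj_format;

typedef struct {
    int32_t reloc_addend; /* offset folded into the adrp/add relocations */
    int32_t sub_amount;   /* amount for an extra 'sub'; 0 means no 'sub' */
} movrel_plan;

/* Decide how to reach "symbol + offset" with an adrp+add pair. */
movrel_plan plan_movrel(obj_format fmt, int32_t offset)
{
    assert(offset > INT32_MIN); /* so that -offset can't overflow */

    /* Default: fold the offset into the relocation itself. */
    movrel_plan plan = { offset, 0 };

    /* On MachO and COFF, "symbol minus some bytes" can't practically be
     * expressed, so relocate against the plain symbol and subtract after. */
    if (fmt != OBJ_ELF && offset < 0) {
        plan.reloc_addend = 0;
        plan.sub_amount = -offset;
    }
    return plan;
}
```

So for an offset of -42, ELF would get a relocation against "symbol - 42" and no extra instruction, while MachO and COFF would get a relocation against the plain symbol followed by a 'sub' of 42.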

This is also true for the Windows implementation - whose existence overall is questionable, as Windows DLLs use a different relocation method, and PIC doesn't apply to them at all if I understand correctly;

While Windows code doesn't do proper strict PIC like on ELF, CONFIG_PIC does end up set in those configurations (like I already mentioned in the previous mail), and referencing symbols with adrp+add is generally preferable over the non-PIC codepath of "ldr rX, =\val+\offset".

The latter will always store an absolute address in the constant island produced by the ldr pseudo instruction, and storing an absolute address emits a so-called "base relocation" into the linked PE-COFF DLL. When a DLL is loaded at a non-default address, the loader will need to fix those up - essentially the same as text relocations on ELF. When using adrp+add on PE-COFF, no such base relocations are needed.
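
To make the cost of base relocations concrete, here is a simplified C model of what a loader has to do for each stored absolute address when the image ends up at a different base. The function name and the flat list of offsets are assumptions for this sketch; the real PE-COFF relocation table has a different, block based layout.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Simplified model: every entry in reloc_offsets points at a 64-bit
 * absolute address stored in the image, which was computed assuming the
 * image is loaded at preferred_base. */
void apply_base_relocs(uint8_t *image, size_t image_size,
                       const size_t *reloc_offsets, size_t n_relocs,
                       uint64_t preferred_base, uint64_t actual_base)
{
    /* Unsigned wraparound handles loading below the preferred base too. */
    uint64_t delta = actual_base - preferred_base;

    for (size_t i = 0; i < n_relocs; i++) {
        uint64_t value;
        assert(reloc_offsets[i] + sizeof(value) <= image_size);

        /* memcpy avoids unaligned access issues. */
        memcpy(&value, image + reloc_offsets[i], sizeof(value));
        value += delta;
        memcpy(image + reloc_offsets[i], &value, sizeof(value));
    }
}
```

Every such fixup writes to a page of the image, which is why keeping their number low is desirable. An adrp+add pair needs no entry in this list, as it only encodes a distance relative to the PC.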

So while PE-COFF doesn't have true strict PIC, in practice you need very few base relocations on AArch64 - but if we'd skip the adrp+add instructions and use the non-PIC codepath of ldr as you suggest, we'd have many more base relocations.

is there a reason to emit the subtraction instruction that you can remember,


Yes, there is a reason.

or would it be safe to possibly even remove the offset argument completely?


No, it's not safe to remove that, it's clearly there for a reason.

// Martin
