On Mon, Jun 1, 2015 at 1:33 PM, Ramana Radhakrishnan <ramana....@googlemail.com> wrote: > On Mon, Jun 1, 2015 at 7:55 PM, Sriraman Tallam <tmsri...@google.com> wrote: >> On Mon, Jun 1, 2015 at 11:41 AM, Ramana Radhakrishnan >> <ramana....@googlemail.com> wrote: >>> On Mon, Jun 1, 2015 at 7:01 PM, Sriraman Tallam <tmsri...@google.com> wrote: >>>> On Mon, Jun 1, 2015 at 1:24 AM, Ramana Radhakrishnan >>>> <ramana.radhakrish...@arm.com> wrote: >>>>> >>>>>>> Why isn't it just an indirect call in the cases that would require a GOT >>>>>>> slot and a direct call otherwise ? I'm trying to work out what's so >>>>>>> different on each target that mandates this to be in the target backend. >>>>>>> Also it would be better to push the tests into gcc.dg if you can and >>>>>>> check >>>>>>> for the absence of a relocation so that folks at least see these as >>>>>>> being >>>>>>> UNSUPPORTED on their target. >>>>>> >>>>>> >>>>> >>>>> >>>>> To be even more explicit, shouldn't this be handled similar to the way in >>>>> which -fno-plt is handled in a target agnostic manner ? After all, if you >>>>> can handle this for the command line, doing the same for a function which >>>>> has been decorated with attribute((noplt)) should be simple. >>>> >>>> -fno-plt does not work for non-PIC code, having non-PIC code not use >>>> PLT was my primary motivation. Infact, if you go back in this thread, >>>> I suggested to HJ if I should piggyback on -fno-plt. I tried using >>>> the -fno-plt implementation to do this by removing the flag_pic check >>>> in calls.c, but that does not still work for non-PIC code. > > If you want __attribute__ ((noplt)) to work for non-PIC code, we > should look to code it in the same place surely by making all > __attribute__((noplt)) calls, indirect calls irrespective of whether > it's fpic or not. > > >>> >>> You're missing my point, unless I'm missing something basic here - I >>> should have been even more explicit and said -fPIC was a given in all >>> this discussion. >>> >>> calls.c:229 has >>> >>> else if (flag_pic && !flag_plt && fndecl_or_type >>> && TREE_CODE (fndecl_or_type) == FUNCTION_DECL >>> && !targetm.binds_local_p (fndecl_or_type)) >>> >>> why can't we merge the check in here for the attribute noplt ? >> >> We can and and please see this thread, that is the exact patch I proposed : >> https://gcc.gnu.org/ml/gcc-patches/2015-05/msg02682.html >> >> However, there was one caveat. I want this working without -fPIC too. >> non-PIC code also generates PLT calls and I want them eliminated. >> >>> >>> If a new attribute is added to the "GNU language" in this case, why >>> isn't this being treated in the same way as the command line option >>> has been treated ? All this means is that we add an attribute and a >>> command line option to common code and then not implement it in a >>> proper target agnostic fashion. >> >> You are right. This is the way I wanted it too but I also wanted the >> attribute to work without PIC. PLT calls are generated without -fPIC >> and -fPIE too and I wanted a solution for that. On looking at the >> code in more detail, >> >> * -fno-plt is made to work with -fPIC, is there a reason to not make >> it work for non-PIC code? I can remove the flag_pic check from >> calls.c > > I don't think that's right, you probably have to allow that along with > (flag_pic || (decl && attribute_no_plt (decl)) - however it seems odd > to me that the language extension allows this but the flag doesn't. > >> * Then, I add the generic attribute "noplt" and everything is fine. >> >> There is just one caveat with the above approach, for x86_64 >> (*call_insn) will not generate indirect-calls for *non-PIC* code >> because constant_call_address_operand in predicates.md will evaluate >> to false. This can be fixed appropriately in ix86_output_call_insn in >> i386.c. > > Yes, targets need to massage that into place but that's essentially > the mechanics of retaining indirect calls in each backend. -fno-plt > doesn't work for ARM / AArch64 with optimizers currently (and I > suspect on most other targets) because our predicates are too liberal, > fixed by treating "noplt" or -fno-plt as the equivalent of > -mlong-calls. > >> >> >> Is this alright? Sorry for the confusion, but the primary reason why >> I did not do it the way you suggested is because we wanted "noplt" >> attribute to work for non-PIC code also. > > If that is the case, then this is a slightly more complicated > condition in the same place. We then always have indirect calls for > functions that are marked noplt and just have target generate this > appropriately.
I have now modified this patch. This patch does two things: 1) Adds new generic function attribute "no_plt" that is similar in functionality to -fno-plt except that it applies only to calls to functions that are marked with this attribute. 2) For x86_64, it makes -fno-plt(and the attribute) also work for non-PIC code by directly generating an indirect call via a GOT entry. For PIC code, no_plt merely shadows the implementation of -fno-plt, no surprises here. * c-family/c-common.c (no_plt): New attribute. (handle_no_plt_attribute): New handler. * calls.c (prepare_call_address): Check for no_plt attribute. * config/i386/i386.c (ix86_function_ok_for_sibcall): Check for no_plt attribute. (ix86_expand_call): Ditto. (nopic_no_plt_attribute): New function. (ix86_output_call_insn): Output indirect call for non-pic no plt calls. * doc/extend.texi (no_plt): Document new attribute. * testsuite/gcc.target/i386/noplt-1.c: New test. * testsuite/gcc.target/i386/noplt-2.c: New test. * testsuite/gcc.target/i386/noplt-3.c: New test. * testsuite/gcc.target/i386/noplt-4.c: New test. Please review. Thanks Sri > > To be honest, this is trivial to implement in the ARM backend as one > would just piggy back on the longcalls work - despite that, IMNSHO > it's best done in a target independent manner. > > regards > Ramana > >> >> Thanks >> Sri >> >>> >>> regards >>> Ramana >>> >>> >>>> >>>>> >>>>>> I am not familiar with PLT calls for other targets. I can move the >>>>>> tests to gcc.dg but what relocation are you suggesting I check for? >>>>> >>>>> >>>>> Move the test to gcc.dg, add a target_support_no_plt function in >>>>> testsuite/lib/target-supports.exp and mark this as being supported only on >>>>> x86 and use scan-assembler to scan for PLT relocations for x86. Other >>>>> targets can add things as they deem fit. >>>> >>>>> >>>>> In any case, on a large number of elf/ linux targets I would have thought >>>>> the absence of a JMP_SLOT relocation would be good enough to check that >>>>> this >>>>> is working correctly. >>>>> >>>>> regards >>>>> Ramana >>>>> >>>>> >>>>> >>>>> >>>>>> >>>>>> Thanks >>>>>> Sri >>>>>> >>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> Ramana >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>> Also I think the PLT calls have EBX in call fusage wich is added by >>>>>>>>> ix86_expand_call. >>>>>>>>> else >>>>>>>>> { >>>>>>>>> /* Static functions and indirect calls don't need the pic >>>>>>>>> register. */ >>>>>>>>> if (flag_pic >>>>>>>>> && (!TARGET_64BIT >>>>>>>>> || (ix86_cmodel == CM_LARGE_PIC >>>>>>>>> && DEFAULT_ABI != MS_ABI)) >>>>>>>>> && GET_CODE (XEXP (fnaddr, 0)) == SYMBOL_REF >>>>>>>>> && ! SYMBOL_REF_LOCAL_P (XEXP (fnaddr, 0))) >>>>>>>>> { >>>>>>>>> use_reg (&use, gen_rtx_REG (Pmode, >>>>>>>>> REAL_PIC_OFFSET_TABLE_REGNUM)); >>>>>>>>> if (ix86_use_pseudo_pic_reg ()) >>>>>>>>> emit_move_insn (gen_rtx_REG (Pmode, >>>>>>>>> REAL_PIC_OFFSET_TABLE_REGNUM), >>>>>>>>> pic_offset_table_rtx); >>>>>>>>> } >>>>>>>>> >>>>>>>>> I think you want to take that away from FUSAGE there just like we do >>>>>>>>> for >>>>>>>>> local calls >>>>>>>>> (and in fact the code should already check flag_pic && flag_plt I >>>>>>>>> suppose. >>>>>>>> >>>>>>>> >>>>>>>> Done that now and patch attached. >>>>>>>> >>>>>>>> Thanks >>>>>>>> Sri >>>>>>>> >>>>>>>>> >>>>>>>>> Honza >>>>>> >>>>>> >>>>>
* c-family/c-common.c (no_plt): New attribute. (handle_no_plt_attribute): New handler. * calls.c (prepare_call_address): Check for no_plt attribute. * config/i386/i386.c (ix86_function_ok_for_sibcall): Check for no_plt attribute. (ix86_expand_call): Ditto. (nopic_no_plt_attribute): New function. (ix86_output_call_insn): Output indirect call for non-pic no plt calls. * doc/extend.texi (no_plt): Document new attribute. * testsuite/gcc.target/i386/noplt-1.c: New test. * testsuite/gcc.target/i386/noplt-2.c: New test. * testsuite/gcc.target/i386/noplt-3.c: New test. * testsuite/gcc.target/i386/noplt-4.c: New test. This patch does two things: * Adds new generic function attribute "no_plt" that is similar in functionality to -fno-plt except that it applies only to calls to functions that are marked with this attribute. * For x86_64, it makes -fno-plt(and the attribute) also work for non-PIC code by directly generating an indirect call via a GOT entry. Index: c-family/c-common.c =================================================================== --- c-family/c-common.c (revision 223720) +++ c-family/c-common.c (working copy) @@ -357,6 +357,7 @@ static tree handle_mode_attribute (tree *, tree, t static tree handle_section_attribute (tree *, tree, tree, int, bool *); static tree handle_aligned_attribute (tree *, tree, tree, int, bool *); static tree handle_weak_attribute (tree *, tree, tree, int, bool *) ; +static tree handle_no_plt_attribute (tree *, tree, tree, int, bool *) ; static tree handle_alias_ifunc_attribute (bool, tree *, tree, tree, bool *); static tree handle_ifunc_attribute (tree *, tree, tree, int, bool *); static tree handle_alias_attribute (tree *, tree, tree, int, bool *); @@ -706,6 +707,8 @@ const struct attribute_spec c_common_attribute_tab handle_aligned_attribute, false }, { "weak", 0, 0, true, false, false, handle_weak_attribute, false }, + { "no_plt", 0, 0, true, false, false, + handle_no_plt_attribute, false }, { "ifunc", 1, 1, true, false, false, handle_ifunc_attribute, false }, { "alias", 1, 1, true, false, false, @@ -8185,6 +8188,25 @@ handle_weak_attribute (tree *node, tree name, return NULL_TREE; } +/* Handle a "no_plt" attribute; arguments as in + struct attribute_spec.handler. */ + +static tree +handle_no_plt_attribute (tree *node, tree name, + tree ARG_UNUSED (args), + int ARG_UNUSED (flags), + bool * ARG_UNUSED (no_add_attrs)) +{ + if (TREE_CODE (*node) != FUNCTION_DECL) + { + warning (OPT_Wattributes, + "%qE attribute is only applicable on functions", name); + *no_add_attrs = true; + return NULL_TREE; + } + return NULL_TREE; +} + /* Handle an "alias" or "ifunc" attribute; arguments as in struct attribute_spec.handler, except that IS_ALIAS tells us whether this is an alias as opposed to ifunc attribute. */ Index: calls.c =================================================================== --- calls.c (revision 223720) +++ calls.c (working copy) @@ -226,8 +226,10 @@ prepare_call_address (tree fndecl_or_type, rtx fun && targetm.small_register_classes_for_mode_p (FUNCTION_MODE)) ? force_not_mem (memory_address (FUNCTION_MODE, funexp)) : memory_address (FUNCTION_MODE, funexp)); - else if (flag_pic && !flag_plt && fndecl_or_type + else if (flag_pic && fndecl_or_type && TREE_CODE (fndecl_or_type) == FUNCTION_DECL + && (!flag_plt + || lookup_attribute ("no_plt", DECL_ATTRIBUTES (fndecl_or_type))) && !targetm.binds_local_p (fndecl_or_type)) { funexp = force_reg (Pmode, funexp); Index: config/i386/i386.c =================================================================== --- config/i386/i386.c (revision 223720) +++ config/i386/i386.c (working copy) @@ -5479,6 +5479,8 @@ ix86_function_ok_for_sibcall (tree decl, tree exp) && !TARGET_64BIT && flag_pic && flag_plt + && (TREE_CODE (decl) != FUNCTION_DECL + || !lookup_attribute ("no_plt", DECL_ATTRIBUTES (decl))) && decl && !targetm.binds_local_p (decl)) return false; @@ -25497,13 +25499,19 @@ ix86_expand_call (rtx retval, rtx fnaddr, rtx call } else { - /* Static functions and indirect calls don't need the pic register. */ + /* Static functions and indirect calls don't need the pic register. Also, + check if PLT was explicitly avoided via no-plt or "no_plt" attribute, making + it an indirect call. */ if (flag_pic && (!TARGET_64BIT || (ix86_cmodel == CM_LARGE_PIC && DEFAULT_ABI != MS_ABI)) && GET_CODE (XEXP (fnaddr, 0)) == SYMBOL_REF - && ! SYMBOL_REF_LOCAL_P (XEXP (fnaddr, 0))) + && ! SYMBOL_REF_LOCAL_P (XEXP (fnaddr, 0)) + && flag_plt + && (TREE_CODE (SYMBOL_REF_DECL (XEXP(fnaddr, 0))) != FUNCTION_DECL + || !lookup_attribute ("no_plt", + DECL_ATTRIBUTES (SYMBOL_REF_DECL (XEXP(fnaddr, 0)))))) { use_reg (&use, gen_rtx_REG (Pmode, REAL_PIC_OFFSET_TABLE_REGNUM)); if (ix86_use_pseudo_pic_reg ()) @@ -25599,6 +25607,34 @@ ix86_expand_call (rtx retval, rtx fnaddr, rtx call return call; } +/* Return true if the function being called was marked with attribute + "no_plt" or using -fno-plt and we are compiling for no-PIC and x86_64. + This is currently used only with 64-bit ELF targets to call the function + marked "no_plt" indirectly. */ + +static bool +nopic_no_plt_attribute (rtx call_op) +{ + if (flag_pic) + return false; + + if (!TARGET_64BIT || TARGET_MACHO|| TARGET_SEH || TARGET_PECOFF) + return false; + + if (SYMBOL_REF_LOCAL_P (call_op)) + return false; + + tree symbol_decl = SYMBOL_REF_DECL (call_op); + + if (symbol_decl != NULL_TREE + && TREE_CODE (symbol_decl) == FUNCTION_DECL + && (!flag_plt + || lookup_attribute ("no_plt", DECL_ATTRIBUTES (symbol_decl)))) + return true; + + return false; +} + /* Output the assembly for a call instruction. */ const char * @@ -25610,7 +25646,9 @@ ix86_output_call_insn (rtx_insn *insn, rtx call_op if (SIBLING_CALL_P (insn)) { - if (direct_p) + if (direct_p && nopic_no_plt_attribute (call_op)) + xasm = "%!jmp\t*%p0@GOTPCREL(%%rip)"; + else if (direct_p) xasm = "%!jmp\t%P0"; /* SEH epilogue detection requires the indirect branch case to include REX.W. */ @@ -25653,7 +25691,9 @@ ix86_output_call_insn (rtx_insn *insn, rtx call_op seh_nop_p = true; } - if (direct_p) + if (direct_p && nopic_no_plt_attribute (call_op)) + xasm = "%!call\t*%p0@GOTPCREL(%%rip)"; + else if (direct_p) xasm = "%!call\t%P0"; else xasm = "%!call\t%A0"; Index: doc/extend.texi =================================================================== --- doc/extend.texi (revision 223720) +++ doc/extend.texi (working copy) @@ -2916,6 +2916,15 @@ the standard C library can be guaranteed not to th with the notable exceptions of @code{qsort} and @code{bsearch} that take function pointer arguments. +@item no_plt +@cindex @code{no_plt} function attribute +The @code{no_plt} attribute is used to inform the compiler that a calls +to the function should not use the PLT. For example, external functions +defined in shared objects are called from the executable using the PLT. +This attribute on the function declaration calls these functions indirectly +rather than going via the PLT. This is similar to @option{-fno-plt} but +is only applicable to calls to the function marked with this attribute. + @item optimize @cindex @code{optimize} function attribute The @code{optimize} attribute is used to specify that a function is to Index: testsuite/gcc.target/i386/noplt-1.c =================================================================== --- testsuite/gcc.target/i386/noplt-1.c (revision 0) +++ testsuite/gcc.target/i386/noplt-1.c (working copy) @@ -0,0 +1,13 @@ +/* { dg-do compile { target x86_64-*-linux* } } */ +/* { dg-options "-fno-pic" } */ + +__attribute__ ((no_plt)) +void foo(); + +int main() +{ + foo(); + return 0; +} + +/* { dg-final { scan-assembler "call\[ \t\]\\*.*foo.*@GOTPCREL\\(%rip\\)" } } */ Index: testsuite/gcc.target/i386/noplt-2.c =================================================================== --- testsuite/gcc.target/i386/noplt-2.c (revision 0) +++ testsuite/gcc.target/i386/noplt-2.c (working copy) @@ -0,0 +1,13 @@ +/* { dg-do compile { target x86_64-*-linux* } } */ +/* { dg-options "-O2 -fno-pic" } */ + + +__attribute__ ((no_plt)) +int foo(); + +int main() +{ + return foo(); +} + +/* { dg-final { scan-assembler "jmp\[ \t\]\\*.*foo.*@GOTPCREL\\(%rip\\)" } } */ Index: testsuite/gcc.target/i386/noplt-3.c =================================================================== --- testsuite/gcc.target/i386/noplt-3.c (revision 0) +++ testsuite/gcc.target/i386/noplt-3.c (working copy) @@ -0,0 +1,12 @@ +/* { dg-do compile { target x86_64-*-linux* } } */ +/* { dg-options "-fno-pic -fno-plt" } */ + +void foo(); + +int main() +{ + foo(); + return 0; +} + +/* { dg-final { scan-assembler "call\[ \t\]\\*.*foo.*@GOTPCREL\\(%rip\\)" } } */ Index: testsuite/gcc.target/i386/noplt-4.c =================================================================== --- testsuite/gcc.target/i386/noplt-4.c (revision 0) +++ testsuite/gcc.target/i386/noplt-4.c (working copy) @@ -0,0 +1,11 @@ +/* { dg-do compile { target x86_64-*-linux* } } */ +/* { dg-options "-O2 -fno-pic -fno-plt" } */ + +int foo(); + +int main() +{ + return foo(); +} + +/* { dg-final { scan-assembler "jmp\[ \t\]\\*.*foo.*@GOTPCREL\\(%rip\\)" } } */