yaxunl marked an inline comment as done.
yaxunl added a comment.

In D99683#2669136 <https://reviews.llvm.org/D99683#2669136>, @yaxunl wrote:

> In D99683#2669080 <https://reviews.llvm.org/D99683#2669080>, @tejohnson wrote:
>
>> In D99683#2669047 <https://reviews.llvm.org/D99683#2669047>, @yaxunl wrote:
>>
>>> In D99683#2664674 <https://reviews.llvm.org/D99683#2664674>, @tejohnson 
>>> wrote:
>>>
>>>> I haven't looked extensively yet, but why import noinline functions? Also, 
>>>> please add a patch description.
>>>
>>> AMDGPU backend does not support linking of object files containing external 
>>> symbols, i.e. one object file calling a function defined in another object 
>>> file. Therefore the LLVM module passed to AMDGPU backend needs to contain 
>>> definitions of all callees, even if a callee has noinline attribute. To 
>>> support backends like this, the function importer needs to be able to 
>>> import functions with noinline attribute. Therefore we add an LLVM option 
>>> for allowing that, which is off by default. We have comments at line 70 of 
>>> HIP.cpp about this.
>>
>> How does a non-LTO build work, or is (full) LTO currently required? Because 
>> with ThinLTO we only import functions that are externally defined but 
>> referenced in the current module. Also, when ThinLTO imports functions it 
>> makes them available_externally, which means they are dropped and made 
>> external symbols again after inlining. So anything imported but not inlined 
>> will go back to being an external symbol.
>
> AMDGPU backend by default uses full LTO for linking. It does not support 
> non-LTO linking. Currently, it inlines all functions except kernels. However 
> we want to be able to be able not to inline all functions. Is it OK to add an 
> LLVM option to mark imported functions as linkonce_odr so that AMDGPU backend 
> can keep the definitions of the imported functions?

Actually AMDGPU backend will internalize all non-kernel functions before 
codegen. Those functions with available_externally linkage will have internal 
linkage before codegen, therefore they will not be dropped.



================
Comment at: clang/include/clang/Driver/Options.td:1904-1907
+def foffload_lto : Flag<["-"], "foffload-lto">, Flags<[CoreOption]>, 
Group<f_Group>,
+  HelpText<"Enable LTO in 'full' mode for offload compilation">;
+def fno_offload_lto : Flag<["-"], "fno-offload-lto">, Flags<[CoreOption]>, 
Group<f_Group>,
+  HelpText<"Disable LTO mode (default) for offload compilation">;
----------------
jansvoboda11 wrote:
> yaxunl wrote:
> > tra wrote:
> > > Should it be `BoolFOption` ? 
> > Yes. will fix
> The `BoolFOption` marshalling multiclass should be only used for flags where 
> either the positive or the negative (or both) are -cc1 options and map to a 
> field in `CompilerInvocation`.
> 
> Since this patch only deals with the driver (not the cc1 frontend) using 
> `BoolFOption` is not correct. Please, revert this change to the previous 
> state.
> 
> I might need to explicitly call this out in the documentation 
> https://clang.llvm.org/docs/InternalsManual.html#adding-new-command-line-option.
will do


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D99683/new/

https://reviews.llvm.org/D99683

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to