nickdesaulniers added a comment.

In D104342#2820811 <https://reviews.llvm.org/D104342#2820811>, @dblaikie wrote:

> what're the rules about module level attributes like this? For instance: Does 
> this affect/hinder inlining (if the attributes don't match between the caller 
> and callee)? Other situations that might matter?

I think you meant s/module level/function level/?  That's a good question, one 
I had to think about a little bit.  Here's my thoughts on the behavior this 
should exhibit, please let me know if you agree.

When using `-Wframe-larger-than=<threshold>` per TU, the developer wants to be 
alerted if any stack frame exceeds a threshold. The Linux kernel's use case is 
that the kernel's stack is limited to (usually) two pages (`ulimit -s`; 
typically 8KiB, but different architectures do support non-4KiB page sizes), so 
functions using more than 1KiB of stack are usually indicative of large objects 
being stack allocated that should have been heap allocated.

Currently, in C (and C with GNU extensions), there is no way to describe to the 
compiler function-grain specific values for `-Wframe-larger-than=`; rather than 
fine grain per function control, we only have coarse grain TU control.

So in the general case (non-LTO), we can only perform inline substitution at 
call sites visible from callers' definitions. Because there's no GNU C function 
attribute to change the current value of `-Wframe-larger-than=`, it's not 
possible for that value to differ between caller and callee.  But with LTO; 
shit gets weird.

Suddenly now with LTO, we have cross TU (cross Module) visibility into call 
sites, so we can inline across TU/Module boundaries.  Thus we can have an IR 
intermediary object with a call site where the caller's value of 
`-Wframe-larger-than=` differs from the callees!  So the question is what 
should happen in such a case?

The extremely conservative approach which we have done in the past for certain 
mismatched function attributes is to simply not perform inline substitution, if 
we have no other options.  This adds overhead to the inliner to check the XOR 
of attribute lists of the caller and callee for each call site.

But I *think* (and am open to sugguestions) that we should:

1. permit inline substitution
2. the caller's value of `"warn-stack-size"=` IR Fn Attr wins

I think this is ok because: if caller is defined in TU1 with 
`-Wframe-larger-than=` distinct from callee defined in TU2 with a different 
value of `-Wframe-larger-than=`, then we don't care what callee's value was. 
callee may even be DCE'd if it's inlined into a lone call site.  I'd expect in 
such cases that callee's value was larger than caller's, in which case callee 
should be attributed `no_inline` for LTO if the tighter threshold for caller 
now warns.  If callee's value was smaller than callers and we performed inline 
substitution, I think that's also perfectly fine, caller should not become 
"more strict."

Generally in the Linux kernel, we see a common value of `-Wframe-larger-than=` 
throughout most of the TUs, with only a few generally having a larger value to 
relax constraints a little. (Also, those relaxations are questionable, given 
the intent of `-Wframe-larger-than=` use in the kernel in the first place).

Let me add such a test to encode that intention; though I don't know yet what's 
involved/possible to implement. Let's see.



================
Comment at: llvm/lib/CodeGen/PrologEpilogInserter.cpp:282
+        .getValueAsString()
+        .getAsInteger(0, Threshold);
   if (StackSize > Threshold) {
----------------
dblaikie wrote:
> I guess the 0 value here is the default value if the value can't be parsed as 
> an integer? Is that desirable? I guess maybe we should ignore it (use 
> UINT_MAX here instead, maybe) and fail in the verifier.
> 
> But I guess if we fail in the verifier, then it doesn't really 
> matter/shouldn't be tested what the behavior is here when presented with 
> invalid IR.
> 
> (but this is a divergence from the module flag handling, which looks like it 
> does silently ignore non-numeric values, by using UINT_MAX)
IIUC, the first parameter to `getAsInteger` is the `Radix`, not the default 
value on failure to parse. But it does return `true` on error, so I should 
check that here.

I also should add a verifier check for this new function attribute.  While the 
"string key equals string value" attributes are quite flexible, it would be 
good to have some rigidity in requiring the string value to be parseable as an 
unsigned int.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D104342/new/

https://reviews.llvm.org/D104342

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to