cor3ntin added inline comments.
================ Comment at: clang/lib/Sema/SemaExpr.cpp:2010-2013 + OS << "R\"EFLPM(" + << PredefinedExpr::ComputeName(getPredefinedExprKind(Tok.getKind()), + currentDecl) + << ")EFLPM\""; ---------------- tahonermann wrote: > RIscRIpt wrote: > > tahonermann wrote: > > > "EFLPM"? I'm assuming this is an attempt to avoid an accidental clash > > > with the computed name. Does this suffice to ensure a clash can't happen? > > > If not, can we do better? Per http://eel.is/c++draft/lex.string#nt:r-char > > > and http://eel.is/c++draft/lex.charset#1, we have a lot of flexibility > > > regarding which characters to use. > > At first I thought you were hinting me to use non-ascii characters for > > d-char-sequence. However, when I looked closely d-char-sequence is not as > > flexible as r-chars that you referenced: > > http://eel.is/c++draft/lex.string#nt:d-char and > > http://eel.is/c++draft/lex.charset#2 > > > > Here're my thoughts: > > 1. I was afraid that there could be a possibility of a function name > > contain either quote (`"`) or a backslash (`\`) despite it seemed > > impossible. > > 2. At first I used `Lexer::Stringify` to make it bullet-proof. I didn't > > like it, neither did you (reviewers). > > 3. I replaced `Lexer::Stringify` with raw-string-literal (d-char-sequence > > was chosen as acronym of current function name). > > 4. Current `EFLPM` looks weird and cryptic. And we are limited to > > [lex.charset.basic] with exceptions (space and earlier chars are not > > allowed). I think, using a raw-string-literal without d-char-sequence would > > be enough, because problem could be caused only by two consecutive > > characters `)"`, neither of which can appear in a function name. > Sorry about that; you are absolutely right that `d-char-sequence` is (much) > more constrained than `r-char-sequence`. > > For `__FUNCSIG__`, the generated text can include arbitrary text. For > example, given this declaration: > void f(decltype(sizeof("()"))*) > the macro value is: > void __cdecl f(decltype(sizeof ("()")) *) > which does contain `)"`. > > I think we should do more to avoid any possibility of the resulting string > being unparseable because the resulting diagnostics will be completely > inscrutable. The best way to do that would be to avoid trying to produce a > parseable string literal in the first place. Based on that and given that > `Sema::ActOnStringLiteral()` immediately uses `StringLiteralParser` to > produce an expanded string literal, I think we would be better off moving > this expansion there such that these macros can be expanded to already cooked > string literals that don't need to be parsed at all. This will mean having to > modify the `StringLiteralParser` constructor to accept a `Decl*` pointer that > will be passed `getCurLocalScopeDecl()` in `Sema::ActOnStringLiteral()`. > `StringLiteralParser` can then assert if it ever encounters one of these > function local predefined tokens when it hasn't been provided a corresponding > `Decl*` to use with them. I didn't think about that scenario but in that case the original idea to use Lexer::Stringify seems like a cleaner solution to me. Adding that kind of logic to `LiteralSupport` creates a lot of coupling for not much gain that I can see. In that light, `Lexer::Stringify` seems like the least worse option. Repository: rG LLVM Github Monorepo CHANGES SINCE LAST ACTION https://reviews.llvm.org/D153914/new/ https://reviews.llvm.org/D153914 _______________________________________________ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits