DoDoENT added inline comments.
================ Comment at: clang/include/clang/AST/PrettyPrinter.h:307 + /// decltype(s) will be printed as "S<Point{1,2}>" if enabled and as "S<{1,2}>" if disabled, + /// regardless if PrintCanonicalTypes is enabled. + unsigned AlwaysIncludeTypeForNonTypeTemplateArgument : 1; ---------------- dblaikie wrote: > aaron.ballman wrote: > > DoDoENT wrote: > > > aaron.ballman wrote: > > > > dblaikie wrote: > > > > > DoDoENT wrote: > > > > > > dblaikie wrote: > > > > > > > What does `PrintCanonicalTypes` have to do with this? Does it > > > > > > > overlap with this functionality in some way, but doesn't provide > > > > > > > the functionality you want in particular? > > > > > > Thank you for the question. If you set the `PrintCanonicalTypes` to > > > > > > `false`, the `S<Point{1, 2}>` would be printed as `S<Point{1, 2}>` > > > > > > even without this patch. However, if you set it to `true`, it will > > > > > > be printed as `S<{1, 2}>`. > > > > > > > > > > > > I don't fully understand why it does that, but it's quite annoying. > > > > > > > > > > > > For a better example, please take a look at the > > > > > > `TemplateIdWithComplexFullTypeNTTP` unit tests that I've added: if > > > > > > `PrintCanonicalTypes` is set to `true`, the original print output > > > > > > of type is `NDArray<float, {{{0}}}, {{{0}}}, {{{0}}}>`, and if set > > > > > > to `false` (which is default), the output is `NDArray<float, > > > > > > Height{{{0}}}, Width{{{0}}}, Channels{{{0}}}>` - so the NTTP type > > > > > > is neither fully written nor fully omitted, which is weird. > > > > > > > > > > > > As I said, I don't really understand the idea behind > > > > > > `PrintCanonicalTypes`, but when my new > > > > > > `AlwaysIncludeTypeForNonTypeTemplateArgument` is enabled, you will > > > > > > get the full type printed, regardless of value of > > > > > > `PrintCanonicalTypes` setting. > > > > > > > > > > > Perhaps this might be more of a bug in PrintCanonicalTypes than > > > > > something to add a separate flag for. > > > > > > > > > > @aaron.ballman D55552 for context here... > > > > > > > > > > Hmm, actually, just adding the top level `Height{{0}}, Width{{0}}, > > > > > Channels{{0}}` is sufficient to make this code compile (whereas with > > > > > the `{{{0}}}` it doesn't form a valid identifier. > > > > > > > > > > So what's your use case for needing more explicitness than that top > > > > > level? > > > > > Perhaps this might be more of a bug in PrintCanonicalTypes than > > > > > something to add a separate flag for. > > > > > > > > > > @aaron.ballman D55552 for context here... > > > > > > > > I looked over D55552 again and haven't spotted anything with it that > > > > seems amiss; the change there is to grab the canonical type before > > > > trying to print it which is all the more I'd expect > > > > `PrintCanonicalTypes` to impact. > > > > > > > > This looks like the behavior you'd get when you desugar the type. Check > > > > out the AST dump for `s`: https://godbolt.org/z/vxh5j6qWr > > > > ``` > > > > `-VarDecl <line:3:1, col:20> col:20 s 'S<Point{1, 2}>':'S<{1, 2}>' > > > > callinit > > > > ``` > > > > We generate that type information at > > > > https://github.com/llvm/llvm-project/blob/main/clang/lib/AST/TextNodeDumper.cpp#L663 > > > > for doing the AST dump, note how the second type printed is the > > > > desugared type and that matches what we're seeing from the pretty > > > > printer. > > > > So what's your use case for needing more explicitness than that top > > > > level? > > > > > > As I described in the [github > > > issue](https://github.com/llvm/llvm-project/issues/57562), I'm trying to > > > write a clang-based tool that will have different behavior if the printed > > > `{{{0}}}` is actually `Width` than if its `Height` or anything else. > > > > > > You can see the the issue in the AST dump for `bla`: > > > https://godbolt.org/z/fMr4f13o3 > > > > > > The type is > > > ``` > > > `-VarDecl <line:20:1, col:21> col:21 bla 'NDArray<float, > > > W>':'NDArray<float, {{{0}}}>' callinit > > > `-CXXConstructExpr <col:21> 'NDArray<float, W>':'NDArray<float, > > > {{{0}}}>' 'void () noexcept' > > > ``` > > > > > > so it's unknown whether `{{{0}}}` represent the `Width` or `Height`. My > > > patch makes it work exactly like GCC (see the comparison of error message > > > between [clang 15 and GCC 12.1](https://godbolt.org/z/WenWe8caf). > > > > > > > Perhaps this might be more of a bug in PrintCanonicalTypes than > > > > something to add a separate flag for. > > > > > > This was also my first thought and the first version of my patch (before > > > even submitting it here) was to actually change the behavior of > > > `PrintCanonicalTypes`. However, that change made some tests fail, as I > > > described in the patch description: > > > > > > - CodeGenCXX/debug-info-template.cpp > > > - SemaCXX/constexpr-printing.cpp > > > - SemaCXX/cxx2a-nttp-printing.cpp > > > - SemaTemplate/temp_arg_string_printing.cpp > > > > > > Of course, it's possible to simply update the tests, but I actually don't > > > fully understand what is the goal of `PrintCanonicalTypes` and whether > > > its current behavior is actually desired or not, so I played it safe and > > > introduced a new policy that is disabled by default until I get more > > > feedback from more experienced LLVM developers. > > > > > > The patch does solve my problem and since I'm building LLVM from source > > > anyway, I can have it enabled in my fork. > > > > > > I just want to see if it would also be beneficial to be introduced into > > > the upstream LLVM. > > > Of course, it's possible to simply update the tests, but I actually don't > > > fully understand what is the goal of PrintCanonicalTypes and whether its > > > current behavior is actually desired or not, so I played it safe and > > > introduced a new policy that is disabled by default until I get more > > > feedback from more experienced LLVM developers. > > > > `PrintCanonicalTypes` is what controls output like whether we print a > > typedef name (not the canonical type) or the final underlying type all > > typedefs involved (the canonical type). It won't have an impact on things > > like whether we print the name of a structure or not. > > > > After looking into this, I think we do want changes here. I'm not 100% > > convinced we need a new policy member. I tried out printing the type names > > unconditionally and the results were a bit of a mixed bag (some diagnostics > > got more clear, other diagnostics didn't become more clear but also didn't > > become more confusing): > > ``` > > error: 'note' diagnostics expected but not seen: > > File > > F:\source\llvm-project\clang\test\SemaTemplate\temp_arg_nontype_cxx20.cpp > > Line 189: in instantiation of template class 'Diags::X<{1, 2}>' requested > > here > > error: 'note' diagnostics seen but not expected: > > File > > F:\source\llvm-project\clang\test\SemaTemplate\temp_arg_nontype_cxx20.cpp > > Line 189: in instantiation of template class 'Diags::X<Diags::A{1, 2}>' > > requested here > > ``` > > seems like a clarifying change, while: > > ``` > > error: 'error' diagnostics expected but not seen: > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > Line 19: no member named 'display' in 'ASCII<{"this nontype template > > argument is [...]"}>' > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > Line 25: no member named 'display' in 'ASCII<{{119, 97, 105, 116, 32, 97, > > 32, 115, 27, 99, ...}}>' > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > Line 33: no member named 'display' in 'ASCII<{"what??!"}>' > > error: 'error' diagnostics seen but not expected: > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > Line 19: no member named 'display' in 'ASCII<Str<43>{"this nontype template > > argument is [...]"}>' > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > Line 25: no member named 'display' in 'ASCII<Str<14>{{119, 97, 105, 116, > > 32, 97, 32, 115, 27, 99, ...}}>' > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > Line 33: no member named 'display' in 'ASCII<Str<8>{"what??!"}>' > > ``` > > seems like it's neither here nor there. > > > > @dblaikie, do you have feelings on how to go with this? > Yeah, I'm inclined to think that `Diags::X<{1, 2}>` is just too simplified - > it's unambiguous if the parameter isn't `auto`, but isn't valid syntax (so > the language still expects a type name there) & so maybe we should do the > same in diagnostics? > > @aaron.ballman - though I'm still confused by the behavior-change when > `PrintCanonicalTypes = false` that causes the top level names to be printed? > Maybe that's just a weird case/red herring/bug in `PrintCanonicalTypes = > true` that skips those top level names and we could print them > unconditionally? > > @DoDoENT - I was/am curious if/why you need more explicitness than would be > provided by `PrintCanonicalTypes = false` - if the output was `NDArray<float, > Height{{{0}}}, Width{{{0}}}, Channels{{{0}}}>` would that be sufficient for > your needs? (I think in that case the name would be valid to use in code, > which I think is a reasonable basis to make the decision - but I'm not sure > how to justify adding all the intermediate names too) > > `PrintCanonicalTypes` is what controls output like whether we print a typedef > name (not the canonical type) or the final underlying type all typedefs > involved (the canonical type). It won't have an impact on things like whether > we print the name of a structure or not. Thank you for the explanation. > I was/am curious if/why you need more explicitness than would be provided by > `PrintCanonicalTypes = false` - if the output was `NDArray<float, > Height{{{0}}}, Width{{{0}}}, Channels{{{0}}}>` would that be sufficient for > your needs? It would for now (at least until I actually start needing to get the types of inner braces), however, in a real-world scenario I actually get output `NDArray<float, {{{0}}}, {{{0}}}, {{{0}}}>`, regardless of the value of `PrintCanonicalTypes`. I've tried to create a minimum reproducer for that case, but, unfortunately, all simple cases resolve to actually display the names `Height`, `Width`, etc. The closes thing to the real-world scenario is the diagnostic message shown [here](https://godbolt.org/z/WenWe8caf). However, in my real code, where there are multiple levels of type deduction happening, I always get the output without a type name, regardless of the value of `PrintCanonicalTypes`. This is why I took some time to debug what is actually happening within clang and created this patch which now works for me in all cases. ================ Comment at: clang/include/clang/AST/PrettyPrinter.h:307 + /// decltype(s) will be printed as "S<Point{1,2}>" if enabled and as "S<{1,2}>" if disabled, + /// regardless if PrintCanonicalTypes is enabled. + unsigned AlwaysIncludeTypeForNonTypeTemplateArgument : 1; ---------------- DoDoENT wrote: > dblaikie wrote: > > aaron.ballman wrote: > > > DoDoENT wrote: > > > > aaron.ballman wrote: > > > > > dblaikie wrote: > > > > > > DoDoENT wrote: > > > > > > > dblaikie wrote: > > > > > > > > What does `PrintCanonicalTypes` have to do with this? Does it > > > > > > > > overlap with this functionality in some way, but doesn't > > > > > > > > provide the functionality you want in particular? > > > > > > > Thank you for the question. If you set the `PrintCanonicalTypes` > > > > > > > to `false`, the `S<Point{1, 2}>` would be printed as `S<Point{1, > > > > > > > 2}>` even without this patch. However, if you set it to `true`, > > > > > > > it will be printed as `S<{1, 2}>`. > > > > > > > > > > > > > > I don't fully understand why it does that, but it's quite > > > > > > > annoying. > > > > > > > > > > > > > > For a better example, please take a look at the > > > > > > > `TemplateIdWithComplexFullTypeNTTP` unit tests that I've added: > > > > > > > if `PrintCanonicalTypes` is set to `true`, the original print > > > > > > > output of type is `NDArray<float, {{{0}}}, {{{0}}}, {{{0}}}>`, > > > > > > > and if set to `false` (which is default), the output is > > > > > > > `NDArray<float, Height{{{0}}}, Width{{{0}}}, Channels{{{0}}}>` - > > > > > > > so the NTTP type is neither fully written nor fully omitted, > > > > > > > which is weird. > > > > > > > > > > > > > > As I said, I don't really understand the idea behind > > > > > > > `PrintCanonicalTypes`, but when my new > > > > > > > `AlwaysIncludeTypeForNonTypeTemplateArgument` is enabled, you > > > > > > > will get the full type printed, regardless of value of > > > > > > > `PrintCanonicalTypes` setting. > > > > > > > > > > > > > Perhaps this might be more of a bug in PrintCanonicalTypes than > > > > > > something to add a separate flag for. > > > > > > > > > > > > @aaron.ballman D55552 for context here... > > > > > > > > > > > > Hmm, actually, just adding the top level `Height{{0}}, Width{{0}}, > > > > > > Channels{{0}}` is sufficient to make this code compile (whereas > > > > > > with the `{{{0}}}` it doesn't form a valid identifier. > > > > > > > > > > > > So what's your use case for needing more explicitness than that top > > > > > > level? > > > > > > Perhaps this might be more of a bug in PrintCanonicalTypes than > > > > > > something to add a separate flag for. > > > > > > > > > > > > @aaron.ballman D55552 for context here... > > > > > > > > > > I looked over D55552 again and haven't spotted anything with it that > > > > > seems amiss; the change there is to grab the canonical type before > > > > > trying to print it which is all the more I'd expect > > > > > `PrintCanonicalTypes` to impact. > > > > > > > > > > This looks like the behavior you'd get when you desugar the type. > > > > > Check out the AST dump for `s`: https://godbolt.org/z/vxh5j6qWr > > > > > ``` > > > > > `-VarDecl <line:3:1, col:20> col:20 s 'S<Point{1, 2}>':'S<{1, 2}>' > > > > > callinit > > > > > ``` > > > > > We generate that type information at > > > > > https://github.com/llvm/llvm-project/blob/main/clang/lib/AST/TextNodeDumper.cpp#L663 > > > > > for doing the AST dump, note how the second type printed is the > > > > > desugared type and that matches what we're seeing from the pretty > > > > > printer. > > > > > So what's your use case for needing more explicitness than that top > > > > > level? > > > > > > > > As I described in the [github > > > > issue](https://github.com/llvm/llvm-project/issues/57562), I'm trying > > > > to write a clang-based tool that will have different behavior if the > > > > printed `{{{0}}}` is actually `Width` than if its `Height` or anything > > > > else. > > > > > > > > You can see the the issue in the AST dump for `bla`: > > > > https://godbolt.org/z/fMr4f13o3 > > > > > > > > The type is > > > > ``` > > > > `-VarDecl <line:20:1, col:21> col:21 bla 'NDArray<float, > > > > W>':'NDArray<float, {{{0}}}>' callinit > > > > `-CXXConstructExpr <col:21> 'NDArray<float, W>':'NDArray<float, > > > > {{{0}}}>' 'void () noexcept' > > > > ``` > > > > > > > > so it's unknown whether `{{{0}}}` represent the `Width` or `Height`. My > > > > patch makes it work exactly like GCC (see the comparison of error > > > > message between [clang 15 and GCC > > > > 12.1](https://godbolt.org/z/WenWe8caf). > > > > > > > > > Perhaps this might be more of a bug in PrintCanonicalTypes than > > > > > something to add a separate flag for. > > > > > > > > This was also my first thought and the first version of my patch > > > > (before even submitting it here) was to actually change the behavior of > > > > `PrintCanonicalTypes`. However, that change made some tests fail, as I > > > > described in the patch description: > > > > > > > > - CodeGenCXX/debug-info-template.cpp > > > > - SemaCXX/constexpr-printing.cpp > > > > - SemaCXX/cxx2a-nttp-printing.cpp > > > > - SemaTemplate/temp_arg_string_printing.cpp > > > > > > > > Of course, it's possible to simply update the tests, but I actually > > > > don't fully understand what is the goal of `PrintCanonicalTypes` and > > > > whether its current behavior is actually desired or not, so I played it > > > > safe and introduced a new policy that is disabled by default until I > > > > get more feedback from more experienced LLVM developers. > > > > > > > > The patch does solve my problem and since I'm building LLVM from source > > > > anyway, I can have it enabled in my fork. > > > > > > > > I just want to see if it would also be beneficial to be introduced into > > > > the upstream LLVM. > > > > Of course, it's possible to simply update the tests, but I actually > > > > don't fully understand what is the goal of PrintCanonicalTypes and > > > > whether its current behavior is actually desired or not, so I played it > > > > safe and introduced a new policy that is disabled by default until I > > > > get more feedback from more experienced LLVM developers. > > > > > > `PrintCanonicalTypes` is what controls output like whether we print a > > > typedef name (not the canonical type) or the final underlying type all > > > typedefs involved (the canonical type). It won't have an impact on things > > > like whether we print the name of a structure or not. > > > > > > After looking into this, I think we do want changes here. I'm not 100% > > > convinced we need a new policy member. I tried out printing the type > > > names unconditionally and the results were a bit of a mixed bag (some > > > diagnostics got more clear, other diagnostics didn't become more clear > > > but also didn't become more confusing): > > > ``` > > > error: 'note' diagnostics expected but not seen: > > > File > > > F:\source\llvm-project\clang\test\SemaTemplate\temp_arg_nontype_cxx20.cpp > > > Line 189: in instantiation of template class 'Diags::X<{1, 2}>' requested > > > here > > > error: 'note' diagnostics seen but not expected: > > > File > > > F:\source\llvm-project\clang\test\SemaTemplate\temp_arg_nontype_cxx20.cpp > > > Line 189: in instantiation of template class 'Diags::X<Diags::A{1, 2}>' > > > requested here > > > ``` > > > seems like a clarifying change, while: > > > ``` > > > error: 'error' diagnostics expected but not seen: > > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > > Line 19: no member named 'display' in 'ASCII<{"this nontype template > > > argument is [...]"}>' > > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > > Line 25: no member named 'display' in 'ASCII<{{119, 97, 105, 116, 32, 97, > > > 32, 115, 27, 99, ...}}>' > > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > > Line 33: no member named 'display' in 'ASCII<{"what??!"}>' > > > error: 'error' diagnostics seen but not expected: > > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > > Line 19: no member named 'display' in 'ASCII<Str<43>{"this nontype > > > template argument is [...]"}>' > > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > > Line 25: no member named 'display' in 'ASCII<Str<14>{{119, 97, 105, 116, > > > 32, 97, 32, 115, 27, 99, ...}}>' > > > File F:\source\llvm-project\clang\test\SemaCXX\cxx2a-nttp-printing.cpp > > > Line 33: no member named 'display' in 'ASCII<Str<8>{"what??!"}>' > > > ``` > > > seems like it's neither here nor there. > > > > > > @dblaikie, do you have feelings on how to go with this? > > Yeah, I'm inclined to think that `Diags::X<{1, 2}>` is just too simplified > > - it's unambiguous if the parameter isn't `auto`, but isn't valid syntax > > (so the language still expects a type name there) & so maybe we should do > > the same in diagnostics? > > > > @aaron.ballman - though I'm still confused by the behavior-change when > > `PrintCanonicalTypes = false` that causes the top level names to be > > printed? Maybe that's just a weird case/red herring/bug in > > `PrintCanonicalTypes = true` that skips those top level names and we could > > print them unconditionally? > > > > @DoDoENT - I was/am curious if/why you need more explicitness than would be > > provided by `PrintCanonicalTypes = false` - if the output was > > `NDArray<float, Height{{{0}}}, Width{{{0}}}, Channels{{{0}}}>` would that > > be sufficient for your needs? (I think in that case the name would be valid > > to use in code, which I think is a reasonable basis to make the decision - > > but I'm not sure how to justify adding all the intermediate names too) > > > > `PrintCanonicalTypes` is what controls output like whether we print a > > typedef name (not the canonical type) or the final underlying type all > > typedefs involved (the canonical type). It won't have an impact on things > > like whether we print the name of a structure or not. > > Thank you for the explanation. > > > I was/am curious if/why you need more explicitness than would be provided > > by `PrintCanonicalTypes = false` - if the output was `NDArray<float, > > Height{{{0}}}, Width{{{0}}}, Channels{{{0}}}>` would that be sufficient for > > your needs? > > It would for now (at least until I actually start needing to get the types of > inner braces), however, in a real-world scenario I actually get output > `NDArray<float, {{{0}}}, {{{0}}}, {{{0}}}>`, regardless of the value of > `PrintCanonicalTypes`. I've tried to create a minimum reproducer for that > case, but, unfortunately, all simple cases resolve to actually display the > names `Height`, `Width`, etc. The closes thing to the real-world scenario is > the diagnostic message shown [here](https://godbolt.org/z/WenWe8caf). > > However, in my real code, where there are multiple levels of type deduction > happening, I always get the output without a type name, regardless of the > value of `PrintCanonicalTypes`. This is why I took some time to debug what is > actually happening within clang and created this patch which now works for me > in all cases. > Maybe that's just a weird case/red herring/bug in `PrintCanonicalTypes = > true` that skips those top level names and we could print them > unconditionally? From my debugging sessions, I saw that the difference happens during printing whether the type in question is deduced type or known type. If it's a known type, then in both cases the type gets printed (in a clang-based tool, not in the diagnostic as shown on Godbolt). However, if it's a deduced type, then the type gets or doesn't get printed depending on whether the canonical type is requested. If canonical type is requested from a deduced type, in AST it's actually represented as an "expression statement", not a "type", and then the type printer tries to print it as an "expression statement". This then leads to printing code in APValue and TemplateBase thinking that it's actually printing a normal C++ expression (as in the case of a clang-tidy or similar refactor tool), and not as a non-type template parameter, yielding printout with omitted types, as that is generally expected when writing code. However, if you are using the printer to get a full type name in a string for different purposes, as I do, then you get incomplete information. So, there may be cases when you actually need/want full-type info and cases when you don't want it. Therefore, I think some new policy config should be in place to let people choose the behavior. Repository: rG LLVM Github Monorepo CHANGES SINCE LAST ACTION https://reviews.llvm.org/D134453/new/ https://reviews.llvm.org/D134453 _______________________________________________ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits