ychen added inline comments.

================
Comment at: clang/lib/Sema/SemaTemplateDeduction.cpp:5511-5539
+  template <typename T1, typename T2,
+            std::enable_if_t<std::is_same<T1, T2>::value, bool> = true>
+  bool operator()(T1 *PS1, T2 *PS2) {
+    return hasEqualTemplateArgumentList(
+        PS1->getTemplateArgsAsWritten()->arguments(),
+        PS2->getTemplateArgsAsWritten()->arguments());
+  }
----------------
mizvekov wrote:
> mizvekov wrote:
> > mizvekov wrote:
> > > mizvekov wrote:
> > > > ychen wrote:
> > > > > mizvekov wrote:
> > > > > > I think you are not supposed to use the `TemplateArgsAsWritten` 
> > > > > > here.
> > > > > > 
> > > > > > The injected arguments are 'Converted' arguments, and the 
> > > > > > transformation above, by unpacking the arguments, is reversing just 
> > > > > > a tiny part of the conversion process.
> > > > > > 
> > > > > > It's not very meaningful to canonicalize the arguments as written 
> > > > > > to perform a semantic comparison, as that works well just for some 
> > > > > > kinds of template arguments, like types and templates, but not for 
> > > > > > other kinds in which the conversion process is not trivial.
> > > > > > 
> > > > > > For example, I think this may fail to compare the same integers 
> > > > > > written in different ways, like `2` vs `1 + 1`.
> > > > > Indeed. It can happen only when comparing one partial specialization 
> > > > > with another. I think the standard does not require an implementation 
> > > > > to deal with this but we could use the best effort without much 
> > > > > overhead. For `2` vs `1+1` or similar template arguments that are not 
> > > > > dependent, we could assume the equivalence because they wouldn't be 
> > > > > in the partial ordering stage if they're not equivalent. For more 
> > > > > complicated cases like `J+2` vs `J+1+1` where J is NTTP, let's stop 
> > > > > trying (match GCC) because the overhead is a little bit high. 
> > > > But I think the 'TemplateArgs', which are the specialization arguments 
> > > > and are available through `getTemplateArgs()`, are the converted 
> > > > arguments you want here, ie the AsWritten arguments converted against 
> > > > the template.
> > > > 
> > > > I don't see why you can't just use that.
> > > > 
> > > > How about we change:
> > > > ```
> > > >   if (!TemplateArgumentListAreEqual(S.getASTContext())(P1, P2))
> > > >     return nullptr;
> > > > ```
> > > > 
> > > > Into:
> > > > 
> > > > ```
> > > >   {
> > > >     ArrayRef<TemplateArgument> Args1 = P1->getTemplateArgs().asArray(), 
> > > > Args2;
> > > >     if constexpr (IsMoreSpecialThanPrimaryCheck)
> > > >       Args2 = P2->getInjectedTemplateArgs();
> > > >     else
> > > >       Args2 = P2->getTemplateArgs().asArray();
> > > > 
> > > >     if (Args1.size() != Args2.size())
> > > >       return nullptr;
> > > > 
> > > >     for (unsigned I = 0, E = Args1.size(); I < E; ++I) {
> > > >       TemplateArgument Arg2 = Args2[I];
> > > >       // Unlike the specialization arguments, the injected arguments 
> > > > are not
> > > >       // always canonical.
> > > >       if constexpr (IsMoreSpecialThanPrimaryCheck)
> > > >         Arg2 = S.Context.getCanonicalTemplateArgument(Arg2);
> > > > 
> > > >       // We use profile, instead of structural comparison of the 
> > > > arguments,
> > > >       // because canonicalization can't do the right thing for dependent
> > > >       // expressions.
> > > >       llvm::FoldingSetNodeID IDA, IDB;
> > > >       Args1[I].Profile(IDA, S.Context);
> > > >       Arg2.Profile(IDB, S.Context);
> > > >       if (IDA != IDB)
> > > >         return nullptr;
> > > >     }
> > > >   }
> > > > ```
> > > > 
> > > > That should work, right?
> > > Actually, you can even further simplify this.
> > > 
> > > You can't have two different specializations with the same specialization 
> > > arguments. These arguments are used as the key to unique them anyway.
> > > 
> > > So simplify my above suggestion to:
> > > ```
> > >   if constexpr (IsMoreSpecialThanPrimaryCheck) {
> > >     ArrayRef<TemplateArgument> Args1 = P1->getTemplateArgs().asArray(),
> > >                                Args2 = P2->getInjectedTemplateArgs();
> > >     if (Args1.size() != Args2.size())
> > >       return nullptr;
> > > 
> > >     for (unsigned I = 0, E = Args1.size(); I < E; ++I) {
> > >       // We use profile, instead of structural comparison of the 
> > > arguments,
> > >       // because canonicalization can't do the right thing for dependent
> > >       // expressions.
> > >       llvm::FoldingSetNodeID IDA, IDB;
> > >       Args1[I].Profile(IDA, S.Context);
> > >       // Unlike the specialization arguments, the injected arguments are 
> > > not
> > >       // always canonical.
> > >       S.Context.getCanonicalTemplateArgument(Args2[I]).Profile(IDB, 
> > > S.Context);
> > >       if (IDA != IDB)
> > >         return nullptr;
> > >     }
> > >   }
> > > ```
> > Yet another improvement here is that you can do this check once, when we 
> > create the specialization, and then simply store a 
> > `hasSameArgumentsAsPrimaryInjected` flag.
> > 
> > When we create the specialization, we already profile the specialization 
> > arguments anyway, so it's another computation you can avoid repeating.
> > 
> > Could even avoid repeating the profiling of the injected arguments if there 
> > was justified runtime overhead there, which I doubt, trading that off with 
> > increased memory use.
> It doesn't even need to be a flag to store in each SpecializationDecl, 
> because there can be only one specialization with the same arguments as the 
> primary injected arguments, for obvious reasons :D
> 
> So probably just a pointer in the TemplateDecl...
'TemplateArgs' is indeed what I needed, I don't know how I missed that. There 
could be multiple partial specializations with the same arguments as the 
primary injected arguments and they differ by constraints 
(https://godbolt.org/z/onME5ehEE). I'm unsure if adding a 
`hasSameArgumentsAsPrimaryInjected` flag would save much since the 
`IsMoreSpecialThanPrimaryCheck` is only performed once. We could avoid maybe 
one profiling by introducing the `hasSameArgumentsAsPrimaryInjected` flag 
though. On the other hand, it doesn't make the implementation much harder 
either. 

I did some compile-time measurements, and the results seem neutral 
https://llvm-compile-time-tracker.com/index.php?config=NewPM-O0-g&stat=instructions&remote=yuanfang-chen.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D128750/new/

https://reviews.llvm.org/D128750

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to