VitaNuo added a comment.

Thank you all for comments! The patch should be ready for the next review now.



================
Comment at: clang/include/clang/Tooling/Inclusions/StdSymbolMap.inc:1053
 SYMBOL(remainder, std::, <cmath>)
+SYMBOL(remove, std::, <cstdio>)
 SYMBOL(remove_all_extents, std::, <type_traits>)
----------------
VitaNuo wrote:
> hokein wrote:
> > I think `std::remove` should not be in the scope of the patch, it has two 
> > variants:
> > - std::remove from `<cstdio>`
> > - and std::remove from `<algorithm>`.
> Yes, agreed. Let's take care of the overloads going forward.
The latest version keeps the `std::remove` overload from `<algorithm>`.


================
Comment at: clang/lib/Tooling/Inclusions/Stdlib/StandardLibrary.cpp:58
+    int SymIndex = NextAvailSymIndex;
+    if (NSSymbolMap *NSSymbols = NamespaceSymbols->lookup(NS)) {
+      auto It = NSSymbols->find(Name);
----------------
hokein wrote:
> VitaNuo wrote:
> > hokein wrote:
> > > Given the fact that multiple-header symbols are grouped in the .inc file, 
> > > we could simplify the code of checking a new symbol by looking at the 
> > > last available SymIndex:
> > > 
> > > ```
> > > if (NextAvailSymIndex > 0 && SymbolNames[NextAvailSymIndex-1].first == NS 
> > > && SymbolNames[NextAvailSymIndex-1].second == Name) {
> > >    // this is not a new symbol.
> > > }
> > > ```
> > I know this is easier in terms of code, but this heavily relies on the 
> > order in the generated files. I would prefer to keep the library and the 
> > generator as decoupled as possible, even if it means slightly more complex 
> > code here. Overall, it seems more future-proof in case of unrelated 
> > generator changes (bugs?) that might change the order.
> > but this heavily relies on the order in the generated files.
> 
> Yeah, this is true. I actually think we should make it as an invariant 
> (multiple-header symbols are grouped) of the generated .inc files. This 
> invariant is important and useful, it is much easier for human to read and 
> justify. We can probably guarantee it in the generator side.
Ok, sure. Sorted the symbols in the generator now and also applied the snippet 
from above. I'm not 100% sure the code became much simpler but this version 
seems fine too :)


================
Comment at: clang/tools/include-mapping/cppreference_parser.py:174
-      # std::remove<> has variant algorithm.
-      "std::remove": ("algorithm"),
-  }
----------------
kadircet wrote:
> VitaNuo wrote:
> > VitaNuo wrote:
> > > kadircet wrote:
> > > > VitaNuo wrote:
> > > > > kadircet wrote:
> > > > > > this is actually checking for something else (sorry for the 
> > > > > > confusing naming).
> > > > > > 
> > > > > > the `variant` here refers to library name mentioned in parentheses 
> > > > > > (this is same problem as `std::move`) on the std symbol index page 
> > > > > > https://en.cppreference.com/w/cpp/symbol_index (e.g. `remove<>() 
> > > > > > (algorithm)`). by getting rid of this we're introducing a 
> > > > > > regression, as previously `std::remove` wouldn't be recognized by 
> > > > > > the library, but now it'll be recognized and we'll keep suggesting 
> > > > > > `<cstdio>` for it.
> > > > > > 
> > > > > > so we should actually keep this around.
> > > > > Ok, I can keep this out of this patch, but we'll have to remove this 
> > > > > logic evetually when we deal with overloads.
> > > > > 
> > > > > I have a slight suspicion that this code might be buggy, because it 
> > > > > suggests that one _of_ the variants should be accepted. What is does 
> > > > > in reality, though, is it keeps `algorithm` in the list of headers 
> > > > > suitable for `std::remove` alongside `cstdio`, and then in the last 
> > > > > step `std::remove` is ignored by the generator because of being 
> > > > > defined in two headers.
> > > > > 
> > > > > With this patch, the result will be both `{cstdio, algorithm}`. Is 
> > > > > this (more) satisfactory for now compared to skipping `algorithm` due 
> > > > > to being an overload?
> > > > > 
> > > > > Ok, I can keep this out of this patch, but we'll have to remove this 
> > > > > logic evetually when we deal with overloads.
> > > > 
> > > > Surely, I wasn't saying this should stay here forever, i am just saying 
> > > > that what's done in the scope of this patch doesn't really address the 
> > > > issues "worked around" by this piece.
> > > > 
> > > > > I have a slight suspicion that this code might be buggy, because it 
> > > > > suggests that one _of_ the variants should be accepted. What is does 
> > > > > in reality, though, is it keeps algorithm in the list of headers 
> > > > > suitable for std::remove alongside cstdio, and then in the last step 
> > > > > std::remove is ignored by the generator because of being defined in 
> > > > > two headers.
> > > > 
> > > > right, it's because we have logic to prefer "non-variant" versions of 
> > > > symbols when available (i.e. in the absence of this logic, we'd prefer 
> > > > std::remove from cstdio). this logic enables us to preserve certain 
> > > > variants (in addition to non-variants). that way we treat std::remove 
> > > > as ambigious rather than always resolving to <cstdio>, hence it's 
> > > > marked as "missing", similar to `std::move`.
> > > > 
> > > > > With this patch, the result will be both {cstdio, algorithm}. Is this 
> > > > > (more) satisfactory for now compared to skipping algorithm due to 
> > > > > being an overload?
> > > > 
> > > > in the end this should probably look like {algorithm, cstdio}, but as 
> > > > mentioned elsewhere, this is not the same problem as "same symbol being 
> > > > provided by multiple header" but rather "different symbols having same 
> > > > name and different headers". so treatment of it shouldn't change by 
> > > > this patch.
> > > On second thought, I think we'd rather special-case the overloads for now 
> > > (until we get to them). See the newest patch version.
> > > right, it's because we have logic to prefer "non-variant" versions of 
> > > symbols when available (i.e. in the absence of this logic, we'd prefer 
> > > std::remove from cstdio).
> > 
> > Where is this logic? AFAICS the generator in the current state doesn't 
> > generate anything for std::remove. 
> > Where is this logic? AFAICS the generator in the current state doesn't 
> > generate anything for std::remove.
> 
> It's the logic that you're deleting:
> ```
>     if variant and variant not in variants_for_symbol:
>         continue
> ```
> 
> we ignore any symbols that has a variant we shouldn't accept and always 
> prefer the non-variant versions. to be more concrete when parsing c++ symbol 
> index we'll see two alternatives for `std::remove`:
> 
> ```
> remove()
> remove<>() (algorithm)
> ```
> 
> first one is a non-variant, hence it's accepted by default. second one is 
> `algorithm` variant, and is accepted per this deleted logic. in the end we 
> **used** to didn't generate anything because we now have multiple headers 
> providing `std::remove`.
> 
> i think it creates more confusion to special case only std::remove down 
> below, and not other symbols whose non-variant versions we accept, e.g. sin 
> also has non-variant versions, but we don't "special case" it to keep it out 
> of the map.
> 
> i'd suggest treating std::remove similar to other symbols with an accepted 
> variant, rather than creating a divergence. so my suggestion is changing the 
> logic above to **only** accept variants mentioned in the map when there's a 
> mapping for the symbol (and reject the non-variant version). net effect will 
> be that we'll now include the mapping from `std::remove` to `<algorithm>`, 
> which is currently inline with our behaviour against other symbols with 
> different variants. we try to pick the most common variant, and that's almost 
> always the "non-variant" version, apart from `std::remove`.
> 
> that way we should get rid of the special casing below completely. does that 
> make sense?
Ok, I've tweaked the logic to keep `std::remove` from `<algorithm>` now.


================
Comment at: clang/tools/include-mapping/gen_std.py:112
   for symbol in symbols:
-    if len(symbol.headers) == 1:
-      # SYMBOL(unqualified_name, namespace, header)
-      print("SYMBOL(%s, %s, %s)" % (symbol.name, symbol.namespace,
-                                    symbol.headers[0]))
-    elif len(symbol.headers) == 0:
+    if re.match("^remove$|^swap$", symbol.name) and symbol.namespace == 
"std::":
+      continue
----------------
kadircet wrote:
> different headers providing `std::swap` don't provide different variants of 
> it, these are just ADL extension points, similar to std::begin (which also 
> shouldn't be missing from the mapping).
> 
> for `remove`, i've provided more context in the discussion above, about 
> keeping a different variant of `std::remove` around.
Thank you. The latest version should take care both of the overloads that have 
accidentally made it to this patch and of the `std::remove` case. 


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D142092/new/

https://reviews.llvm.org/D142092

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to