VitaNuo marked 2 inline comments as done.
VitaNuo added inline comments.

================
Comment at: clang/include/clang/Tooling/Inclusions/StdSymbolMap.inc:106
-SYMBOL(atomic_exchange_explicit, std::, <atomic>)
-SYMBOL(atomic_fetch_add, std::, <atomic>)
-SYMBOL(atomic_fetch_add_explicit, std::, <atomic>)
----------------
hokein wrote:
> Looks like the regex filters too many symbols -- e.g. atomic_fetch_add is not 
> a variant symbol, we should keep it instead. The same to other following 
> symbols as well.
They're all back now.


================
Comment at: clang/include/clang/Tooling/Inclusions/StdSymbolMap.inc:100
 SYMBOL(atoll, std::, <cstdlib>)
+SYMBOL(atomic, std::, <atomic>)
+SYMBOL(atomic, std::, <memory>)
----------------
kadircet wrote:
> hokein wrote:
> > VitaNuo wrote:
> > > VitaNuo wrote:
> > > > kadircet wrote:
> > > > > hokein wrote:
> > > > > > Conceptually, this (and other `atomic_*` symbols) doesn't feel 
> > > > > > correct:
> > > > > > - `<atomic>` provides the generic template `template<class T> 
> > > > > > struct atomic;`
> > > > > > -  `<memory>` provides partial template specializations for 
> > > > > > `std::shared_ptr` and `std::weak_ptr`  
> > > > > > 
> > > > > > They are variant symbols (ideally, they should be treat as the 
> > > > > > `std::move()`). The issue here is that unlike `std::move` which has 
> > > > > > two individual entries in the index page, we only have one entry 
> > > > > > for `std::atomic` (extending the cppreference_parser.py script to 
> > > > > > perfectly distinguish these two cases in the 
> > > > > > [page](https://en.cppreference.com/w/cpp/atomic/atomic) seems 
> > > > > > non-trivial).  Some options:
> > > > > > 
> > > > > > 1) treat them as multiple-header symbols (like what this patch does 
> > > > > > now)
> > > > > > 2) special-case these symbols like `std::move()`
> > > > > > 3) always prefer the header providing generic templates
> > > > > > 
> > > > > > @kadircet, what do you think?
> > > > > right, i believe this patch is definitely adding keeping multiple 
> > > > > headers for a symbol around, but mixing it with keeping headers for 
> > > > > variants (e.g. overloads provided by different headers, or 
> > > > > specializations as mentioned here).
> > > > > 
> > > > > we definitely need some more changes in parser to make sure we're 
> > > > > only preserving alternatives for **the same symbol** and not for any 
> > > > > other variants (overloads, specializations). IIRC there are currently 
> > > > > 2 places these variants could come into play:
> > > > > - first is the symbol index page itself, symbols that have ` (foo)` 
> > > > > next to them have variants and should still be ignored (e.g. 
> > > > > std::remove variant of cstdio shouldn't be preserved in the scope of 
> > > > > this patch)
> > > > > - second is inside the detail pages for symbols, e.g. 
> > > > > [std::atomic](http://go/std::atomic), we can have headers for 
> > > > > different declarations, they're clearly different variants so we 
> > > > > shouldn't add such symbols into the mapping `_ParseSymbolPage` 
> > > > > already does some detection of declarations in between headers.
> > > > > 
> > > > > in the scope of this patch, we should keep ignoring both.
> > > > I suggest to special-case the overloads for now, just not to solve all 
> > > > the problems in the same patch.
> > > The first group are already ignored. We need to take a lot at how to 
> > > ignore the second one.
> > Ideally, we should tweak the `_ParseSymbolPage` to handle this case, but I 
> > don't see a easy way to do it (the `atomic` 
> > [case](https://en.cppreference.com/w/cpp/atomic/atomic) is quite tricky 
> > where the symbol name is identical, only the template argument is 
> > different, and the simple text-match heuristic in `_ParseSymbolPage` fails 
> > in this case).
> > 
> > so specialcasing these (there are not too many of them) looks fine to me.
> I don't follow why we can't perform this detection directly, haven't been 
> taking a look at the details but the page looks like:
> ```
> Defined in header <atomic>
> template< class T > struct atomic;
> template< class U > struct atomic<U*>;
> Defined in header <memory>
> template< class U > struct atomic<std::shared_ptr<U>>;
> template< class U > struct atomic<std::weak_ptr<U>>;
> Defined in header <stdatomic.h>
> #define _Atomic(T) /* see below */
> ```
> 
> So `_ParseSymbolPage`, first sees <atomic> then a bunch of decls, then 
> <memory> then a bunch more decls and so on, and in the end it returns all the 
> headers it has found.
> 
> Why can't we change the logic here to return `nothing` when there are 
> different decls && headers? i.e. return empty if we see a new header after 
> seeing some declarations.
> does this result in undesired behaviour elsewhere?
Ok, I've fixed (or hacked :) this programmatically.
The parsing of constants in 
https://en.cppreference.com/w/cpp/locale/codecvt_mode is not complete now, 
everything is attributed to the "codecvt" header, the "locale" header is 
ignored. Hope it's OK.
If not, then I'm afraid I can only special-case the "atomic.*" symbols.


================
Comment at: clang/include/clang/Tooling/Inclusions/StdSymbolMap.inc:369
+SYMBOL(div, std::, <cinttypes>)
+SYMBOL(div, std::, <cstdlib>)
 SYMBOL(div_t, std::, <cstdlib>)
----------------
VitaNuo wrote:
> hokein wrote:
> > this one as well, both headers provide different overloads.
> I suggest to special-case the overloads for now, just not to solve all the 
> problems in the same patch.
Should be fixed now, together with "atomic.*" cases.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D142092/new/

https://reviews.llvm.org/D142092

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to