zygoloid wrote: > But, I have a question: now that it has ensured the uniqueness of typeinfo's > address, why does the implementation still compare the type equality by the > address of type name string?
The uniqueness of the address of the `type_info` object itself is not guaranteed. The reason is that we sometimes [generate `type_info` objects for pointers to incomplete types](https://itanium-cxx-abi.github.io/cxx-abi/abi.html#:~:text=When%20it%20is,the%20type_info%20addresses.) as part of exception handling, so we can end up with multiple `type_info` objects for the same type. > For symbols with internal linkage or hidden visibility, I don't think there > would be problems if they were allowed to be merged. For example, considering > there are 2 translation units 0.cpp and 1.cpp, and we defined internal `class > A {}` in these 2 translation units, since they are all internal symbols, I > think comparing `0.cpp::A` with `1.cpp::A` is undefined behavior. Because > there at least one symbol was referenced outside of the current visibility > scope. Such comparisons can happen in valid C++ code. For example: ```c++ // a.cc namespace { class A {}; } void throwA() { throw A(); } ``` ```c++ // b.cc namespace { class A {}; } void throwA(); int main() { try { throwA(); } catch (A) { return 1; } catch (...) { return 2; } } ``` This program is valid and `main` is required to return 2 -- the `A` in `a.cc` and the `A` in `b.cc` are two different types. During exception handling, we will compare them by comparing their `type_info`, which means we will compare the addresses of the name strings, so we need two different addresses. > For dynamic loading, if there are two same symbols in different DSOs, the > symbol would be interposed. Not in the cases I mentioned. (I think we can *probably* get away with marking type name strings as `unnamed_addr` if the type has external linkage, because we don't expect `unnamed_addr` to have any effect after static linking. But it's not clear to me that that's actually guaranteed by the LLVM semantics, or whether it would be permissible to make use of `unnamed_addr` in some cross-DSO LTO situation.) You *can* still apply `unnamed_addr` in the cases where the the target ABI rule is that `type_info` comparisons will always use or fall back to a string comparison. Per the libc++ implementation, that's that case on Apple arm64 targets. You can detect this using `classifyRTTIUniqueness`. I think it's also correct and safe to apply `local_unnamed_addr` to these type name strings in all cases. Merging with another string literal from the same compilation should always be OK. https://github.com/llvm/llvm-project/pull/111343 _______________________________________________ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits