nocchijiang wrote:

> I suspect that the no-LTO case might still encounter some slowdown, as each 
> CU needs to read the entire CGData regardless.

I can confirm that the performance have been improved significantly from my 
testing on no-LTO projects that the slowdown is acceptable now. Before applying 
the PR it was about 50% slowdown, now it is ~5%.

> Alternatively, we could restructure the indexed CGData to allow for reading 
> only the relevant hash entries on demand.

Besides only consuming the matched stable entries like what this PR does, this 
is exactly what I planned to do to reduce the memory footprint of the 
deserialized CGData. I would like to discuss the detail in the RFC thread with 
you to make sure that we are on the same page before coding it.


https://github.com/llvm/llvm-project/pull/115750
_______________________________________________
llvm-branch-commits mailing list
llvm-branch-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

Reply via email to