[PATCH] D136701: [LinkerWrapper] Perform device linking steps in parallel

Yaxun Liu via Phabricator via cfe-commits Tue, 25 Oct 2022 14:43:51 -0700

yaxunl added a comment.

In general, I think parallelizing the linking workload for multiple GPUs in
the linker wrapper is a useful feature. I am not sure whether the workload to
be parallelized includes the LLVM passes and codegen, which are usually the
bottleneck. Parallelizing that workload when there are many GPU architectures
can significantly reduce build time.
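
To make the idea concrete, here is a minimal sketch of running one device link
job per GPU architecture concurrently. It is not the code from this patch:
`linkDeviceImage` and `linkAllDevices` are hypothetical names, and the job body
is a placeholder for whatever per-architecture work (LLVM passes, codegen,
device link) the wrapper would actually do.

```cpp
#include <future>
#include <string>
#include <vector>

// Hypothetical stand-in for the per-architecture work. A real job would run
// the LLVM passes, codegen, and the device linker for this one architecture.
std::string linkDeviceImage(const std::string &Arch) { return Arch + ".img"; }

// Start one asynchronous job per architecture, then collect the results.
// The output order matches the order of Archs, not the completion order.
std::vector<std::string> linkAllDevices(const std::vector<std::string> &Archs) {
  std::vector<std::future<std::string>> Jobs;
  Jobs.reserve(Archs.size());
  for (const std::string &Arch : Archs)
    Jobs.push_back(std::async(std::launch::async, linkDeviceImage, Arch));

  std::vector<std::string> Images;
  Images.reserve(Jobs.size());
  for (std::future<std::string> &Job : Jobs)
    Images.push_back(Job.get()); // blocks until this job has finished
  return Images;
}
```

Because each job is independent, the wall-clock time approaches that of the
slowest architecture rather than the sum over all of them, provided enough
cores are free. On some toolchains the build needs thread support enabled
(for example `-pthread`).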


It would be preferable for the parallelization to be coordinated with GNU make
through the jobserver that GNU make provides
(https://www.gnu.org/software/make/manual/html_node/Job-Slots.html#Job-Slots).
However, implementing that would take some effort.
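
As a rough illustration of the first step of such an integration: GNU make
advertises its jobserver to child processes through the `MAKEFLAGS`
environment variable, either as a pair of file descriptors
(`--jobserver-auth=R,W`, or `--jobserver-fds=R,W` in older releases) or as a
named pipe (`--jobserver-auth=fifo:PATH`). A client then acquires a job slot
by reading one byte from the pipe and releases it by writing the same byte
back; every process already owns one implicit slot. The sketch below only
covers locating the jobserver. `JobserverAuth`, `parseFD`, and
`parseJobserverAuth` are hypothetical names, not existing LLVM APIs.

```cpp
#include <sstream>
#include <string>

// Where the jobserver pipe can be found, as advertised in MAKEFLAGS.
struct JobserverAuth {
  int ReadFD = -1;      // used by the "R,W" form
  int WriteFD = -1;     // used by the "R,W" form
  std::string FifoPath; // non-empty only for the "fifo:PATH" form
};

// Parse a non-negative decimal file descriptor; reject anything else.
static bool parseFD(const std::string &Text, int &FD) {
  if (Text.empty() || Text.size() > 9)
    return false;
  int Value = 0;
  for (char C : Text) {
    if (C < '0' || C > '9')
      return false;
    Value = Value * 10 + (C - '0');
  }
  FD = Value;
  return true;
}

// Scan MAKEFLAGS for a jobserver option. If the option appears more than
// once, only the last instance is honored. Returns false when the last
// instance is absent or unusable (for example, negative descriptors).
bool parseJobserverAuth(const std::string &MakeFlags, JobserverAuth &Out) {
  static const char *const Prefixes[] = {"--jobserver-auth=",
                                         "--jobserver-fds="};
  bool Found = false;
  JobserverAuth Last;
  std::istringstream Words(MakeFlags);
  std::string Word;
  while (Words >> Word) {
    for (const char *P : Prefixes) {
      const std::string Prefix(P);
      if (Word.compare(0, Prefix.size(), Prefix) != 0)
        continue;
      const std::string Value = Word.substr(Prefix.size());
      JobserverAuth Auth;
      if (Value.compare(0, 5, "fifo:") == 0) {
        Auth.FifoPath = Value.substr(5);
        Found = !Auth.FifoPath.empty();
      } else {
        const std::string::size_type Comma = Value.find(',');
        Found = Comma != std::string::npos &&
                parseFD(Value.substr(0, Comma), Auth.ReadFD) &&
                parseFD(Value.substr(Comma + 1), Auth.WriteFD);
      }
      Last = Auth;
    }
  }
  if (Found)
    Out = Last;
  return Found;
}
```

A caller would pass in the value of `std::getenv("MAKEFLAGS")` and fall back to
its own job limit when the function returns false. The token acquire and
release logic, and returning every token on all exit paths, is the part that
needs the real effort.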

For now, I think an option to enable parallelization (off by default) should be
fine.
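
One possible shape for such an option, sketched under assumptions: the flag
name `--device-link-jobs=` and the helper `deviceLinkJobs` are hypothetical and
are not taken from this patch. The point is only that an absent or malformed
value keeps the current serial behavior.

```cpp
#include <string>
#include <vector>

// Return how many device link jobs may run at once. The flag name is
// hypothetical. A missing, malformed, or zero value yields 1, so
// parallelization stays off unless it is requested explicitly.
unsigned deviceLinkJobs(const std::vector<std::string> &Args) {
  const std::string Prefix = "--device-link-jobs=";
  unsigned Jobs = 1;
  for (const std::string &Arg : Args) {
    if (Arg.compare(0, Prefix.size(), Prefix) != 0)
      continue;
    const std::string Value = Arg.substr(Prefix.size());
    bool Valid = !Value.empty() && Value.size() <= 4;
    unsigned Parsed = 0;
    for (char C : Value) {
      if (C < '0' || C > '9') {
        Valid = false;
        break;
      }
      Parsed = Parsed * 10 + static_cast<unsigned>(C - '0');
    }
    // The last occurrence of the flag on the command line wins.
    Jobs = (Valid && Parsed > 0) ? Parsed : 1;
  }
  return Jobs;
}
```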


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D136701/new/

https://reviews.llvm.org/D136701

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
