Issue |
123092
|
Summary |
[AMDGPU][Attributor] OpenMC Performance Regression on AMD GPU
|
Labels |
new issue
|
Assignees |
|
Reporter |
jtramm
|
I have observed a performance regression in the OpenMC scientific simulation application (which uses OpenMP target offloading) on the AMD MI250 due to commit https://github.com/llvm/llvm-project/commit/7dbd6cd2946ec3a9b4ad2dfd7ead177baac15bd7, which is part of PR #114357. The regression is not observed on NVIDIA GPUs.
This commit causes OpenMC to be 20% slower overall on a typical benchmark problem, with the most expensive kernel in the simulation being about 2x slower.
OpenMC can be installed and its performance benchmark run using the following script: https://github.com/jtramm/openmc_offloading_builder/tree/main
FOM before this commit (higher is better):
```
Calculation Rate (inactive) = 239870.0 particles/second
```
FOM after this commit:
```
Calculation Rate (inactive) = 192707.0 particles/second
```
Main kernel timing before this commit:
```
XS lookups (Fuel) = 1.8720e+01 seconds
```
Main kernel after this commit:
```
XS lookups (Fuel) = 3.5231e+01 seconds
```
Rocprof shows similar slowdown for this kernel, which it lists as:
```
__omp_offloading_25_7c638f81__ZN6openmc32process_calculate_xs_events_fuelEv_l256.kd
```
_______________________________________________
llvm-bugs mailing list
llvm-bugs@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-bugs