r339109 - AMDGPU: Add builtin for s_dcache_inv_vol

2018-08-07 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Aug 7 00:49:04 2018 New Revision: 339109 URL: http://llvm.org/viewvc/llvm-project?rev=339109&view=rev Log: AMDGPU: Add builtin for s_dcache_inv_vol Added: cfe/trunk/test/CodeGenOpenCL/builtins-amdgcn-ci.cl cfe/trunk/test/SemaOpenCL/builtins-amdgcn-error-ci.cl Mod

r339110 - AMDGPU: Add builtin for s_dcache_wb

2018-08-07 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Aug 7 00:49:13 2018 New Revision: 339110 URL: http://llvm.org/viewvc/llvm-project?rev=339110&view=rev Log: AMDGPU: Add builtin for s_dcache_wb Added: cfe/trunk/test/SemaOpenCL/builtins-amdgcn-error-vi.cl Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def

r339278 - AMDGPU: Fix enabling denormals by default on pre-VI targets

2018-08-08 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Wed Aug 8 10:48:37 2018 New Revision: 339278 URL: http://llvm.org/viewvc/llvm-project?rev=339278&view=rev Log: AMDGPU: Fix enabling denormals by default on pre-VI targets Fast FMAF is not a sufficient condition to enable denormals. Before VI, enabling denormals caused F32 in

r339395 - AMDGPU: Add another missing builtin

2018-08-09 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Aug 9 15:18:37 2018 New Revision: 339395 URL: http://llvm.org/viewvc/llvm-project?rev=339395&view=rev Log: AMDGPU: Add another missing builtin Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/test/CodeGenOpenCL/builtins-amdgcn-ci.cl cfe/tr

r339934 - AMDGPU: Correct errors in device table

2018-08-16 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Aug 16 13:19:47 2018 New Revision: 339934 URL: http://llvm.org/viewvc/llvm-project?rev=339934&view=rev Log: AMDGPU: Correct errors in device table Modified: cfe/trunk/lib/Basic/Targets/AMDGPU.h Modified: cfe/trunk/lib/Basic/Targets/AMDGPU.h URL: http://llvm.org/view

r340193 - Rename -mlink-cuda-bitcode to -mlink-builtin-bitcode

2018-08-20 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Aug 20 11:16:48 2018 New Revision: 340193 URL: http://llvm.org/viewvc/llvm-project?rev=340193&view=rev Log: Rename -mlink-cuda-bitcode to -mlink-builtin-bitcode The same semantics work for OpenCL, and probably any offload language. Keep the old name around as an alias. M

r340292 - AMDGPU: Move target code into TargetParser

2018-08-21 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Aug 21 09:13:29 2018 New Revision: 340292 URL: http://llvm.org/viewvc/llvm-project?rev=340292&view=rev Log: AMDGPU: Move target code into TargetParser Modified: cfe/trunk/lib/Basic/Targets/AMDGPU.cpp cfe/trunk/lib/Basic/Targets/AMDGPU.h Modified: cfe/trunk/lib/Ba

r315094 - OpenCL: Assume functions are convergent

2017-10-06 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Fri Oct 6 12:34:40 2017 New Revision: 315094 URL: http://llvm.org/viewvc/llvm-project?rev=315094&view=rev Log: OpenCL: Assume functions are convergent This was done for CUDA functions in r261779, and for the same reason this also needs to be done for OpenCL. An arbitrary fun

r315219 - AMDGPU: Fix missing declaration for __builtin_amdgcn_dispatch_ptr

2017-10-09 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Oct 9 10:44:18 2017 New Revision: 315219 URL: http://llvm.org/viewvc/llvm-project?rev=315219&view=rev Log: AMDGPU: Fix missing declaration for __builtin_amdgcn_dispatch_ptr Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/test/CodeGenOpenCL/bu

r315238 - AMDGPU: Add read_exec_lo/hi builtins

2017-10-09 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Oct 9 13:06:37 2017 New Revision: 315238 URL: http://llvm.org/viewvc/llvm-project?rev=315238&view=rev Log: AMDGPU: Add read_exec_lo/hi builtins Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/lib/CodeGen/CGBuiltin.cpp cfe/trunk/test/CodeG

r341033 - AMDGPU: Default to hidden visibility

2018-08-30 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Aug 30 01:18:06 2018 New Revision: 341033 URL: http://llvm.org/viewvc/llvm-project?rev=341033&view=rev Log: AMDGPU: Default to hidden visibility Object linking isn't supported, so it's not useful to emit default visibility. Default visibility requires relocations we don't

r338707 - Try to make builtin address space declarations not useless

2018-08-02 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Aug 2 05:14:28 2018 New Revision: 338707 URL: http://llvm.org/viewvc/llvm-project?rev=338707&view=rev Log: Try to make builtin address space declarations not useless The way address space declarations for builtins currently work is nearly useless. The code assumes the ad

r338754 - AMDGPU: Fix missing declaration of queue ptr builtin

2018-08-02 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Aug 2 11:24:55 2018 New Revision: 338754 URL: http://llvm.org/viewvc/llvm-project?rev=338754&view=rev Log: AMDGPU: Fix missing declaration of queue ptr builtin Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/test/CodeGenOpenCL/builtins-amdgcn

r324641 - Fix crash on array initializer with non-0 alloca addrspace

2018-02-08 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Feb 8 11:37:09 2018 New Revision: 324641 URL: http://llvm.org/viewvc/llvm-project?rev=324641&view=rev Log: Fix crash on array initializer with non-0 alloca addrspace Modified: cfe/trunk/lib/CodeGen/CGDecl.cpp cfe/trunk/test/CodeGenOpenCL/address-space-constant-in

r324748 - AMDGPU: Update for datalayout change

2018-02-09 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Fri Feb 9 08:58:41 2018 New Revision: 324748 URL: http://llvm.org/viewvc/llvm-project?rev=324748&view=rev Log: AMDGPU: Update for datalayout change Modified: cfe/trunk/lib/Basic/Targets/AMDGPU.cpp cfe/trunk/test/CodeGen/target-data.c cfe/trunk/test/CodeGenOpenCL/

r352443 - AMDGPU: Add ds append/consume builtins

2019-01-28 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Jan 28 15:59:18 2019 New Revision: 352443 URL: http://llvm.org/viewvc/llvm-project?rev=352443&view=rev Log: AMDGPU: Add ds append/consume builtins Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/lib/CodeGen/CGBuiltin.cpp cfe/trunk/test/Cod

Re: r350970 - [Darwin][Driver] Don't pass a file as object_path_lto during ThinLTO

2019-01-29 Thread Matt Arsenault via cfe-commits
> On Jan 11, 2019, at 4:16 PM, Steven Wu via cfe-commits > wrote: > > Author: steven_wu > Date: Fri Jan 11 13:16:04 2019 > New Revision: 350970 > > URL: http://llvm.org/viewvc/llvm-project?rev=350970&view=rev > Log: > [Darwin][Driver] Don't pass a file as object_path_lto during ThinLTO > > S

r352539 - Revert "OpenCL: Extend argument promotion rules to vector types"

2019-01-29 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Jan 29 12:49:47 2019 New Revision: 352539 URL: http://llvm.org/viewvc/llvm-project?rev=352539&view=rev Log: Revert "OpenCL: Extend argument promotion rules to vector types" This reverts r348083. This was based on a misreading of the spec for printf specifiers. Also rever

r352540 - OpenCL: Use length modifier for warning on vector printf arguments

2019-01-29 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Jan 29 12:49:54 2019 New Revision: 352540 URL: http://llvm.org/viewvc/llvm-project?rev=352540&view=rev Log: OpenCL: Use length modifier for warning on vector printf arguments Re-enable format string warnings on printf. The warnings are still incomplete. Apparently it is

r352544 - OpenCL: Try to fix bot test failure

2019-01-29 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Jan 29 13:14:56 2019 New Revision: 352544 URL: http://llvm.org/viewvc/llvm-project?rev=352544&view=rev Log: OpenCL: Try to fix bot test failure Modified: cfe/trunk/test/SemaOpenCL/format-strings-fixit.cl Modified: cfe/trunk/test/SemaOpenCL/format-strings-fixit.cl URL

r346806 - OpenCL: Don't warn on v printf modifier

2018-11-13 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Nov 13 14:30:35 2018 New Revision: 346806 URL: http://llvm.org/viewvc/llvm-project?rev=346806&view=rev Log: OpenCL: Don't warn on v printf modifier This avoids spurious warnings, but could use a lot of work. For example the number of vector elements is not verified, and t

r347873 - Mark __builtin_shufflevector as using custom type checking

2018-11-29 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Nov 29 07:45:05 2018 New Revision: 347873 URL: http://llvm.org/viewvc/llvm-project?rev=347873&view=rev Log: Mark __builtin_shufflevector as using custom type checking The custom handling seems to all be implemented already. This avoids regressions in a future patch when f

r348083 - OpenCL: Extend argument promotion rules to vector types

2018-12-01 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Sat Dec 1 13:56:10 2018 New Revision: 348083 URL: http://llvm.org/viewvc/llvm-project?rev=348083&view=rev Log: OpenCL: Extend argument promotion rules to vector types The spec is ambiguous on whether vector types are allowed to be implicitly converted. The only legal context

r348084 - OpenCL: Improve vector printf warnings

2018-12-01 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Sat Dec 1 14:16:27 2018 New Revision: 348084 URL: http://llvm.org/viewvc/llvm-project?rev=348084&view=rev Log: OpenCL: Improve vector printf warnings The vector modifier is considered separate, so don't treat it as a conversion specifier. This is still not warning on some c

r348809 - Update test for instcombine change

2018-12-10 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Dec 10 15:02:40 2018 New Revision: 348809 URL: http://llvm.org/viewvc/llvm-project?rev=348809&view=rev Log: Update test for instcombine change Modified: cfe/trunk/test/CodeGen/vector.c Modified: cfe/trunk/test/CodeGen/vector.c URL: http://llvm.org/viewvc/llvm-projec

r356354 - Add testcase from bug 41079

2019-03-17 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Sun Mar 17 16:16:31 2019 New Revision: 356354 URL: http://llvm.org/viewvc/llvm-project?rev=356354&view=rev Log: Add testcase from bug 41079 Modified: cfe/trunk/test/CodeGen/builtin-expect.c Modified: cfe/trunk/test/CodeGen/builtin-expect.c URL: http://llvm.org/viewvc/ll

r359918 - Ensure there is stack usage in stack size warning test

2019-05-03 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Fri May 3 12:04:14 2019 New Revision: 359918 URL: http://llvm.org/viewvc/llvm-project?rev=359918&view=rev Log: Ensure there is stack usage in stack size warning test r359906 broke this because the only stack usage was from a spill which can be avoided since the only block is

r343653 - OpenCL: Mark printf format string argument

2018-10-02 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Oct 2 19:01:19 2018 New Revision: 343653 URL: http://llvm.org/viewvc/llvm-project?rev=343653&view=rev Log: OpenCL: Mark printf format string argument Fixes not warning on format string errors. Added: cfe/trunk/test/SemaOpenCL/printf-format-string-warnings.cl Modifie

r354624 - AMDGPU: Don't emit debugger subtarget features

2019-02-21 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Feb 21 13:31:43 2019 New Revision: 354624 URL: http://llvm.org/viewvc/llvm-project?rev=354624&view=rev Log: AMDGPU: Don't emit debugger subtarget features Keep the flag around for compatability. Modified: cfe/trunk/lib/Driver/ToolChains/AMDGPU.cpp cfe/trunk/test/

r363390 - AMDGPU: Use AMDGPU toolchain for other OSes

2019-06-14 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Fri Jun 14 06:39:57 2019 New Revision: 363390 URL: http://llvm.org/viewvc/llvm-project?rev=363390&view=rev Log: AMDGPU: Use AMDGPU toolchain for other OSes This would need more work to actually support them, but this is less wrong than the default. Modified: cfe/trunk/li

r363682 - AMDGPU: Disable errno by default

2019-06-18 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Jun 18 06:59:32 2019 New Revision: 363682 URL: http://llvm.org/viewvc/llvm-project?rev=363682&view=rev Log: AMDGPU: Disable errno by default Modified: cfe/trunk/lib/Driver/ToolChains/AMDGPU.h cfe/trunk/test/Driver/fast-math.c Modified: cfe/trunk/lib/Driver/ToolCh

r363684 - AMDGPU: Add GWS instruction builtins

2019-06-18 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Jun 18 07:10:01 2019 New Revision: 363684 URL: http://llvm.org/viewvc/llvm-project?rev=363684&view=rev Log: AMDGPU: Add GWS instruction builtins Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/test/CodeGenOpenCL/builtins-amdgcn.cl Modified: c

r363871 - Reapply "r363684: AMDGPU: Add GWS instruction builtins"

2019-06-19 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Wed Jun 19 12:55:49 2019 New Revision: 363871 URL: http://llvm.org/viewvc/llvm-project?rev=363871&view=rev Log: Reapply "r363684: AMDGPU: Add GWS instruction builtins" Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/test/CodeGenOpenCL/builtins-amd

r366286 - AMDGPU: Add some missing builtins

2019-07-16 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Jul 16 17:01:03 2019 New Revision: 366286 URL: http://llvm.org/viewvc/llvm-project?rev=366286&view=rev Log: AMDGPU: Add some missing builtins Added: cfe/trunk/test/CodeGenOpenCL/builtins-amdgcn-gfx10.cl cfe/trunk/test/SemaOpenCL/builtins-amdgcn-error-gfx10-param.c

r367431 - AMDGPU: Add missing builtin declarations

2019-07-31 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Wed Jul 31 07:03:05 2019 New Revision: 367431 URL: http://llvm.org/viewvc/llvm-project?rev=367431&view=rev Log: AMDGPU: Add missing builtin declarations Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/test/CodeGenOpenCL/builtins-amdgcn.cl Modifie

r367973 - Builtins: Start adding half versions of math builtins

2019-08-05 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Aug 5 20:28:37 2019 New Revision: 367973 URL: http://llvm.org/viewvc/llvm-project?rev=367973&view=rev Log: Builtins: Start adding half versions of math builtins The implementation of the OpenCL builtin currently library uses 2 different hacks to get to the corresponding

r363986 - AMDGPU: Add DS GWS sema builtins

2019-06-20 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Thu Jun 20 14:33:57 2019 New Revision: 363986 URL: http://llvm.org/viewvc/llvm-project?rev=363986&view=rev Log: AMDGPU: Add DS GWS sema builtins Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/test/CodeGenOpenCL/builtins-amdgcn-ci.cl cfe/trunk

r364123 - AMDGPU: Fix target builtins for gfx10

2019-06-21 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Fri Jun 21 18:30:00 2019 New Revision: 364123 URL: http://llvm.org/viewvc/llvm-project?rev=364123&view=rev Log: AMDGPU: Fix target builtins for gfx10 This wasn't setting some of the features from older generations. Modified: cfe/trunk/lib/Basic/Targets/AMDGPU.cpp cfe

r364251 - AMDGPU: Fix missing declaration for mbcnt builtins

2019-06-24 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Jun 24 16:34:06 2019 New Revision: 364251 URL: http://llvm.org/viewvc/llvm-project?rev=364251&view=rev Log: AMDGPU: Fix missing declaration for mbcnt builtins Modified: cfe/trunk/include/clang/Basic/BuiltinsAMDGPU.def cfe/trunk/test/CodeGenOpenCL/builtins-amdgcn.c

r331216 - AMDGPU: Add Vega12 and Vega20

2018-04-30 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Apr 30 12:08:27 2018 New Revision: 331216 URL: http://llvm.org/viewvc/llvm-project?rev=331216&view=rev Log: AMDGPU: Add Vega12 and Vega20 Changes by Matt Arsenault Konstantin Zhuravlyov Added: cfe/trunk/test/CodeGenOpenCL/amdgpu-features.cl cfe/trunk/test/Cod

r336676 - Update test for backend error message change

2018-07-10 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Jul 10 07:03:50 2018 New Revision: 336676 URL: http://llvm.org/viewvc/llvm-project?rev=336676&view=rev Log: Update test for backend error message change Modified: cfe/trunk/test/CodeGen/backend-unsupported-error.ll Modified: cfe/trunk/test/CodeGen/backend-unsupported

r336681 - AMDGPU: Try to fix test again

2018-07-10 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Tue Jul 10 07:47:31 2018 New Revision: 336681 URL: http://llvm.org/viewvc/llvm-project?rev=336681&view=rev Log: AMDGPU: Try to fix test again Modified: cfe/trunk/test/CodeGen/backend-unsupported-error.ll Modified: cfe/trunk/test/CodeGen/backend-unsupported-error.ll URL:

[clang] 40ab8ae - OpenMP: Add helper function for convergent runtime calls

2019-10-27 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2019-10-27T21:26:55-07:00 New Revision: 40ab8ae9fb70f1550815bf0f867148b5101a4f66 URL: https://github.com/llvm/llvm-project/commit/40ab8ae9fb70f1550815bf0f867148b5101a4f66 DIFF: https://github.com/llvm/llvm-project/commit/40ab8ae9fb70f1550815bf0f867148b5101a4f66.diff

[clang] 56a503b - OpenMP: Add convergent to more runtime functions

2019-10-27 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2019-10-27T21:26:55-07:00 New Revision: 56a503bdba9c33fce4d8fe86494cfd9c0b62c88a URL: https://github.com/llvm/llvm-project/commit/56a503bdba9c33fce4d8fe86494cfd9c0b62c88a DIFF: https://github.com/llvm/llvm-project/commit/56a503bdba9c33fce4d8fe86494cfd9c0b62c88a.diff

[clang] 1d96dca - HIP: Try to deal with more llvm package layouts

2020-05-25 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-23T13:28:24-04:00 New Revision: 1d96dca9491e3d75c11c3cd1acff5fcda8c2f613 URL: https://github.com/llvm/llvm-project/commit/1d96dca9491e3d75c11c3cd1acff5fcda8c2f613 DIFF: https://github.com/llvm/llvm-project/commit/1d96dca9491e3d75c11c3cd1acff5fcda8c2f613.diff

[libclc] cf4d4e3 - libclc: Compile with -nostdlib

2020-05-28 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-28T10:41:31-04:00 New Revision: cf4d4e366a2165f0e93948f166d76ae650aecc98 URL: https://github.com/llvm/llvm-project/commit/cf4d4e366a2165f0e93948f166d76ae650aecc98 DIFF: https://github.com/llvm/llvm-project/commit/cf4d4e366a2165f0e93948f166d76ae650aecc98.diff

[clang] 97f3f0b - AMDGPU: Add intrinsic for s_setreg

2020-05-28 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-28T14:26:38-04:00 New Revision: 97f3f0bab0982f84745c7ac5ce8fb6b0918ff718 URL: https://github.com/llvm/llvm-project/commit/97f3f0bab0982f84745c7ac5ce8fb6b0918ff718 DIFF: https://github.com/llvm/llvm-project/commit/97f3f0bab0982f84745c7ac5ce8fb6b0918ff718.diff

[clang] 301a6da - AMDGPU: Fix clang side null pointer value for private

2020-06-02 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-06-02T09:23:46-04:00 New Revision: 301a6da8c24a09052e3bda10e90b450b7b39ffea URL: https://github.com/llvm/llvm-project/commit/301a6da8c24a09052e3bda10e90b450b7b39ffea DIFF: https://github.com/llvm/llvm-project/commit/301a6da8c24a09052e3bda10e90b450b7b39ffea.diff

[clang] 4593e41 - AMDGPU: Teach toolchain to link rocm device libs

2020-04-10 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-04-10T13:37:32-04:00 New Revision: 4593e4131affa84e61d7b6844be409ba46d29f11 URL: https://github.com/llvm/llvm-project/commit/4593e4131affa84e61d7b6844be409ba46d29f11 DIFF: https://github.com/llvm/llvm-project/commit/4593e4131affa84e61d7b6844be409ba46d29f11.diff

[clang] 1e93b3d - Disable test on windows

2020-04-10 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-04-10T18:48:18-04:00 New Revision: 1e93b3d8a75638c2268df58e44a8738845df3ed5 URL: https://github.com/llvm/llvm-project/commit/1e93b3d8a75638c2268df58e44a8738845df3ed5 DIFF: https://github.com/llvm/llvm-project/commit/1e93b3d8a75638c2268df58e44a8738845df3ed5.diff

[clang] dc89a3e - HIP: Fix handling of denormal mode

2020-04-13 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-04-13T11:48:45-07:00 New Revision: dc89a3efb43feedec04facfa2206de011d2606e7 URL: https://github.com/llvm/llvm-project/commit/dc89a3efb43feedec04facfa2206de011d2606e7 DIFF: https://github.com/llvm/llvm-project/commit/dc89a3efb43feedec04facfa2206de011d2606e7.diff

[clang] 3a61245 - clang/AMDGPU: Assume denormals are enabled for the default target.

2020-04-15 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-04-15T09:17:26-04:00 New Revision: 3a612450508b314b7a6f4db142d0c619031d760e URL: https://github.com/llvm/llvm-project/commit/3a612450508b314b7a6f4db142d0c619031d760e DIFF: https://github.com/llvm/llvm-project/commit/3a612450508b314b7a6f4db142d0c619031d760e.diff

[clang] 9e03bde - AMDGPU: Add llvm.amdgcn.sqrt intrinsic

2020-06-26 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-06-26T15:07:07-04:00 New Revision: 9e03bdebc17a223416d682f64ef2046b8bf0fc98 URL: https://github.com/llvm/llvm-project/commit/9e03bdebc17a223416d682f64ef2046b8bf0fc98 DIFF: https://github.com/llvm/llvm-project/commit/9e03bdebc17a223416d682f64ef2046b8bf0fc98.diff

[clang] 5c03bee - clang: Allow backend unsupported warnings

2020-04-27 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-04-27T12:14:51-04:00 New Revision: 5c03beefa720bddb3e3f53c595a76bce7ad50f37 URL: https://github.com/llvm/llvm-project/commit/5c03beefa720bddb3e3f53c595a76bce7ad50f37 DIFF: https://github.com/llvm/llvm-project/commit/5c03beefa720bddb3e3f53c595a76bce7ad50f37.diff

[clang] 580a9f2 - Fix test without built AMDGPU

2020-04-27 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-04-27T13:32:00-04:00 New Revision: 580a9f2c307f8d475277f20999ad5820d171beba URL: https://github.com/llvm/llvm-project/commit/580a9f2c307f8d475277f20999ad5820d171beba DIFF: https://github.com/llvm/llvm-project/commit/580a9f2c307f8d475277f20999ad5820d171beba.diff

[clang] 30eeb74 - clang: Use byref for aggregate kernel arguments

2020-08-06 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-08-06T15:52:26-04:00 New Revision: 30eeb742f1d11d7a7036e3b8a3bffc1dfd252082 URL: https://github.com/llvm/llvm-project/commit/30eeb742f1d11d7a7036e3b8a3bffc1dfd252082 DIFF: https://github.com/llvm/llvm-project/commit/30eeb742f1d11d7a7036e3b8a3bffc1dfd252082.diff

[clang] 684dc1b - Elaborate more on --rocm-path flag.

2020-05-05 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-05T09:33:18-04:00 New Revision: 684dc1bebe5cb70cfd27923940f9f8cba4f13195 URL: https://github.com/llvm/llvm-project/commit/684dc1bebe5cb70cfd27923940f9f8cba4f13195 DIFF: https://github.com/llvm/llvm-project/commit/684dc1bebe5cb70cfd27923940f9f8cba4f13195.diff

[clang] 03cb328 - clang: Cleanup usage of CreateMemCpy

2020-05-08 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-08T20:57:56-04:00 New Revision: 03cb328d6f691bde88c754341ff3859d1c1ecc2f URL: https://github.com/llvm/llvm-project/commit/03cb328d6f691bde88c754341ff3859d1c1ecc2f DIFF: https://github.com/llvm/llvm-project/commit/03cb328d6f691bde88c754341ff3859d1c1ecc2f.diff

[clang] a881dc1 - Fix typo

2020-05-09 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-09T16:00:17-04:00 New Revision: a881dc1103579926f039e81c0d25626ff8a582a9 URL: https://github.com/llvm/llvm-project/commit/a881dc1103579926f039e81c0d25626ff8a582a9 DIFF: https://github.com/llvm/llvm-project/commit/a881dc1103579926f039e81c0d25626ff8a582a9.diff

[clang] 123bee6 - AMDGPU: Search for new ROCm bitcode library structure

2020-05-12 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-12T09:41:07-04:00 New Revision: 123bee602a260150ff55c74287f583a67ee78f36 URL: https://github.com/llvm/llvm-project/commit/123bee602a260150ff55c74287f583a67ee78f36 DIFF: https://github.com/llvm/llvm-project/commit/123bee602a260150ff55c74287f583a67ee78f36.diff

[clang] 14e1845 - HIP: Merge builtin library handling

2020-05-12 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-12T09:50:22-04:00 New Revision: 14e184571139ba4c7347ea547074c6d9ec9c7b14 URL: https://github.com/llvm/llvm-project/commit/14e184571139ba4c7347ea547074c6d9ec9c7b14 DIFF: https://github.com/llvm/llvm-project/commit/14e184571139ba4c7347ea547074c6d9ec9c7b14.diff

[clang] 235fb7d - AMDGPU/OpenCL: Accept -nostdlib in place of -nogpulib

2020-05-14 Thread Matt Arsenault via cfe-commits
Author: Matt Arsenault Date: 2020-05-14T12:33:31-04:00 New Revision: 235fb7dc24b1cf7034dfc76bb853ffb4ac5dec5d URL: https://github.com/llvm/llvm-project/commit/235fb7dc24b1cf7034dfc76bb853ffb4ac5dec5d DIFF: https://github.com/llvm/llvm-project/commit/235fb7dc24b1cf7034dfc76bb853ffb4ac5dec5d.diff

[PATCH] D25343: [OpenCL] Mark group functions as noduplicate in opencl-c.h

2016-10-06 Thread Matt Arsenault via cfe-commits
arsenm added a comment. These should be convergent instead https://reviews.llvm.org/D25343 ___ cfe-commits mailing list cfe-commits@lists.llvm.org http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

r319735 - AMDGPU: Don't add fp64 feature to r600 subtargets

2017-12-04 Thread Matt Arsenault via cfe-commits
Author: arsenm Date: Mon Dec 4 19:51:26 2017 New Revision: 319735 URL: http://llvm.org/viewvc/llvm-project?rev=319735&view=rev Log: AMDGPU: Don't add fp64 feature to r600 subtargets Should fix test after r319709 Modified: cfe/trunk/lib/Basic/Targets/AMDGPU.cpp Modified: cfe/trunk/lib/Basic

[compiler-rt] [clang-tools-extra] [clang] [llvm] [InferAddressSpaces] Fix constant replace to avoid modifying other functions (PR #70611)

2023-11-07 Thread Matt Arsenault via cfe-commits
@@ -334,6 +335,15 @@ template<> struct simplify_type { } }; +template <> struct GraphTraits { arsenm wrote: If we allowed bitcasts between address spaces with the same size, we could drop addrspacecast constantexprs altogether https://github.com/llvm/llv

[clang] [AMDGPU] const-fold imm operands of amdgcn_update_dpp intrinsic (PR #71139)

2023-11-07 Thread Matt Arsenault via cfe-commits
@@ -17632,8 +17632,20 @@ Value *CodeGenFunction::EmitAMDGPUBuiltinExpr(unsigned BuiltinID, case AMDGPU::BI__builtin_amdgcn_mov_dpp: case AMDGPU::BI__builtin_amdgcn_update_dpp: { llvm::SmallVector Args; -for (unsigned I = 0; I != E->getNumArgs(); ++I) - Args.pu

[clang] [AMDGPU] const-fold imm operands of amdgcn_update_dpp intrinsic (PR #71139)

2023-11-07 Thread Matt Arsenault via cfe-commits
@@ -17632,8 +17632,27 @@ Value *CodeGenFunction::EmitAMDGPUBuiltinExpr(unsigned BuiltinID, case AMDGPU::BI__builtin_amdgcn_mov_dpp: case AMDGPU::BI__builtin_amdgcn_update_dpp: { llvm::SmallVector Args; -for (unsigned I = 0; I != E->getNumArgs(); ++I) - Args.pu

[llvm] [clang] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #68932)

2023-11-08 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm requested changes to this pull request. https://github.com/llvm/llvm-project/pull/68932 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #68932)

2023-11-08 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm edited https://github.com/llvm/llvm-project/pull/68932 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #68932)

2023-11-08 Thread Matt Arsenault via cfe-commits
@@ -52,6 +52,11 @@ static cl::opt ForceEmitZeroFlag( cl::desc("Force all waitcnt instrs to be emitted as s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)"), cl::init(false), cl::Hidden); +static cl::opt +PreciseMemOpFlag("amdgpu-precise-memory-op", + cl::de

[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #68932)

2023-11-08 Thread Matt Arsenault via cfe-commits
@@ -1809,6 +1816,23 @@ bool SIInsertWaitcnts::shouldFlushVmCnt(MachineLoop *ML, return HasVMemLoad && UsesVgprLoadedOutside; } +bool SIInsertWaitcnts::insertWaitcntAfterMemOp(MachineFunction &MF) { + bool Modified = false; + + for (auto &MBB : MF) { arsenm

[clang] [AMDGPU] const-fold imm operands of amdgcn_update_dpp intrinsic (PR #71139)

2023-11-08 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm approved this pull request. Could probably golf this down for more sharing with the default path but this is a start https://github.com/llvm/llvm-project/pull/71139 ___ cfe-commits mailing list cfe-commits@lists.llvm.org htt

[clang] clang: Add pragma clang fp reciprocal (PR #68267)

2023-11-08 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm edited https://github.com/llvm/llvm-project/pull/68267 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] Revert "Revert "[AMDGPU] const-fold imm operands of (PR #71669)

2023-11-08 Thread Matt Arsenault via cfe-commits
@@ -0,0 +1,48 @@ +// REQUIRES: amdgpu-registered-target + +// RUN: %clang --offload-arch=gfx906 -S -o - -emit-llvm --cuda-device-only -nogpuinc -nogpulib\ arsenm wrote: This should use %clang_cc1, look at some of the other CodeGen* test run lines for examples

[clang] Revert "Revert "[AMDGPU] const-fold imm operands of (PR #71669)

2023-11-08 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm approved this pull request. https://github.com/llvm/llvm-project/pull/71669 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[openmp] [clang] [llvm] [OpenMP] Rework handling of global ctor/dtors in OpenMP (PR #71739)

2023-11-09 Thread Matt Arsenault via cfe-commits
@@ -2794,6 +2794,14 @@ void ItaniumCXXABI::registerGlobalDtor(CodeGenFunction &CGF, const VarDecl &D, if (D.isNoDestroy(CGM.getContext())) return; + // OpenMP offloading supports C++ constructors and destructors but we do not + // always have 'atexit' available. Inste

[compiler-rt] [llvm] [clang-tools-extra] [clang] [InferAddressSpaces] Fix constant replace to avoid modifying other functions (PR #70611)

2023-11-09 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm approved this pull request. I think it would be better if we could eliminate ConstantExpr addrspacecasts from the IR altogether, which would avoid most of the complexity here. I would also somewhat prefer to push this DFS into a helper function, but can live with it i

[clang] [CodeGen] Implement post-opt linking option for builtin bitocdes (PR #69371)

2023-11-09 Thread Matt Arsenault via cfe-commits
@@ -113,7 +120,7 @@ class EmitAssemblyHelper { const CodeGenOptions &CodeGenOpts; const clang::TargetOptions &TargetOpts; const LangOptions &LangOpts; - Module *TheModule; + llvm::Module *TheModule; arsenm wrote: Why did this suddenly need qualificatio

[clang] [CodeGen] Implement post-opt linking option for builtin bitocdes (PR #69371)

2023-11-09 Thread Matt Arsenault via cfe-commits
@@ -98,6 +100,11 @@ extern cl::opt PrintPipelinePasses; static cl::opt ClSanitizeOnOptimizerEarlyEP( "sanitizer-early-opt-ep", cl::Optional, cl::desc("Insert sanitizers on OptimizerEarlyEP."), cl::init(false)); + +// Re-link builtin bitcodes after optimization +static c

[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #68932)

2023-11-09 Thread Matt Arsenault via cfe-commits
@@ -1809,6 +1816,23 @@ bool SIInsertWaitcnts::shouldFlushVmCnt(MachineLoop *ML, return HasVMemLoad && UsesVgprLoadedOutside; } +bool SIInsertWaitcnts::insertWaitcntAfterMemOp(MachineFunction &MF) { + bool Modified = false; + + for (auto &MBB : MF) { arsenm

[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #68932)

2023-11-09 Thread Matt Arsenault via cfe-commits
@@ -1809,6 +1816,23 @@ bool SIInsertWaitcnts::shouldFlushVmCnt(MachineLoop *ML, return HasVMemLoad && UsesVgprLoadedOutside; } +bool SIInsertWaitcnts::insertWaitcntAfterMemOp(MachineFunction &MF) { + bool Modified = false; + + for (auto &MBB : MF) { arsenm

[llvm] [clang] [openmp] [OpenMP] Rework handling of global ctor/dtors in OpenMP (PR #71739)

2023-11-09 Thread Matt Arsenault via cfe-commits
@@ -2794,6 +2794,14 @@ void ItaniumCXXABI::registerGlobalDtor(CodeGenFunction &CGF, const VarDecl &D, if (D.isNoDestroy(CGM.getContext())) return; + // OpenMP offloading supports C++ constructors and destructors but we do not + // always have 'atexit' available. Inste

[clang] clang: Add pragma clang fp reciprocal (PR #68267)

2023-11-09 Thread Matt Arsenault via cfe-commits
@@ -0,0 +1,130 @@ +// RUN: %clang_cc1 -O3 -triple %itanium_abi_triple -emit-llvm -o - %s | FileCheck -check-prefixes=CHECK,DEFAULT %s +// RUN: %clang_cc1 -O3 -triple %itanium_abi_triple -freciprocal-math -emit-llvm -o - %s | FileCheck -check-prefixes=CHECK,FLAG %s + +float base(

[clang] [clang][CGExprScalar] Remove no-op ptr-to-ptr bitcast (NFC) (PR #72072)

2023-11-12 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm edited https://github.com/llvm/llvm-project/pull/72072 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [clang][CGExprScalar] Remove no-op ptr-to-ptr bitcast (NFC) (PR #72072)

2023-11-12 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm approved this pull request. https://github.com/llvm/llvm-project/pull/72072 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [clang][CGExprScalar] Remove no-op ptr-to-ptr bitcast (NFC) (PR #72072)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -2227,14 +2227,6 @@ Value *ScalarExprEmitter::VisitCastExpr(CastExpr *CE) { llvm::Value *V = CE->changesVolatileQualification() ? EmitLoadOfLValue(CE) : Visit(const_cast(E)); -if (V) { - // CK_NoOp can model a po

[llvm] [clang] [compiler-rt] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm commented: 128-bit division should already work, we have an IR integer division expansion for > 64-bit divides. I think moving towards getting the infrastructure to a place where we can link in compiler-rt binaries is a good thing, but I don't think we're in a positio

[compiler-rt] [clang] [llvm] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm edited https://github.com/llvm/llvm-project/pull/71978 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[compiler-rt] [llvm] [clang] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -937,27 +938,105 @@ bool CodeGenAction::loadLinkModules(CompilerInstance &CI) { for (const CodeGenOptions::BitcodeFileToLink &F : CI.getCodeGenOpts().LinkBitcodeFiles) { -auto BCBuf = CI.getFileManager().getBufferForFile(F.Filename); -if (!BCBuf) { + +

[clang] [llvm] [compiler-rt] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -937,27 +938,105 @@ bool CodeGenAction::loadLinkModules(CompilerInstance &CI) { for (const CodeGenOptions::BitcodeFileToLink &F : CI.getCodeGenOpts().LinkBitcodeFiles) { -auto BCBuf = CI.getFileManager().getBufferForFile(F.Filename); -if (!BCBuf) { + +

[clang] [llvm] [compiler-rt] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -937,27 +938,105 @@ bool CodeGenAction::loadLinkModules(CompilerInstance &CI) { for (const CodeGenOptions::BitcodeFileToLink &F : CI.getCodeGenOpts().LinkBitcodeFiles) { -auto BCBuf = CI.getFileManager().getBufferForFile(F.Filename); -if (!BCBuf) { + +

[clang] [compiler-rt] [llvm] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -937,27 +938,105 @@ bool CodeGenAction::loadLinkModules(CompilerInstance &CI) { for (const CodeGenOptions::BitcodeFileToLink &F : CI.getCodeGenOpts().LinkBitcodeFiles) { -auto BCBuf = CI.getFileManager().getBufferForFile(F.Filename); -if (!BCBuf) { + +

[clang] [llvm] [compiler-rt] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -937,27 +938,105 @@ bool CodeGenAction::loadLinkModules(CompilerInstance &CI) { for (const CodeGenOptions::BitcodeFileToLink &F : CI.getCodeGenOpts().LinkBitcodeFiles) { -auto BCBuf = CI.getFileManager().getBufferForFile(F.Filename); -if (!BCBuf) { + +

[clang] [llvm] [compiler-rt] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -596,6 +596,7 @@ static bool mustPreserveGV(const GlobalValue &GV) { if (const Function *F = dyn_cast(&GV)) return F->isDeclaration() || F->getName().startswith("__asan_") || F->getName().startswith("__sanitizer_") || + F->getName() == "__divti3" |

[clang] [llvm] [compiler-rt] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -937,27 +938,105 @@ bool CodeGenAction::loadLinkModules(CompilerInstance &CI) { for (const CodeGenOptions::BitcodeFileToLink &F : CI.getCodeGenOpts().LinkBitcodeFiles) { -auto BCBuf = CI.getFileManager().getBufferForFile(F.Filename); -if (!BCBuf) { + +

[clang] [llvm] [compiler-rt] [HIP] support 128 bit int division (PR #71978)

2023-11-12 Thread Matt Arsenault via cfe-commits
@@ -3630,10 +3631,17 @@ SDValue SITargetLowering::LowerCall(CallLoweringInfo &CLI, std::vector Ops; Ops.push_back(Chain); + bool AddTargetGlobalAddr = true; + // Try to find the callee in the current module. + if (isa(Callee)) { +Callee = DAG.getSymbolFunctionGloba

[clang] clang: Add pragma clang fp reciprocal (PR #68267)

2023-11-13 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/68267 >From f09d8efdcbb5ffb9cd39d686205a120b6a82a01b Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Thu, 31 Aug 2023 17:33:35 -0400 Subject: [PATCH] clang: Add pragma clang fp reciprocal Just follow along with the

[clang] [llvm] [flang] [NFC][AMDGPU] Move address space enum to LLVM directory (PR #73944)

2023-12-06 Thread Matt Arsenault via cfe-commits
@@ -0,0 +1,31 @@ +//=== AMDGPUAddrSpace.h -*- C++ -*-===// +// +// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions. +// See https://llvm.org/LICENSE.txt for license information. +// SPDX-License-Identifier: Apa

[clang] [clang][AMDGPU] Update amdgpu_waves_per_eu attr docs (PR #74587)

2023-12-06 Thread Matt Arsenault via cfe-commits
@@ -2659,8 +2659,9 @@ An error will be given if: - Specified values violate subtarget specifications; - Specified values are not compatible with values provided through other attributes; - - The AMDGPU target backend is unable to create machine code that can meet the

[clang] [clang][AMDGPU] Update amdgpu_waves_per_eu attr docs (PR #74587)

2023-12-07 Thread Matt Arsenault via cfe-commits
https://github.com/arsenm approved this pull request. https://github.com/llvm/llvm-project/pull/74587 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

  1   2   3   4   5   6   7   8   9   10   >