date:20241121

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117257 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117257 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] AMDGPU: Handle v_mfma_f64_16x16x4_f64 srcc write VGPR hazard change for gfx950 (PR #117283)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes Read by sgemm/dgemm in srcc after v_mfma_f64_16x16x4_f64 increases from 9 to 17 wait states. --- Full diff: https://github.com/llvm/llvm-project/pull/117283.diff 2 Files Affected: - (modified) ll

[llvm-branch-commits] [llvm] AMDGPU: Refine gfx950 xdl-write-vgpr hazard cases (PR #117285)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes The 2-pass XDL write VGPR, read by non-XDL SGEMM/DGEMM case was 1 wait state overly conservative. Previously, for gfx940, the XDL/non-XDL cases happened to have the same number of cycles in all cases

[llvm-branch-commits] [llvm] AMDGPU: Handle vcmpx+permalane gfx950 hazard (PR #117286)

2024-11-21 Thread via llvm-branch-commits

github-actions[bot] wrote: :warning: C/C++ code formatter, clang-format found issues in your code. :warning: You can test this locally with the following command: ``bash git-clang-format --diff 52f540df160ad84aef090acb35c9372c270d758b 0cbee40e03bff1514abbf1e879522a4808175c1a --e

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard (PR #117262)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117262 >From 06412577e65e05abf3edc1a884edc8640b924933 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Thu, 7 Mar 2024 15:01:08 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard

[llvm-branch-commits] [llvm] AMDGPU: Handle vcmpx+permalane gfx950 hazard (PR #117286)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes Confusingly, this is a different hazard to the one on gfx10 with a subtarget feature. --- Full diff: https://github.com/llvm/llvm-project/pull/117286.diff 3 Files Affected: - (modified) llvm/lib/

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 XDL-write-overlapped-smfma-src-c wait state change (PR #117263)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117263 >From 087117bc3dc327237d52746813e932d4c8f0b8bc Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Wed, 6 Mar 2024 19:51:00 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 XDL-write-overlapped-smfma-src-c wait stat

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117260 >From 426d5baaf7d373a6d35ead2af4515e108a6eb8b8 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Mon, 22 Jan 2024 12:40:54 +0700 Subject: [PATCH] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 (PR #117259)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117259 >From d5b3bb6210d19c81a935790c5267c3d97125a00d Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:43:00 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 --- clang/

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117257 >From 698095bb278b20ff853018b997a563a2387eeca6 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:09:21 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 --- clang/

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 (PR #117258)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117258 >From 32ccf3950258693e8ca7be1c7ecc6670debc2bf7 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:25:33 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 --- cla

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits

https://github.com/wangpc-pp updated https://github.com/llvm/llvm-project/pull/116231 >From 9686a2c5c5276289e72d9098f497a9f246a1c457 Mon Sep 17 00:00:00 2001 From: Wang Pengcheng Date: Thu, 14 Nov 2024 22:06:45 +0800 Subject: [PATCH 1/4] Remove stale CHECKs Created using spr 1.3.6-beta.1 ---

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits

https://github.com/wangpc-pp updated https://github.com/llvm/llvm-project/pull/116231 >From 9686a2c5c5276289e72d9098f497a9f246a1c457 Mon Sep 17 00:00:00 2001 From: Wang Pengcheng Date: Thu, 14 Nov 2024 22:06:45 +0800 Subject: [PATCH 1/4] Remove stale CHECKs Created using spr 1.3.6-beta.1 ---

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits

@@ -58,6 +58,19 @@ bool hasFastVectorUnalignedAccess(StringRef CPU) { return Info && Info->FastVectorUnalignedAccess; } +bool hasValidCPUModel(StringRef CPU) { + const CPUModel CPUModel = getCPUModel(CPU); + return CPUModel.MVendorID != 0 && CPUModel.MArchID != 0 && --

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits

https://github.com/wangpc-pp updated https://github.com/llvm/llvm-project/pull/116231 >From 9686a2c5c5276289e72d9098f497a9f246a1c457 Mon Sep 17 00:00:00 2001 From: Wang Pengcheng Date: Thu, 14 Nov 2024 22:06:45 +0800 Subject: [PATCH 1/4] Remove stale CHECKs Created using spr 1.3.6-beta.1 ---

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits

https://github.com/wangpc-pp updated https://github.com/llvm/llvm-project/pull/116231 >From 9686a2c5c5276289e72d9098f497a9f246a1c457 Mon Sep 17 00:00:00 2001 From: Wang Pengcheng Date: Thu, 14 Nov 2024 22:06:45 +0800 Subject: [PATCH 1/4] Remove stale CHECKs Created using spr 1.3.6-beta.1 ---

[llvm-branch-commits] [llvm] 14b474b - Revert "[LV][VPlan] Remove any-of reduction from precomputeCost. NFC (#117109)"

2024-11-21 Thread via llvm-branch-commits

Author: Elvis Wang Date: 2024-11-22T11:32:12+08:00 New Revision: 14b474be36144527a55b5d49954379a3484c5f84 URL: https://github.com/llvm/llvm-project/commit/14b474be36144527a55b5d49954379a3484c5f84 DIFF: https://github.com/llvm/llvm-project/commit/14b474be36144527a55b5d49954379a3484c5f84.diff LO

[llvm-branch-commits] [llvm] [LoongArch] Fix GOT usage for `non-dso_local` function calls in large code model (PR #117134)

2024-11-21 Thread Lu Weining via llvm-branch-commits

https://github.com/SixWeining approved this pull request. https://github.com/llvm/llvm-project/pull/117134 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_fp8 for gfx950 (PR #117233)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: ### Merge activity * **Nov 21, 7:53 PM EST**: A user started a stack merge that includes this pull request via [Graphite](https://app.graphite.dev/github/pr/llvm/llvm-project/117233). https://github.com/llvm/llvm-project/pull/117233 _

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 (PR #117258)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117258 >From 24576df683abfa29c9d7f4406a318b6b67701732 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:25:33 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 --- cla

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117260 >From 549b571ea25a06301f719778786a288d85604464 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Mon, 22 Jan 2024 12:40:54 +0700 Subject: [PATCH] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117257 >From 73f8fed93b6fd985cf79d384fee64fc506ceb062 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:09:21 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 --- clang/

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 XDL-write-overlapped-smfma-src-c wait state change (PR #117263)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117263 >From 736d914241979efb46b506fb45cee79e73bbd20e Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Wed, 6 Mar 2024 19:51:00 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 XDL-write-overlapped-smfma-src-c wait stat

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard (PR #117262)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117262 >From fc9424bd9d0d54a931f4059ff9a6f657f1c5a2dd Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Thu, 7 Mar 2024 15:01:08 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 (PR #117259)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117259 >From d36a1301eb84377617c35c125e136230327eb3e9 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:43:00 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 --- clang/

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_fp8 for gfx950 (PR #117235)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: ### Merge activity * **Nov 21, 7:53 PM EST**: A user started a stack merge that includes this pull request via [Graphite](https://app.graphite.dev/github/pr/llvm/llvm-project/117235). https://github.com/llvm/llvm-project/pull/117235 _

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_bf8 for gfx950 (PR #117234)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: ### Merge activity * **Nov 21, 7:53 PM EST**: A user started a stack merge that includes this pull request via [Graphite](https://app.graphite.dev/github/pr/llvm/llvm-project/117234). https://github.com/llvm/llvm-project/pull/117234 _

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_fp8 for gfx950 (PR #117233)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117233 >From 26c1f2d322e0ff8cbeb4d79052b26e5858a77332 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Fri, 2 Feb 2024 23:21:12 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_fp8 for gfx950 --- clang

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117257 None >From d4deb51fecabd75d85ab8c7a00645edb696e8c1f Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:09:21 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 ---

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117260 This was a bit annoying because these introduce a new special case encoding usage. op_sel is repurposed as a subset of dpp controls, and is eligible for VOP3->VOP1 shrinking. For some reason fi also uses an enum v

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_16x16x128_i8 for gfx950 (PR #117213)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117213 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-clang Author: Matt Arsenault (arsenm) Changes --- Patch is 33.75 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117257.diff 12 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def (+

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-mc Author: Matt Arsenault (arsenm) Changes --- Patch is 33.75 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117257.diff 12 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def (+1)

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-llvm-ir Author: Matt Arsenault (arsenm) Changes --- Patch is 33.75 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117257.diff 12 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 (PR #117259)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes --- Patch is 33.74 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117259.diff 12 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDG

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard (PR #117262)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117262?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard (PR #117262)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117262 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] AMDGPU: Define new sched model for gfx950 (PR #117261)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes A few instructions changed rate. --- Patch is 306.15 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117261.diff 7 Files Affected: - (modified) llvm/l

[llvm-branch-commits] [llvm] AMDGPU: Define new sched model for gfx950 (PR #117261)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117261 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes This was a bit annoying because these introduce a new special case encoding usage. op_sel is repurposed as a subset of dpp controls, and is eligible for VOP3->VOP1 shrinking. For some reason fi also

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117260 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard (PR #117262)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117262 Increase from 11 wait states to 19 >From faded39a3a6553935fa0e4c62e836f804552c320 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Thu, 7 Mar 2024 15:01:08 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 change

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 (PR #117259)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117259 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 (PR #117258)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117258 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117257 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_bf8 for gfx950 (PR #117256)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117256 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_bf8 for gfx950 (PR #117256)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes --- Patch is 35.26 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117256.diff 13 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDG

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_bf8 for gfx950 (PR #117232)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117232 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 (PR #117259)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117259?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117212 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] AMDGPU: Define new sched model for gfx950 (PR #117261)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117261?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117260?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 (PR #117258)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117258?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_bf8 for gfx950 (PR #117256)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117256?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 (PR #117258)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117258 None >From 88d7337bfb2d636f3e0823faf2897f0a3c9a8a9e Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:25:33 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 --

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117257?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 (PR #117259)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117259 None >From 253fcf59235b1bc4edd0b8ce811f900606aa2a67 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:43:00 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 ---

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_bf8 for gfx950 (PR #117256)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117256 None >From 71df33ede8a81ec9ba59e95a4de81911c05f36dc Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 20:48:16 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_bf8 for gfx950 ---

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_bf8 for gfx950 (PR #117234)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117234 >From 21f33495f0a3436cd3038446cf19c16a1109447b Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 20:15:31 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_bf8 for gfx950 --- clang

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_fp8 for gfx950 (PR #117235)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117235 >From b006b680e7b4ccae5cc3a2ec918b3b237c9836f1 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 20:43:58 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_fp8 for gfx950 --- clang

[llvm-branch-commits] [flang] [flang][OpenMP] Avoid early returns, NFC (PR #117231)

2024-11-21 Thread Kiran Chandramohan via llvm-branch-commits

https://github.com/kiranchandramohan approved this pull request. LG. Thanks. https://github.com/llvm/llvm-project/pull/117231 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-b

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117214 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_bf8 for gfx950 (PR #117234)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117234 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_f16 for gfx950 (PR #117205)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117205 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_bf8 for gfx950 (PR #117234)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117234 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_fp8 for gfx950 (PR #117235)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117235 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_fp8 for gfx950 (PR #117235)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117235 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_fp8 for gfx950 (PR #117233)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117233 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_fp8 for gfx950 (PR #117233)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117233 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_bf8 for gfx950 (PR #117232)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117232 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117214 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_16x16x128_i8 for gfx950 (PR #117213)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117213 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117212 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117211 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117211 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_f16 for gfx950 (PR #117205)

2024-11-21 Thread Shilei Tian via llvm-branch-commits

https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117205 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [HLSL] Add RWBuffer::Load(Index) (PR #117018)

2024-11-21 Thread Helena Kotas via llvm-branch-commits

https://github.com/hekota edited https://github.com/llvm/llvm-project/pull/117018 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [HLSL] Add RWBuffer::Load(Index) (PR #117018)

2024-11-21 Thread Helena Kotas via llvm-branch-commits

@@ -189,12 +189,28 @@ struct BuiltinTypeDeclBuilder { BuiltinTypeDeclBuilder &addArraySubscriptOperators(Sema &S) { if (Record->isCompleteDefinition()) return *this; -addArraySubscriptOperator(S, true); -addArraySubscriptOperator(S, false); +ASTContext &

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117211 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread Sirish Pande via llvm-branch-commits

https://github.com/srpande approved this pull request. lgrm https://github.com/llvm/llvm-project/pull/117214 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-llvm-ir Author: Matt Arsenault (arsenm) Changes --- Patch is 34.65 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117214.diff 13 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-clang Author: Matt Arsenault (arsenm) Changes --- Patch is 34.65 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117214.diff 13 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def (+

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117214 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_16x16x128_i8 for gfx950 (PR #117213)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-llvm-ir Author: Matt Arsenault (arsenm) Changes --- Patch is 24.76 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117213.diff 12 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117212 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_16x16x128_i8 for gfx950 (PR #117213)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117213 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes --- Patch is 40.89 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117212.diff 12 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDG

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-llvm-ir Author: Matt Arsenault (arsenm) Changes --- Patch is 40.89 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117212.diff 12 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-llvm-analysis @llvm/pr-subscribers-clang Author: Matt Arsenault (arsenm) Changes --- Patch is 40.89 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117212.diff 12 Files Affected: - (modified) clang/includ

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread via llvm-branch-commits

llvmbot wrote: @llvm/pr-subscribers-llvm-analysis @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes --- Patch is 33.99 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117211.diff 13 Files Affected: - (modified) cla

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117214?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_16x16x128_i8 for gfx950 (PR #117213)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117213?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117211?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117212?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117212 None >From 951b5eff79c8472e6cafb0c25c06e5950aa53991 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Fri, 2 Feb 2024 15:01:35 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 --- cla

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117214 None >From f10f99c90d5a1919f60b58b9aa4ed48acdad99ff Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Fri, 2 Feb 2024 20:15:33 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 --- clang

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_16x16x128_i8 for gfx950 (PR #117213)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117213 None >From 8e48083967a60dc4a4a7c804a32be9b87ae7746d Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Fri, 2 Feb 2024 15:52:26 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_i32_16x16x128_i8 for gfx950 --- clan

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117211 None >From 34b7142bb2d1f2bd47a774d23b2a7cfc9a571b51 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Fri, 2 Feb 2024 14:43:25 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 --- cla

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_f16 for gfx950 (PR #117202)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits

https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117202 None >From 8f21ee1c353df309defdc980d9a84ee89e70ddf8 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Thu, 1 Feb 2024 15:01:47 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_16x16x64_f16 for gfx950 --- clan

1 2 >

1 - 100 of 162 matches

Mail list logo