https://bugs.llvm.org/show_bug.cgi?id=45926

            Bug ID: 45926
           Summary: [AMDGPU][MC][GFX8+] Instructions v_sub_u16_e64,
                    v_subrev_u16_e64 and v_add_u16_e64 do not support
                    clamp modifier
           Product: libraries
           Version: trunk
          Hardware: All
                OS: All
            Status: NEW
          Severity: enhancement
          Priority: P
         Component: Backend: AMDGPU
          Assignee: unassignedb...@nondot.org
          Reporter: dpreobrazhen...@luxoft.com
                CC: llvm-bugs@lists.llvm.org

According to sp3 documentation, both _e64 and _sdwa variants of these opcodes
should support clamp. However llvm assembler supports clamp for _sdwa variants
only.

These opcodes were renamed to v_*_nc_u16 in gfx10 - they should also be
corrected to support clamp.

Examples of failed tests:

    v_sub_u16_e64 v5, v1, v2 clamp    // gfx8, gfx9
    v_sub_nc_u16 v5, v1, v2 clamp     // gfx10

An example of a passed test:

    v_sub_u16_sdwa v5, v1, v2 clamp   // gfx8, gfx9

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
llvm-bugs mailing list
llvm-bugs@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-bugs

Reply via email to