LLVM/project 85e290bllvm/test/CodeGen/AMDGPU bf16.ll, llvm/test/CodeGen/NVPTX cmpxchg-sm90.ll cmpxchg-sm70.ll

Merge branch 'main' into users/meinersbur/flang_runtime_remove-FLANG_INCLUDE_RUNTIME
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,350-4,196llvm/test/CodeGen/AMDGPU/bf16.ll
+6,173-0llvm/test/Instrumentation/MemorySanitizer/AArch64/arm64-vmul.ll
+0-6,173llvm/test/Instrumentation/MemorySanitizer/AArch64/neon_vmul.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+90,478-10,3696,999 files not shown
+425,286-177,3787,005 files

LLVM/project 7dc7264llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'main' into users/ylzsx/r-tls-noie
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+29,745-0llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+27,655-0llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+27,655-0llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+27,139-0llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+25,933-0llvm/test/CodeGen/AMDGPU/shufflevector.v2f32.v8f32.ll
+203,722-016,100 files not shown
+2,040,184-604,85516,106 files

LLVM/project 8cb72bdllvm/test/CodeGen/NVPTX cmpxchg-sm90.ll cmpxchg-sm60.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

reb

Created using spr 1.3.4
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+0-6,173llvm/test/Instrumentation/MemorySanitizer/AArch64/neon_vmul.ll
+6,173-0llvm/test/Instrumentation/MemorySanitizer/AArch64/arm64-vmul.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm60.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+88,808-6,1733,815 files not shown
+253,453-75,5073,821 files

LLVM/project 8c41ae6lldb/tools/lldb-dap lldb-dap.cpp, llvm/test/CodeGen/AMDGPU bf16.ll

Merge branch 'main' into users/meinersbur/flang_runtime_premerge
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,194-3,880llvm/test/CodeGen/AMDGPU/bf16.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm60.ll
+309-5,049lldb/tools/lldb-dap/lldb-dap.cpp
+90,138-8,9295,013 files not shown
+307,445-100,3025,019 files

LLVM/project d7bc6d6clang/lib/Headers avx10_2convertintrin.h, llvm/test/CodeGen/NVPTX cmpxchg-sm70.ll cmpxchg-sm60.ll

Rebase, address comments

Created using spr 1.3.5
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm60.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+4,730-0llvm/test/tools/llvm-mca/RISCV/SiFiveP600/vlseg-vsseg.s
+2,854-51clang/lib/Headers/avx10_2convertintrin.h
+90,219-512,963 files not shown
+198,153-49,5632,969 files

LLVM/project a97bbe6lldb/tools/lldb-dap lldb-dap.cpp, llvm/test/CodeGen/AMDGPU bf16.ll

rebase

Created using spr 1.3.5-bogner
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,194-3,880llvm/test/CodeGen/AMDGPU/bf16.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm60.ll
+309-5,049lldb/tools/lldb-dap/lldb-dap.cpp
+90,138-8,9294,327 files not shown
+283,321-85,4414,333 files

LLVM/project 519ae24llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/nest-base' into users/chapuni/mcdc/nest/trunk
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,85810,567 files not shown
+953,290-609,13410,573 files

LLVM/project 03c851allvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/bitmapaddr' into users/chapuni/mcdc/nest/nest-base

Conflicts:
        clang/test/CoverageMapping/mcdc-single-cond.cpp
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+102,453-36,85810,567 files not shown
+953,290-609,13410,573 files

LLVM/project d6cb61allvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/tests' into users/chapuni/mcdc/nest/bitmapaddr
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+102,453-36,85810,567 files not shown
+953,323-609,04210,573 files

LLVM/project 457cf4bllvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/tests' into users/chapuni/mcdc/nest/mcdcstate
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,85810,567 files not shown
+953,323-609,04210,573 files

LLVM/project aac9c6dllvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/tests' into users/chapuni/mcdc/nest/lnot
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,85810,566 files not shown
+953,289-609,12910,572 files

LLVM/project e57bdccllvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/tests' into users/chapuni/mcdc/nest/expect
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,85810,566 files not shown
+953,289-609,12910,572 files

LLVM/project 9a175d0llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'main' into users/chapuni/mcdc/nest/tests
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+102,453-36,85810,565 files not shown
+953,289-609,03210,571 files

LLVM/project 967dc03llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

run 'git merge main'
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,8588,560 files not shown
+809,659-558,2578,566 files

LLVM/project 3bb5867lldb/tools/lldb-dap lldb-dap.cpp, llvm/test/CodeGen/AMDGPU llvm.log10.ll llvm.log.ll

Fix formatting

Created using spr 1.3.5
DeltaFile
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm60.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+26-2,694lldb/tools/lldb-dap/lldb-dap.cpp
+764-833llvm/test/CodeGen/AMDGPU/llvm.log10.ll
+764-833llvm/test/CodeGen/AMDGPU/llvm.log.ll
+18,594-4,3601,391 files not shown
+65,876-28,1571,397 files

LLVM/project 4d1c3abclang/lib/Sema SemaHLSL.cpp, clang/test/AST/HLSL default_cbuffer.hlsl

rebase

Created using spr 1.3.4
DeltaFile
+121-0clang/test/CodeGenHLSL/cbuffer_align.hlsl
+64-8clang/lib/Sema/SemaHLSL.cpp
+8-58llvm/test/CodeGen/RISCV/rvv/vreductions-fp-vp.ll
+32-32clang/test/CodeGenHLSL/basic_types.hlsl
+50-0llvm/test/CodeGen/RISCV/rvv/fixed-vectors-reduction-fp-vp.ll
+50-0clang/test/AST/HLSL/default_cbuffer.hlsl
+325-9837 files not shown
+792-23443 files

LLVM/project 5a4736bclang/lib/AST Decl.cpp, clang/lib/Sema SemaHLSL.cpp

rebase

Created using spr 1.3.4
DeltaFile
+121-0clang/test/CodeGenHLSL/cbuffer_align.hlsl
+64-8clang/lib/Sema/SemaHLSL.cpp
+32-32clang/test/CodeGenHLSL/basic_types.hlsl
+50-0clang/test/AST/HLSL/default_cbuffer.hlsl
+43-0clang/lib/AST/Decl.cpp
+39-0clang/test/CodeGenHLSL/default_cbuffer.hlsl
+349-4030 files not shown
+681-16636 files

LLVM/project 1b39328llvm/lib/CodeGen MachineInstr.cpp, llvm/test/CodeGen/AArch64 inline-asm-speculation.ll

[CodeGen] Fix MachineInstr::isSafeToMove handling of inline asm. (#126807)

Even if an inline asm doesn't have memory effects, we can't assume it's
safe to speculate: it could trap, or cause undefined behavior. At the
LLVM IR level, this is handled correctly: we don't speculate inline asm
(unless it's marked "speculatable", but I don't think anyone does that).
Codegen also needs to respect this restriction.

This change stops Early If Conversion and similar passes from
speculating an INLINEASM MachineInstr.

Some uses of isSafeToMove probably could be switched to a different API:
isSafeToMove assumes you're hoisting, but we could handle some forms of
sinking more aggressively. But I'll leave that for a followup, if it
turns out to be relevant.

See also discussion on gcc bugtracker
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102150 .
DeltaFile
+38-0llvm/test/CodeGen/AArch64/inline-asm-speculation.ll
+16-12llvm/test/CodeGen/AMDGPU/sdwa-peephole.ll
+18-1llvm/lib/CodeGen/MachineInstr.cpp
+8-8llvm/test/CodeGen/ARM/arm-shrink-wrapping.ll
+6-6llvm/test/CodeGen/X86/x86-shrink-wrapping.ll
+6-6llvm/test/CodeGen/X86/x86-win64-shrink-wrapping.ll
+92-332 files not shown
+94-428 files

LLVM/project 72fd292llvm/test/CodeGen/AMDGPU gfx-callable-argument-types.ll, llvm/test/CodeGen/RISCV/rvv vsoxseg-rv64.ll vluxseg-rv64.ll

Merge commit '2118b9d39b91e93c0146611235072cd6ca0f27b1^' into HEAD
DeltaFile
+8,414-8,431llvm/test/CodeGen/SystemZ/Large/branch-01.ll
+6,033-3,884llvm/test/CodeGen/AMDGPU/gfx-callable-argument-types.ll
+3,318-3,318llvm/test/CodeGen/RISCV/rvv/vsoxseg-rv64.ll
+3,318-3,318llvm/test/CodeGen/RISCV/rvv/vluxseg-rv64.ll
+3,318-3,318llvm/test/CodeGen/RISCV/rvv/vloxseg-rv64.ll
+3,318-3,318llvm/test/CodeGen/RISCV/rvv/vsuxseg-rv64.ll
+27,719-25,5877,153 files not shown
+259,838-233,5057,159 files

LLVM/project e1acf65llvm/test/CodeGen/AArch64/Atomics aarch64-atomicrmw-v8a.ll aarch64-atomicrmw-rcpc.ll, llvm/test/CodeGen/AMDGPU gfx-callable-argument-types.ll

Merge commit 'f9599bbc7a3f831e1793a549d8a7a19265f3e504^' into users/meinersbur/irbuilder-extract-refactor
DeltaFile
+8,414-8,431llvm/test/CodeGen/SystemZ/Large/branch-01.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-v8a.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-rcpc.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-lse2.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-rcpc3.ll
+5,620-3,841llvm/test/CodeGen/AMDGPU/gfx-callable-argument-types.ll
+52,750-12,27221,723 files not shown
+2,716,037-1,117,08321,729 files

LLVM/project bdf2fbbllvm/test/CodeGen/AMDGPU schedule-regpressure-limit.ll schedule-regpressure-limit3.ll

[AMDGPU] Convert some tests to opaque pointers (NFC)
DeltaFile
+512-512llvm/test/CodeGen/AMDGPU/schedule-regpressure-limit.ll
+512-512llvm/test/CodeGen/AMDGPU/schedule-regpressure-limit3.ll
+512-512llvm/test/CodeGen/AMDGPU/schedule-ilp.ll
+464-539llvm/test/CodeGen/AMDGPU/global-saddr-load.ll
+391-501llvm/test/CodeGen/AMDGPU/global-saddr-atomics.ll
+420-420llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-i16-to-i32.ll
+2,811-2,996599 files not shown
+23,988-24,600605 files

LLVM/project 94c79cellvm/lib/Target/AMDGPU SIRemoveShortExecBranches.cpp, llvm/test/CodeGen/AMDGPU atomic_optimizations_local_pointer.ll collapse-endcf.ll

Revert "[AMDGPU] Invert the handling of skip insertion."

This reverts commit 0dc6c249bffac9f23a605ce4e42a84341da3ddbd.

The commit is reported to cause a regression in piglit/bin/glsl-vs-loop for
Mesa.

(cherry picked from commit a80291ce10ba9667352adcc895f9668144f5f616)
DeltaFile
+208-104llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll
+0-158llvm/lib/Target/AMDGPU/SIRemoveShortExecBranches.cpp
+31-20llvm/test/CodeGen/AMDGPU/collapse-endcf.ll
+25-17llvm/test/CodeGen/AMDGPU/valu-i1.ll
+12-11llvm/test/CodeGen/AMDGPU/ret_jump.ll
+9-7llvm/test/CodeGen/AMDGPU/wave32.ll
+285-31734 files not shown
+381-39140 files

LLVM/project 0233b37clang/test/CodeGen aarch64-neon-2velem.c, lldb/source/Symbol TypeSystemClang.cpp ClangASTContext.cpp

[DPWBS-1239] Merge community master into htc/master

The following TriCore files needed to be touched due to changes
in the upstream community:

TriCoreMCTargetDesc.cpp:     adcd02683856c30ba6f349279509acecd90063df
TriCoreSubtarget.cpp:        adcd02683856c30ba6f349279509acecd90063df
TriCoreLegalizerInfo.cpp/.h: c5fffa4da35f0fcc89b5ea88cc1bc60bc475a18e

The following tests were updated/deleted due to changes in the
GlobalISel legalizer:

xfail-legalize-non-pow-2-memory.mir:   dc141af7553871b94f0d7cb4b1f2096578a923be
                                       9965b12fd1bcb78396fbea2c28d80068e43b31a3
legalize-load-store.mir:               dc141af7553871b94f0d7cb4b1f2096578a923be
                                       9965b12fd1bcb78396fbea2c28d80068e43b31a3
legalize-unmerge-values{-invalid}.mir: 2a160ba5b0ad065ee7020c787e7f896416be3faa
DeltaFile
+9,318-0lldb/source/Symbol/TypeSystemClang.cpp
+0-9,311lldb/source/Symbol/ClangASTContext.cpp
+3,242-2,437clang/test/CodeGen/aarch64-neon-2velem.c
+445-2,821llvm/tools/dsymutil/DwarfLinkerForBinary.cpp
+2,438-0llvm/lib/DWARFLinker/DWARFLinker.cpp
+2,294-0mlir/docs/doxygen.cfg.in
+17,737-14,5695,106 files not shown
+165,328-58,0805,112 files

LLVM/project e53a9d9llvm/lib/Target/AMDGPU SIRemoveShortExecBranches.cpp, llvm/test/CodeGen/AMDGPU atomic_optimizations_local_pointer.ll collapse-endcf.ll

Resubmit: [AMDGPU] Invert the handling of skip insertion.

The current implementation of skip insertion (SIInsertSkip) makes it a
mandatory pass required for correctness. Initially, the idea was to
have an optional pass. This patch inserts the s_cbranch_execz upfront
during SILowerControlFlow to skip over the sections of code when no
lanes are active. Later, SIRemoveShortExecBranches removes the skips
for short branches, unless there is a sideeffect and the skip branch is
really necessary.

This new pass will replace the handling of skip insertion in the
existing SIInsertSkip Pass.

Differential revision: https://reviews.llvm.org/D68092
DeltaFile
+104-208llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll
+158-0llvm/lib/Target/AMDGPU/SIRemoveShortExecBranches.cpp
+19-30llvm/test/CodeGen/AMDGPU/collapse-endcf.ll
+17-25llvm/test/CodeGen/AMDGPU/valu-i1.ll
+11-12llvm/test/CodeGen/AMDGPU/ret_jump.ll
+7-9llvm/test/CodeGen/AMDGPU/wave32.ll
+316-28434 files not shown
+390-38040 files

LLVM/project a80291cllvm/lib/Target/AMDGPU SIRemoveShortExecBranches.cpp, llvm/test/CodeGen/AMDGPU atomic_optimizations_local_pointer.ll collapse-endcf.ll

Revert "[AMDGPU] Invert the handling of skip insertion."

This reverts commit 0dc6c249bffac9f23a605ce4e42a84341da3ddbd.

The commit is reported to cause a regression in piglit/bin/glsl-vs-loop for
Mesa.
DeltaFile
+208-104llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll
+0-158llvm/lib/Target/AMDGPU/SIRemoveShortExecBranches.cpp
+31-20llvm/test/CodeGen/AMDGPU/collapse-endcf.ll
+25-17llvm/test/CodeGen/AMDGPU/valu-i1.ll
+12-11llvm/test/CodeGen/AMDGPU/ret_jump.ll
+9-7llvm/test/CodeGen/AMDGPU/wave32.ll
+285-31734 files not shown
+381-39140 files

LLVM/project 0dc6c24llvm/lib/Target/AMDGPU SIRemoveShortExecBranches.cpp, llvm/test/CodeGen/AMDGPU atomic_optimizations_local_pointer.ll collapse-endcf.ll

[AMDGPU] Invert the handling of skip insertion.

The current implementation of skip insertion (SIInsertSkip) makes it a
mandatory pass required for correctness. Initially, the idea was to
have an optional pass. This patch inserts the s_cbranch_execz upfront
during SILowerControlFlow to skip over the sections of code when no
lanes are active. Later, SIRemoveShortExecBranches removes the skips
for short branches, unless there is a sideeffect and the skip branch is
really necessary.

This new pass will replace the handling of skip insertion in the
existing SIInsertSkip Pass.

Differential revision: https://reviews.llvm.org/D68092
DeltaFile
+104-208llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll
+158-0llvm/lib/Target/AMDGPU/SIRemoveShortExecBranches.cpp
+19-30llvm/test/CodeGen/AMDGPU/collapse-endcf.ll
+17-25llvm/test/CodeGen/AMDGPU/valu-i1.ll
+11-12llvm/test/CodeGen/AMDGPU/ret_jump.ll
+7-9llvm/test/CodeGen/AMDGPU/wave32.ll
+316-28434 files not shown
+390-38040 files

LLVM/project 25528d6llvm/test/CodeGen/PowerPC atomics-regression.ll, llvm/test/CodeGen/X86 avx512-shuffle-schedule.ll avx512vl-vec-masked-cmp.ll

[CodeGen] Unify MBB reference format in both MIR and debug output

As part of the unification of the debug format and the MIR format, print
MBB references as '%bb.5'.

The MIR printer prints the IR name of a MBB only for block definitions.

* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" \) -type f -print0 | xargs -0 sed -i '' -E 's/BB#" << ([a-zA-Z0-9_]+)->getNumber\(\)/" << printMBBReference(*\1)/g'
* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" \) -type f -print0 | xargs -0 sed -i '' -E 's/BB#" << ([a-zA-Z0-9_]+)\.getNumber\(\)/" << printMBBReference(\1)/g'
* find . \( -name "*.txt" -o -name "*.s" -o -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" \) -type f -print0 | xargs -0 sed -i '' -E 's/BB#([0-9]+)/%bb.\1/g'
* grep -nr 'BB#' and fix

Differential Revision: https://reviews.llvm.org/D40422

llvm-svn: 319665
DeltaFile
+1,700-1,700llvm/test/CodeGen/X86/avx512-shuffle-schedule.ll
+1,569-1,569llvm/test/CodeGen/X86/avx512vl-vec-masked-cmp.ll
+1,250-1,250llvm/test/CodeGen/X86/sse2-schedule.ll
+1,138-1,138llvm/test/CodeGen/PowerPC/atomics-regression.ll
+954-954llvm/test/CodeGen/X86/avx2-schedule.ll
+941-941llvm/test/CodeGen/X86/avx512-schedule.ll
+7,552-7,5521,429 files not shown
+50,355-50,3041,435 files

LLVM/project 3dbeefallvm/test/CodeGen/AMDGPU global_atomics.ll global_atomics_i64.ll

AMDGPU: Mark all unspecified CC functions in tests as amdgpu_kernel

Currently the default C calling convention functions are treated
the same as compute kernels. Make this explicit so the default
calling convention can be changed to a non-kernel.

Converted with perl -pi -e 's/define void/define amdgpu_kernel void/'
on the relevant test directories (and undoing in one place that actually
wanted a non-kernel).

llvm-svn: 298444
DeltaFile
+98-98llvm/test/CodeGen/AMDGPU/global_atomics.ll
+97-97llvm/test/CodeGen/AMDGPU/global_atomics_i64.ll
+97-97llvm/test/CodeGen/AMDGPU/flat_atomics_i64.ll
+96-96llvm/test/CodeGen/AMDGPU/flat_atomics.ll
+91-91llvm/test/CodeGen/AMDGPU/fneg-combines.ll
+68-68llvm/test/CodeGen/AMDGPU/imm.ll
+547-547708 files not shown
+6,248-6,248714 files

LLVM/project 7fbec9bllvm/lib/CodeGen MachineBlockPlacement.cpp, llvm/test/CodeGen/PowerPC tail-dup-layout.ll

Codegen: Make chains from trellis-shaped CFGs

Lay out trellis-shaped CFGs optimally.
A trellis of the shape below:

  A     B
  |\   /|
  | \ / |
  |  X  |
  | / \ |
  |/   \|
  C     D

would be laid out A; B->C ; D by the current layout algorithm. Now we identify
trellises and lay them out either A->C; B->D or A->D; B->C. This scales with an
increasing number of predecessors. A trellis is a a group of 2 or more
predecessor blocks that all have the same successors.

because of this we can tail duplicate to extend existing trellises.

    [146 lines not shown]
DeltaFile
+352-26llvm/test/CodeGen/PowerPC/tail-dup-layout.ll
+293-17llvm/lib/CodeGen/MachineBlockPlacement.cpp
+19-17llvm/test/CodeGen/X86/win-alloca-expander.ll
+15-19llvm/test/CodeGen/X86/sse1.ll
+13-15llvm/test/CodeGen/X86/block-placement.ll
+7-8llvm/test/CodeGen/X86/bypass-slow-division-32.ll
+699-10218 files not shown
+747-14924 files

LLVM/project 5d8eb25llvm/lib/Target/AMDGPU VOPCInstructions.td, llvm/test/CodeGen/AMDGPU uniform-cfg.ll setcc-opt.ll

AMDGPU: Use unsigned compare for eq/ne

For some reason there are both of these available, except
for scalar 64-bit compares which only has u64. I'm not sure
why there are both (I'm guessing it's for the one bit inputs we
don't use), but for consistency always using the
unsigned one.

llvm-svn: 282832
DeltaFile
+14-16llvm/test/CodeGen/AMDGPU/uniform-cfg.ll
+12-12llvm/test/CodeGen/AMDGPU/setcc-opt.ll
+9-9llvm/test/CodeGen/AMDGPU/setcc.ll
+8-8llvm/lib/Target/AMDGPU/VOPCInstructions.td
+7-7llvm/test/CodeGen/AMDGPU/sopk-compares.ll
+6-6llvm/test/CodeGen/AMDGPU/llvm.amdgcn.div.fmas.ll
+56-5845 files not shown
+157-15951 files