LLVM/project 85e290bllvm/test/CodeGen/AMDGPU bf16.ll, llvm/test/CodeGen/NVPTX cmpxchg-sm90.ll cmpxchg-sm70.ll

Merge branch 'main' into users/meinersbur/flang_runtime_remove-FLANG_INCLUDE_RUNTIME
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,350-4,196llvm/test/CodeGen/AMDGPU/bf16.ll
+6,173-0llvm/test/Instrumentation/MemorySanitizer/AArch64/arm64-vmul.ll
+0-6,173llvm/test/Instrumentation/MemorySanitizer/AArch64/neon_vmul.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+90,478-10,3696,999 files not shown
+425,286-177,3787,005 files

LLVM/project 7dc7264llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'main' into users/ylzsx/r-tls-noie
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+29,745-0llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+27,655-0llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+27,655-0llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+27,139-0llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+25,933-0llvm/test/CodeGen/AMDGPU/shufflevector.v2f32.v8f32.ll
+203,722-016,100 files not shown
+2,040,184-604,85516,106 files

LLVM/project 8c41ae6lldb/tools/lldb-dap lldb-dap.cpp, llvm/test/CodeGen/AMDGPU bf16.ll

Merge branch 'main' into users/meinersbur/flang_runtime_premerge
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,194-3,880llvm/test/CodeGen/AMDGPU/bf16.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm60.ll
+309-5,049lldb/tools/lldb-dap/lldb-dap.cpp
+90,138-8,9295,013 files not shown
+307,445-100,3025,019 files

LLVM/project a97bbe6lldb/tools/lldb-dap lldb-dap.cpp, llvm/test/CodeGen/AMDGPU bf16.ll

rebase

Created using spr 1.3.5-bogner
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,194-3,880llvm/test/CodeGen/AMDGPU/bf16.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm60.ll
+309-5,049lldb/tools/lldb-dap/lldb-dap.cpp
+90,138-8,9294,327 files not shown
+283,321-85,4414,333 files

LLVM/project 519ae24llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/nest-base' into users/chapuni/mcdc/nest/trunk
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,85810,567 files not shown
+953,290-609,13410,573 files

LLVM/project 03c851allvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/bitmapaddr' into users/chapuni/mcdc/nest/nest-base

Conflicts:
        clang/test/CoverageMapping/mcdc-single-cond.cpp
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+102,453-36,85810,567 files not shown
+953,290-609,13410,573 files

LLVM/project d6cb61allvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/tests' into users/chapuni/mcdc/nest/bitmapaddr
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+102,453-36,85810,567 files not shown
+953,323-609,04210,573 files

LLVM/project 457cf4bllvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/tests' into users/chapuni/mcdc/nest/mcdcstate
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,85810,567 files not shown
+953,323-609,04210,573 files

LLVM/project aac9c6dllvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/tests' into users/chapuni/mcdc/nest/lnot
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,85810,566 files not shown
+953,289-609,12910,572 files

LLVM/project e57bdccllvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'users/chapuni/mcdc/nest/tests' into users/chapuni/mcdc/nest/expect
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,85810,566 files not shown
+953,289-609,12910,572 files

LLVM/project 9a175d0llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

Merge branch 'main' into users/chapuni/mcdc/nest/tests
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+102,453-36,85810,565 files not shown
+953,289-609,03210,571 files

LLVM/project 967dc03llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll, llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites long-spec-const-composite.ll

run 'git merge main'
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+102,453-36,8588,560 files not shown
+809,659-558,2578,566 files

LLVM/project f8b734fllvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2f16.v8f16.ll

fix order

Created using spr 1.3.4
DeltaFile
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+43,825-43,8257,407 files not shown
+681,199-531,6487,413 files

LLVM/project 41056e8llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.4

[skip ci]
DeltaFile
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+43,825-43,8257,401 files not shown
+681,165-531,6217,407 files

LLVM/project c122241lldb/tools/lldb-dap lldb-dap.cpp, llvm/test/CodeGen/AMDGPU bf16.ll atomic_optimizations_global_pointer.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.4

[skip ci]
DeltaFile
+7,194-3,880llvm/test/CodeGen/AMDGPU/bf16.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm60.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+272-5,007lldb/tools/lldb-dap/lldb-dap.cpp
+4,269-28llvm/test/CodeGen/AMDGPU/atomic_optimizations_global_pointer.ll
+28,775-8,9151,280 files not shown
+84,457-34,5731,286 files

LLVM/project a9bf1f2flang/test/Lower allocatable-assignment.f90 array-expression.f90, lldb/tools/lldb-dap lldb-dap.cpp

rebase

Created using spr 1.3.4
DeltaFile
+7,163-3,829llvm/test/CodeGen/AMDGPU/bf16.ll
+4,269-28llvm/test/CodeGen/AMDGPU/atomic_optimizations_global_pointer.ll
+249-2,316lldb/tools/lldb-dap/lldb-dap.cpp
+2,163-0llvm/test/CodeGen/X86/vector-lrint-f16.ll
+190-1,086flang/test/Lower/allocatable-assignment.f90
+0-1,265flang/test/Lower/array-expression.f90
+14,034-8,5241,008 files not shown
+53,160-26,4091,014 files

LLVM/project 0cfee98llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll

Rebase

Created using spr 1.3.5
DeltaFile
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+43,825-43,8254,283 files not shown
+513,606-420,7004,289 files

LLVM/project af42b11llvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll

rebase

Created using spr 1.3.4
DeltaFile
+7,782-7,782llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+7,427-7,427llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+7,255-7,255llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+6,967-6,967llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+43,825-43,8254,198 files not shown
+510,585-434,2364,204 files

LLVM/project 9e8d11dllvm/lib/Target/X86 X86ISelLowering.cpp, llvm/test/CodeGen/X86 v8i1-masks.ll

[X86] Check that the type is integer before calling isUnsignedIntSetCC in combineExtSetcc. (#128263)

SETULT can be an unsigned less than integer compare or a unordered less
than FP compare. We need to check the VT to distinguish them.

Fixes on of the issues from #128237.
DeltaFile
+4-8llvm/test/CodeGen/X86/v8i1-masks.ll
+3-2llvm/lib/Target/X86/X86ISelLowering.cpp
+7-102 files

LLVM/project 2811a38clang/test/OpenMP atomic_compare_codegen.cpp nvptx_SPMD_codegen.cpp, libc/src/__support ryu_long_double_constants.h

Merge commit 'dd699c1333daeaea1c50c1506a66e9c7372afbb5' into users/meinersbur/irbuilder-extract
DeltaFile
+119,926-0libc/src/__support/ryu_long_double_constants.h
+20,887-20,814llvm/lib/Support/UnicodeNameToCodepointGenerated.cpp
+34,181-35clang/test/OpenMP/atomic_compare_codegen.cpp
+12,436-12,678clang/test/OpenMP/nvptx_SPMD_codegen.cpp
+16,298-0llvm/test/CodeGen/X86/pcsections-atomics.ll
+12,455-3,587llvm/test/CodeGen/X86/vector-interleaved-store-i64-stride-8.ll
+216,183-37,11441,232 files not shown
+4,417,032-1,979,31641,238 files

LLVM/project a3e5813clang/test/CodeGen/RISCV/rvv-intrinsics vluxseg_mask_mf.c vloxseg_mask_mf.c, llvm/test/CodeGen/AArch64/Atomics aarch64-atomicrmw-v8a.ll aarch64-atomicrmw-rcpc.ll

Merge commit 'e1acf65bc1b6fbde7f0d099003c148f9b46f7b21' into users/meinersbur/irbuilder-extract
DeltaFile
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-v8a.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-rcpc.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-lse2.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-rcpc3.ll
+0-9,104clang/test/CodeGen/RISCV/rvv-intrinsics/vluxseg_mask_mf.c
+0-9,104clang/test/CodeGen/RISCV/rvv-intrinsics/vloxseg_mask_mf.c
+38,716-18,20815,675 files not shown
+2,473,889-901,26515,681 files

LLVM/project ad220d9clang/test/OpenMP target_teams_distribute_parallel_for_simd_schedule_codegen.cpp teams_distribute_parallel_for_simd_schedule_codegen.cpp, llvm/test/CodeGen/X86 large-gep-chain.ll

Merge commit 'e2d1e2183a9615c669392eefcfe632cc0b59a649' into users/meinersbur/irbuilder-extract
DeltaFile
+0-116,484llvm/test/MC/Disassembler/AMDGPU/gfx8_dasm_all.txt
+0-115,677llvm/test/MC/Disassembler/AMDGPU/gfx9_dasm_all.txt
+0-98,954llvm/test/MC/Disassembler/AMDGPU/gfx10_dasm_all.txt
+25,277-25,277llvm/test/CodeGen/X86/large-gep-chain.ll
+9,540-24,668clang/test/OpenMP/target_teams_distribute_parallel_for_simd_schedule_codegen.cpp
+9,364-24,292clang/test/OpenMP/teams_distribute_parallel_for_simd_schedule_codegen.cpp
+44,181-405,35242,672 files not shown
+3,761,432-2,721,70342,678 files

LLVM/project dd699c1clang/test/OpenMP atomic_compare_codegen.cpp nvptx_SPMD_codegen.cpp, libc/src/__support ryu_long_double_constants.h

Merge commit '6942c64e8128e4ccd891b813d0240f574f80f59e^' into users/meinersbur/irbuilder-extract-refactor
DeltaFile
+119,926-0libc/src/__support/ryu_long_double_constants.h
+20,887-20,814llvm/lib/Support/UnicodeNameToCodepointGenerated.cpp
+34,181-35clang/test/OpenMP/atomic_compare_codegen.cpp
+12,436-12,678clang/test/OpenMP/nvptx_SPMD_codegen.cpp
+16,298-0llvm/test/CodeGen/X86/pcsections-atomics.ll
+12,455-3,587llvm/test/CodeGen/X86/vector-interleaved-store-i64-stride-8.ll
+216,183-37,11441,232 files not shown
+4,417,032-1,979,31641,238 files

LLVM/project e1acf65llvm/test/CodeGen/AArch64/Atomics aarch64-atomicrmw-v8a.ll aarch64-atomicrmw-rcpc.ll, llvm/test/CodeGen/AMDGPU gfx-callable-argument-types.ll

Merge commit 'f9599bbc7a3f831e1793a549d8a7a19265f3e504^' into users/meinersbur/irbuilder-extract-refactor
DeltaFile
+8,414-8,431llvm/test/CodeGen/SystemZ/Large/branch-01.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-v8a.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-rcpc.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-lse2.ll
+9,679-0llvm/test/CodeGen/AArch64/Atomics/aarch64-atomicrmw-rcpc3.ll
+5,620-3,841llvm/test/CodeGen/AMDGPU/gfx-callable-argument-types.ll
+52,750-12,27221,723 files not shown
+2,716,037-1,117,08321,729 files

LLVM/project 2c5d1b5llvm/lib/CodeGen/SelectionDAG DAGCombiner.cpp, llvm/test/CodeGen/AMDGPU combine_andor_with_cmps.ll wave32.ll

[DAGCombiner] Reassociate the operands from (OR (OR(CMP1, CMP2)), CMP3) to (OR (OR(CMP1, CMP3)), CMP2)

This happens when CMP1 and CMP3 have the same predicate (or CMP2 and CMP3 have
the same predicate).

This helps optimizations such as the fololowing one:
CMP(A,C)||CMP(B,C) => CMP(MIN/MAX(A,B), C)
CMP(A,C)&&CMP(B,C) => CMP(MIN/MAX(A,B), C)

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D156215
DeltaFile
+18-22llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
+24-12llvm/test/CodeGen/Hexagon/isel/logical.ll
+16-16llvm/test/CodeGen/X86/v8i1-masks.ll
+24-0llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+7-7llvm/test/CodeGen/AMDGPU/wave32.ll
+89-575 files

LLVM/project 834cc88llvm/test/CodeGen/X86 recip-fastmath.ll vector-interleaved-load-i32-stride-6.ll

[X86] X86FixupVectorConstantsPass - attempt to replace full width fp vector constant loads with broadcasts on AVX+ targets (REAPPLIED)

lowerBuildVectorAsBroadcast will not broadcast splat constants in all cases, resulting in a lot of situations where a full width vector load that has failed to fold but is loading splat constant values could use a broadcast load instruction just as cheaply, and save constant pool space.

NOTE: SSE3 targets can use MOVDDUP but not all SSE era CPUs can perform this as cheaply as a vector load, we will need to add scheduler model checks if we want to pursue this.

This is an updated commit of 98061013e01207444cfd3980cde17b5e75764fbe after being reverted at a279a09ab9524d1d74ef29b34618102d4b202e2f
DeltaFile
+41-125llvm/test/CodeGen/X86/recip-fastmath.ll
+69-35llvm/test/CodeGen/X86/vector-interleaved-load-i32-stride-6.ll
+27-55llvm/test/CodeGen/X86/extractelement-load.ll
+31-43llvm/test/CodeGen/X86/vector-constrained-fp-intrinsics.ll
+38-32llvm/test/CodeGen/X86/vector-trunc-math.ll
+42-21llvm/test/CodeGen/X86/vector-trunc-usat.ll
+248-31175 files not shown
+835-85281 files

LLVM/project a279a09llvm/test/CodeGen/X86 recip-fastmath.ll vector-interleaved-load-i32-stride-6.ll

Revert rG98061013e01207444cfd3980 - [X86] X86FixupVectorConstantsPass - attempt to replace full width fp vector constant loads with broadcasts on AVX+ targets

Reverting while we address an existing issue exposed by this (Issue #63108)
DeltaFile
+125-41llvm/test/CodeGen/X86/recip-fastmath.ll
+35-69llvm/test/CodeGen/X86/vector-interleaved-load-i32-stride-6.ll
+55-27llvm/test/CodeGen/X86/extractelement-load.ll
+43-31llvm/test/CodeGen/X86/vector-constrained-fp-intrinsics.ll
+32-38llvm/test/CodeGen/X86/vector-trunc-math.ll
+21-42llvm/test/CodeGen/X86/vector-trunc-usat.ll
+311-24875 files not shown
+852-83581 files

LLVM/project 9806101llvm/test/CodeGen/X86 recip-fastmath.ll vector-interleaved-load-i32-stride-6.ll

[X86] X86FixupVectorConstantsPass - attempt to replace full width fp vector constant loads with broadcasts on AVX+ targets

lowerBuildVectorAsBroadcast will not broadcast splat constants in all cases, resulting in a lot of situations where a full width vector load that has failed to fold but is loading splat constant values could use a broadcast load instruction just as cheaply, and save constant pool space.

NOTE: SSE3 targets can use MOVDDUP but not all SSE era CPUs can perform this as cheaply as a vector load, we will need to add scheduler model checks if we want to pursue this.
DeltaFile
+41-125llvm/test/CodeGen/X86/recip-fastmath.ll
+69-35llvm/test/CodeGen/X86/vector-interleaved-load-i32-stride-6.ll
+27-55llvm/test/CodeGen/X86/extractelement-load.ll
+31-43llvm/test/CodeGen/X86/vector-constrained-fp-intrinsics.ll
+38-32llvm/test/CodeGen/X86/vector-trunc-math.ll
+42-21llvm/test/CodeGen/X86/vector-trunc-usat.ll
+248-31175 files not shown
+836-85381 files

LLVM/project c972e1cllvm/test/CodeGen/X86 v8i1-masks.ll

[X86] v8i1-masks.ll - add avx512 test coverage and use X86 check prefix instead of X32

We try to use X32 for tests on gnux32 triples
DeltaFile
+844-425llvm/test/CodeGen/X86/v8i1-masks.ll
+844-4251 files

LLVM/project 86eff6bllvm/test/CodeGen/RISCV wide-scalar-shift-legalization.ll, llvm/test/CodeGen/X86 mul-i1024.ll smulo-128-legalisation-lowering.ll

[MachineCombiner] Use default latency model when no detailed model available

This change adjusts the cost modeling used when the target does not have a schedule model with individual instruction latencies. After this change, we use the default latency information available from TargetSchedule. The default latency information essentially ends up treating most instructions as latency 1, with a few "expensive" ones getting a higher cost.

Previously, we unconditionally applied the first legal pattern - without any consideration of profitability. As a result, this change both prevents some patterns being applied, and changes which patterns are exercised. (i.e. previously the first pattern was applied, afterwards, maybe the second one is because the first wasn't profitable.)

The motivation here is two fold.

First, this brings the default behavior in line with the behavior when -mcpu or -mtune is specified. This improves test coverage, and generally makes it less likely we will have bad surprises when providing more information to the compiler.

Second, this enables some reassociation for ILP by default. Despite being unconditionally enabled, the prior code tended to "reassociate" repeatedly through an entire chain and simply moving the first operand to the end. The result was still a serial chain, just a different one. With this change, one of the intermediate transforms is unprofitable and we end up with a partially flattened tree.

Note that the resulting code diffs show significant room for improvement in the basic algorithm. I am intentionally excluding those from this patch.

For the test diffs, I don't seen any concerning regressions. I took a fairly close look at the RISCV ones, but only skimmed the x86 (particularly vector x86) changes.

Differential Revision: https://reviews.llvm.org/D141017
DeltaFile
+3,358-3,330llvm/test/CodeGen/X86/mul-i1024.ll
+711-702llvm/test/CodeGen/X86/smulo-128-legalisation-lowering.ll
+652-652llvm/test/CodeGen/RISCV/wide-scalar-shift-legalization.ll
+599-589llvm/test/CodeGen/X86/mul-i512.ll
+410-388llvm/test/CodeGen/X86/vec_smulo.ll
+264-261llvm/test/CodeGen/X86/xmulo.ll
+5,994-5,922159 files not shown
+11,750-11,689165 files