LLVM/project 552f6c9llvm/test/CodeGen/X86 vector-interleaved-store-i8-stride-7.ll vector-interleaved-store-i16-stride-6.ll, llvm/test/Instrumentation/MemorySanitizer/AArch64 arm64-ld1.ll

rebase

Created using spr 1.3.4
DeltaFile
+3,128-3,170llvm/test/CodeGen/X86/vector-interleaved-store-i8-stride-7.ll
+2,487-2,510llvm/test/CodeGen/X86/vector-interleaved-store-i16-stride-6.ll
+2,099-2,096llvm/test/MC/AMDGPU/gfx12_asm_sop2.s
+2,053-2,052llvm/test/MC/Disassembler/AMDGPU/gfx12_dasm_sop2.txt
+3,860-0llvm/test/Instrumentation/MemorySanitizer/AArch64/arm64-ld1.ll
+1,716-1,780llvm/test/CodeGen/X86/vector-interleaved-store-i16-stride-5.ll
+15,343-11,6084,566 files not shown
+159,258-104,3134,572 files

LLVM/project 85e290bllvm/test/CodeGen/AMDGPU bf16.ll, llvm/test/CodeGen/NVPTX cmpxchg-sm90.ll cmpxchg-sm70.ll

Merge branch 'main' into users/meinersbur/flang_runtime_remove-FLANG_INCLUDE_RUNTIME
DeltaFile
+65,595-0llvm/test/CodeGen/SPIRV/extensions/SPV_INTEL_long_composites/long-spec-const-composite.ll
+7,350-4,196llvm/test/CodeGen/AMDGPU/bf16.ll
+6,173-0llvm/test/Instrumentation/MemorySanitizer/AArch64/arm64-vmul.ll
+0-6,173llvm/test/Instrumentation/MemorySanitizer/AArch64/neon_vmul.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm90.ll
+5,680-0llvm/test/CodeGen/NVPTX/cmpxchg-sm70.ll
+90,478-10,3696,999 files not shown
+425,286-177,3787,005 files

LLVM/project 2c57c89llvm/test/Analysis/CostModel/AArch64 arith-widening.ll, llvm/test/CodeGen/RISCV qci-interrupt-attr.ll

Merge branch 'main' of https://github.com/llvm/llvm-project into remove-old-res-middle
DeltaFile
+2,099-2,096llvm/test/MC/AMDGPU/gfx12_asm_sop2.s
+2,053-2,052llvm/test/MC/Disassembler/AMDGPU/gfx12_dasm_sop2.txt
+1,434-1,828llvm/test/CodeGen/RISCV/rvv/fixed-vectors-cttz-vp.ll
+2,915-0llvm/test/CodeGen/RISCV/qci-interrupt-attr.ll
+1,434-1,418llvm/test/MC/AMDGPU/gfx12_asm_sop1.s
+940-1,896llvm/test/Analysis/CostModel/AArch64/arith-widening.ll
+10,875-9,2902,274 files not shown
+97,298-62,3572,280 files

LLVM/project 5a0a2f8llvm/test/CodeGen/AMDGPU llvm.amdgcn.image.load.2dmsaa.ll ps-shader-arg-count.ll, llvm/test/CodeGen/AMDGPU/GlobalISel andn2.ll orn2.ll

AMDGPU: Replace undef with poison in tests using insertvalue (#130895)

perl -p -i -e 's/insertvalue (.*) undef/insertvalue \1 poison/g'
DeltaFile
+81-81llvm/test/CodeGen/AMDGPU/llvm.amdgcn.image.load.2dmsaa.ll
+13-13llvm/test/CodeGen/AMDGPU/ps-shader-arg-count.ll
+11-11llvm/test/CodeGen/AMDGPU/GlobalISel/andn2.ll
+11-11llvm/test/CodeGen/AMDGPU/GlobalISel/orn2.ll
+8-8llvm/test/CodeGen/AMDGPU/ret.ll
+4-4llvm/test/CodeGen/AMDGPU/unigine-liveness-crash.ll
+128-12825 files not shown
+178-17631 files

LLVM/project ca45b6dllvm/test/CodeGen/AMDGPU llvm.amdgcn.image.load.2dmsaa.ll ps-shader-arg-count.ll

AMDGPU: Replace undef with poison in tests using insertvalue

perl -p -i -e 's/insertvalue (.*) undef/insertvalue \1 poison/g'
DeltaFile
+81-81llvm/test/CodeGen/AMDGPU/llvm.amdgcn.image.load.2dmsaa.ll
+13-13llvm/test/CodeGen/AMDGPU/ps-shader-arg-count.ll
+8-8llvm/test/CodeGen/AMDGPU/ret.ll
+4-4llvm/test/CodeGen/AMDGPU/skip-promote-alloca-vector-users.ll
+4-4llvm/test/CodeGen/AMDGPU/unigine-liveness-crash.ll
+4-4llvm/test/CodeGen/AMDGPU/llvm.amdgcn.raw.ptr.buffer.load.ll
+114-11419 files not shown
+146-14625 files

LLVM/project e1c6682llvm/test/CodeGen/AMDGPU llvm.amdgcn.image.load.2dmsaa.ll ps-shader-arg-count.ll

AMDGPU: Replace undef with poison in tests using insertvalue

perl -p -i -e 's/insertvalue (.*) undef/insertvalue \1 poison/g'
DeltaFile
+81-81llvm/test/CodeGen/AMDGPU/llvm.amdgcn.image.load.2dmsaa.ll
+13-13llvm/test/CodeGen/AMDGPU/ps-shader-arg-count.ll
+8-8llvm/test/CodeGen/AMDGPU/ret.ll
+4-4llvm/test/CodeGen/AMDGPU/skip-promote-alloca-vector-users.ll
+4-4llvm/test/CodeGen/AMDGPU/llvm.amdgcn.raw.ptr.buffer.load.ll
+4-4llvm/test/CodeGen/AMDGPU/unigine-liveness-crash.ll
+114-11419 files not shown
+146-14625 files

LLVM/project 2811a38clang/test/OpenMP atomic_compare_codegen.cpp nvptx_SPMD_codegen.cpp, libc/src/__support ryu_long_double_constants.h

Merge commit 'dd699c1333daeaea1c50c1506a66e9c7372afbb5' into users/meinersbur/irbuilder-extract
DeltaFile
+119,926-0libc/src/__support/ryu_long_double_constants.h
+20,887-20,814llvm/lib/Support/UnicodeNameToCodepointGenerated.cpp
+34,181-35clang/test/OpenMP/atomic_compare_codegen.cpp
+12,436-12,678clang/test/OpenMP/nvptx_SPMD_codegen.cpp
+16,298-0llvm/test/CodeGen/X86/pcsections-atomics.ll
+12,455-3,587llvm/test/CodeGen/X86/vector-interleaved-store-i64-stride-8.ll
+216,183-37,11441,232 files not shown
+4,417,032-1,979,31641,238 files

LLVM/project ad220d9clang/test/OpenMP target_teams_distribute_parallel_for_simd_schedule_codegen.cpp teams_distribute_parallel_for_simd_schedule_codegen.cpp, llvm/test/CodeGen/X86 large-gep-chain.ll

Merge commit 'e2d1e2183a9615c669392eefcfe632cc0b59a649' into users/meinersbur/irbuilder-extract
DeltaFile
+0-116,484llvm/test/MC/Disassembler/AMDGPU/gfx8_dasm_all.txt
+0-115,677llvm/test/MC/Disassembler/AMDGPU/gfx9_dasm_all.txt
+0-98,954llvm/test/MC/Disassembler/AMDGPU/gfx10_dasm_all.txt
+25,277-25,277llvm/test/CodeGen/X86/large-gep-chain.ll
+9,540-24,668clang/test/OpenMP/target_teams_distribute_parallel_for_simd_schedule_codegen.cpp
+9,364-24,292clang/test/OpenMP/teams_distribute_parallel_for_simd_schedule_codegen.cpp
+44,181-405,35242,672 files not shown
+3,761,432-2,721,70342,678 files

LLVM/project dd699c1clang/test/OpenMP atomic_compare_codegen.cpp nvptx_SPMD_codegen.cpp, libc/src/__support ryu_long_double_constants.h

Merge commit '6942c64e8128e4ccd891b813d0240f574f80f59e^' into users/meinersbur/irbuilder-extract-refactor
DeltaFile
+119,926-0libc/src/__support/ryu_long_double_constants.h
+20,887-20,814llvm/lib/Support/UnicodeNameToCodepointGenerated.cpp
+34,181-35clang/test/OpenMP/atomic_compare_codegen.cpp
+12,436-12,678clang/test/OpenMP/nvptx_SPMD_codegen.cpp
+16,298-0llvm/test/CodeGen/X86/pcsections-atomics.ll
+12,455-3,587llvm/test/CodeGen/X86/vector-interleaved-store-i64-stride-8.ll
+216,183-37,11441,232 files not shown
+4,417,032-1,979,31641,238 files

LLVM/project f2c164cllvm/test/CodeGen/AMDGPU gfx-callable-argument-types.ll vector_shuffle.packed.ll, llvm/test/CodeGen/AMDGPU/GlobalISel fshr.ll fshl.ll

[AMDGPU] Do not wait for vscnt on function entry and return

SIInsertWaitcnts inserts waitcnt instructions to resolve data
dependencies. The GFX10+ vscnt (VMEM store count) counter is never used
in this way. It is only used to resolve memory dependencies, and that is
handled by SIMemoryLegalizer. Hence there is no need to conservatively
wait for vscnt to be 0 on function entry and before returns.

Differential Revision: https://reviews.llvm.org/D153537
DeltaFile
+42-247llvm/test/CodeGen/AMDGPU/GlobalISel/fshr.ll
+0-270llvm/test/CodeGen/AMDGPU/gfx-callable-argument-types.ll
+24-166llvm/test/CodeGen/AMDGPU/GlobalISel/fshl.ll
+0-166llvm/test/CodeGen/AMDGPU/vector_shuffle.packed.ll
+45-116llvm/test/CodeGen/AMDGPU/strict_fpext.ll
+46-109llvm/test/CodeGen/AMDGPU/strict_fptrunc.ll
+157-1,074199 files not shown
+294-5,195205 files

LLVM/project 5cae881llvm/test/CodeGen/AMDGPU gfx-callable-argument-types.ll, llvm/test/CodeGen/AMDGPU/GlobalISel extractelement.ll insertelement.i8.ll

[AMDGPU] Add GFX11 test coverage

Add GFX11 test coverage to a bunch of tests where it was easy to do so,
mostly because the checks are autogenerated and/or GFX11 can share the
same checks as GFX10.

Differential Revision: https://reviews.llvm.org/D129295
DeltaFile
+3,218-0llvm/test/CodeGen/AMDGPU/gfx-callable-argument-types.ll
+1,473-831llvm/test/CodeGen/AMDGPU/GlobalISel/extractelement.ll
+1,809-0llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.i8.ll
+1,668-0llvm/test/CodeGen/AMDGPU/GlobalISel/fshl.ll
+1,619-0llvm/test/CodeGen/AMDGPU/GlobalISel/fshr.ll
+1,217-0llvm/test/CodeGen/AMDGPU/GlobalISel/extractelement.i8.ll
+11,004-831162 files not shown
+36,817-3,435168 files

LLVM/project caf1294llvm/lib/Target/AMDGPU GCNRegBankReassign.cpp, llvm/test/CodeGen/AMDGPU regbank-reassign.mir

[AMDGPU] Experiments show that the GCNRegBankReassign pass significantly impacts
the compilation time and there is no case for which we see any improvement in
performance. This patch removes this pass and its associated test cases from
the tree.

Differential Revision: https://reviews.llvm.org/D101313

Change-Id: I0599169a7609c19a887f8d847a71e664030cc141
DeltaFile
+0-900llvm/lib/Target/AMDGPU/GCNRegBankReassign.cpp
+0-611llvm/test/CodeGen/AMDGPU/regbank-reassign.mir
+218-218llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.i8.ll
+191-231llvm/test/CodeGen/AMDGPU/GlobalISel/fdiv.f64.ll
+160-166llvm/test/CodeGen/AMDGPU/GlobalISel/fshl.ll
+158-164llvm/test/CodeGen/AMDGPU/GlobalISel/fshr.ll
+727-2,29072 files not shown
+2,160-3,99278 files

LLVM/project b082e6fllvm/test/CodeGen/AMDGPU frem.ll, llvm/test/CodeGen/AMDGPU/GlobalISel legalize-load-local.mir insertelement.i8.ll

[AMDGPU] Extend gfx10 test coverage. NFC.

Differential Revision: https://reviews.llvm.org/D99267
DeltaFile
+4,346-0llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-load-local.mir
+1,678-0llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.i8.ll
+1,583-0llvm/test/CodeGen/AMDGPU/GlobalISel/extractelement.ll
+1,185-0llvm/test/CodeGen/AMDGPU/GlobalISel/extractelement.i8.ll
+960-0llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.i16.ll
+880-0llvm/test/CodeGen/AMDGPU/frem.ll
+10,632-0177 files not shown
+24,036-233183 files

LLVM/project 54818b3clang/test/AST ast-dump-expr-json.c, llvm/include/llvm/IR IntrinsicsHexagon.td

[DPWBS-1200] Merge community 'master' into HighTec 'htc/master'

Due to interface changes there were a few necessary changes to TriCore
specific files. The following table lists the changed files and the
corresponding community master commits:

TriCoreInstrInfo.h/.cpp:        e6c9a9af398baf40537d45498e0aaf417c1306dc
TriCoreInstPrinter.h/.cpp:      aa708763d30384c0da0b0779be96ba45f65773df
TriCoreRegisterBankInfo.h/.cpp: 21309eafdebaa0041a83a026ae011e305b2f52a0
TriCoreMCInstLower.cpp:         a transitive include of TargetMachine.h
                                was removed somewhere and had to be
                                included here
TriCoreDisassembler.cpp:        6fdd6a7b3f696972edc244488f59532d05136a27
TriCoreTargetMachine.cpp:       05da2fe52162c80dfa18aedf70cf73cb11201811
TriCore/*/CMakeLists.txt:       ab411801b82783eb7f652701ccfce81b16cf1811
legalize-phi.mir:               6da7dbb806dce9fbc05416482a5b895efdea96b0
DeltaFile
+12,682-4,910llvm/test/MC/AMDGPU/gfx10_asm_all.s
+5,448-5,431clang/test/AST/ast-dump-expr-json.c
+6,048-3,589llvm/lib/Target/Hexagon/HexagonDepIICScalar.td
+3,729-3,042llvm/lib/Target/Hexagon/HexagonDepInstrFormats.td
+3,173-3,161llvm/lib/Target/Hexagon/HexagonDepMapAsm2Intrin.td
+50-6,126llvm/include/llvm/IR/IntrinsicsHexagon.td
+31,130-26,25913,033 files not shown
+920,816-262,06913,039 files

LLVM/project 4d7201ellvm/lib/CodeGen/SelectionDAG DAGCombiner.cpp SelectionDAG.cpp, llvm/test/CodeGen/AArch64 arm64-fp.ll

DAG: Stop trying to fold FP -(x-y) -> y-x in getNode with nsz

This was increasing the number of instructions when fsub was legalized
on AMDGPU with no signed zeros enabled. This fold should be guarded by
hasOneUse, and I don't think getNode should be doing that. The same
fold is already done as a regular combine through isNegatibleForFree.

This does require duplicating, even though isNegatibleForFree does
this combine already (and properly checks hasOneUse) to avoid one PPC
regression. In the regression, the outer fneg has nsz but the fsub
operand does not. isNegatibleForFree only sees the operand, and
doesn't see it's used from a nsz context. A nsz parameter needs to be
added and threaded through isNegatibleForFree to avoid this.
DeltaFile
+4-7llvm/test/CodeGen/AMDGPU/fneg-fold-legalize-dag-increase-insts.ll
+10-0llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+0-5llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
+2-3llvm/test/CodeGen/AArch64/arm64-fp.ll
+16-154 files

LLVM/project 64cf265llvm/test/CodeGen/AMDGPU fneg-fold-legalize-dag-increase-insts.ll

AMDGPU: Precommit test showing extra instructions are introduced
DeltaFile
+27-0llvm/test/CodeGen/AMDGPU/fneg-fold-legalize-dag-increase-insts.ll
+27-01 files