LLVM/project ff4fc41llvm/lib/Target/AMDGPU SIISelLowering.cpp SIISelLowering.h, llvm/test/CodeGen/AMDGPU inreg-vgpr-spill.ll

[WIP][AMDGPU] Improve the handling of `inreg` arguments

When SGPRs available for `inreg` argument passing run out, the compiler silently
falls back to using whole VGPRs to pass those arguments. Ideally, instead of
using whole VGPRs, we should pack `inreg` arguments into individual lanes of
VGPRs.

This PR introduces `InregVGPRSpiller`, which handles this packing. It uses
`v_writelane` at the call site to place `inreg` arguments into specific VGPR
lanes, and then extracts them in the callee using `v_readlane`.

Fixes #130443 and #129071.
DeltaFile
+115-8llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+63-0llvm/test/CodeGen/AMDGPU/inreg-vgpr-spill.ll
+1-1llvm/lib/Target/AMDGPU/SIISelLowering.h
+179-93 files

LLVM/project 7f2abe8llvm/lib/Transforms/Utils Local.cpp, llvm/test/Transforms/SimplifyCFG merge-direct-call-branch-weights-preserve-sink.ll merge-direct-call-branch-weights-preserve-hoist.ll

Revert "[Metadata] Preserve MD_prof when merging instructions when one is missing." (#134200)

Reverts llvm/llvm-project#132433

I suspect this change caused a failure in the bolt build bot.
https://lab.llvm.org/buildbot/#/builders/113/builds/6621

```
!9185 = !{!"branch_weights", i32 3912, i32 802}
Wrong number of operands
!9185 = !{!"branch_weights", i32 3912, i32 802}
fatal error: error in backend: Broken module found, compilation aborted!
```
DeltaFile
+0-63llvm/test/Transforms/SimplifyCFG/merge-direct-call-branch-weights-preserve-sink.ll
+0-62llvm/test/Transforms/SimplifyCFG/merge-direct-call-branch-weights-preserve-hoist.ll
+6-13llvm/lib/Transforms/Utils/Local.cpp
+6-1383 files

LLVM/project f404826llvm/lib/Target/RISCV/AsmParser RISCVAsmParser.cpp, llvm/test/MC/RISCV rv64zcmp-invalid.s

[RISCV] Don't allow '-' after 'ra' in Zcmp/Xqccmp register list. (#134182)

Move the parsing of '-' under the check that we parsed a comma.
Unfortunately, this leads to a poor error, but I still have more known
issues in this code and may end up with an overall restructuring and
want to think about wording.
DeltaFile
+36-36llvm/lib/Target/RISCV/AsmParser/RISCVAsmParser.cpp
+3-0llvm/test/MC/RISCV/rv64zcmp-invalid.s
+39-362 files

LLVM/project 3ea7902llvm/lib/Target/RISCV/AsmParser RISCVAsmParser.cpp, llvm/test/MC/RISCV rv32xqccmp-invalid.s

[RISCV] Check S0 register list check for qc.cm.pushfp to after we parsed the whole register list. (#134180)

This is more of a semantic check. The diagnostic location to has been
changed to point at the register list start instead of the
closing brace or whatever character might be there instead of a brace
if its malformed.
DeltaFile
+5-15llvm/lib/Target/RISCV/AsmParser/RISCVAsmParser.cpp
+1-1llvm/test/MC/RISCV/rv32xqccmp-invalid.s
+6-162 files

LLVM/project 4998273llvm/lib/Target/RISCV RISCVInstrInfoXqci.td RISCVInstrInfo.td, llvm/lib/Target/RISCV/AsmParser RISCVAsmParser.cpp

Reland [RISCV] Add Xqci Insn Formats (#134134)

This adds the following instruction formats from the Xqci Spec:
- QC.EAI
- QC.EI
- QC.EB
- QC.EJ
- QC.ES

The update to the THead test is because the largest number of operands
for a valid instruction has been bumped by this change.

This reverts commit 68fb7a5a1d203dde7badf67031bdd9eb650eef5d. This
relands commit 0cfabd37df9940346f3bf8a4d74c19e1f48a00e9.
DeltaFile
+215-0llvm/lib/Target/RISCV/RISCVInstrInfoXqci.td
+111-0llvm/test/MC/RISCV/insn_xqci-invalid.s
+28-17llvm/lib/Target/RISCV/RISCVInstrInfo.td
+41-0llvm/test/MC/RISCV/insn_xqci.s
+10-3llvm/lib/Target/RISCV/AsmParser/RISCVAsmParser.cpp
+7-1llvm/lib/Target/RISCV/MCTargetDesc/RISCVBaseInfo.h
+412-213 files not shown
+420-299 files

LLVM/project 84a0613compiler-rt/lib/builtins xxhash.h, compiler-rt/lib/builtins/aarch64 emupac.cpp emupac.c

Switch to SipHash

Created using spr 1.3.6-beta.1
DeltaFile
+0-7,343compiler-rt/lib/builtins/xxhash.h
+161-0third-party/siphash/include/siphash/SipHash.h
+132-0compiler-rt/lib/builtins/aarch64/emupac.cpp
+1-130llvm/lib/Support/SipHash.cpp
+0-115compiler-rt/lib/builtins/aarch64/emupac.c
+5-0llvm/lib/Support/CMakeLists.txt
+299-7,5883 files not shown
+304-7,5909 files

LLVM/project f2e0ad9llvm/lib/Support SipHash.cpp CMakeLists.txt, llvm/utils/gn/secondary/llvm/lib/Support BUILD.gn

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+161-0third-party/siphash/include/siphash/SipHash.h
+1-130llvm/lib/Support/SipHash.cpp
+5-0llvm/lib/Support/CMakeLists.txt
+1-0llvm/utils/gn/secondary/llvm/lib/Support/BUILD.gn
+168-1304 files

LLVM/project 7d7b3bellvm/lib/Support SipHash.cpp, llvm/test/CodeGen/AArch64 ptrauth-intrinsic-auth-resign.ll ptrauth-fpac.ll

[𝘀𝗽𝗿] initial version

Created using spr 1.3.6-beta.1
DeltaFile
+138-118llvm/test/CodeGen/AArch64/ptrauth-intrinsic-auth-resign.ll
+161-0third-party/siphash/include/siphash/SipHash.h
+1-130llvm/lib/Support/SipHash.cpp
+47-34llvm/test/CodeGen/AArch64/ptrauth-fpac.ll
+37-16llvm/test/CodeGen/AArch64/ptrauth-call.ll
+27-21llvm/test/CodeGen/AArch64/ptrauth-intrinsic-auth-resign-with-blend.ll
+411-3199 files not shown
+475-34215 files

LLVM/project 3ef9bf0llvm/lib/Target/AArch64 AArch64InstrInfo.td AArch64Subtarget.cpp, llvm/test/CodeGen/AArch64 ptrauth-intrinsic-auth-resign.ll ptrauth-fpac.ll

[𝘀𝗽𝗿] changes to main this commit is based on

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+138-118llvm/test/CodeGen/AArch64/ptrauth-intrinsic-auth-resign.ll
+47-34llvm/test/CodeGen/AArch64/ptrauth-fpac.ll
+37-16llvm/test/CodeGen/AArch64/ptrauth-call.ll
+27-21llvm/test/CodeGen/AArch64/ptrauth-intrinsic-auth-resign-with-blend.ll
+18-7llvm/lib/Target/AArch64/AArch64InstrInfo.td
+23-0llvm/lib/Target/AArch64/AArch64Subtarget.cpp
+290-1965 files not shown
+307-21211 files

LLVM/project b8d8405lldb/include/lldb/API SBModule.h, lldb/source/API SBModule.cpp

[LLDB] Expose checking if the symbol file exists/is loaded via SBModule (#134163)

The motivation for this patch is that in Statistics.cpp we [check to see
if the module symfile is
loaded](https://github.com/llvm/llvm-project/blob/990a086d9da0bc2fd53a6a4c95ecbbe23a297a83/lldb/source/Target/Statistics.cpp#L353C60-L353C75)
to calculate how much debug info has been loaded. I have an external
utility that only wants to look at the loaded debug info, which isn't
exposed by the SBAPI.
DeltaFile
+12-0lldb/source/API/SBModule.cpp
+3-0lldb/include/lldb/API/SBModule.h
+15-02 files

LLVM/project e3c0565clang/unittests CMakeLists.txt, clang/unittests/AST CMakeLists.txt

Reapply "[cmake] Refactor clang unittest cmake" (#134195)

This reapplies 5ffd9bdb50b57 (#133545) with fixes.

The BUILD_SHARED_LIBS=ON build was fixed by adding missing LLVM
dependencies to the InterpTests binary in
unittests/AST/ByteCode/CMakeLists.txt .
DeltaFile
+28-4clang/unittests/CMakeLists.txt
+11-17clang/unittests/Tooling/CMakeLists.txt
+12-13clang/unittests/Interpreter/CMakeLists.txt
+11-13clang/unittests/Interpreter/ExceptionTests/CMakeLists.txt
+7-16clang/unittests/AST/CMakeLists.txt
+7-15clang/unittests/ASTMatchers/CMakeLists.txt
+76-7822 files not shown
+168-29128 files

LLVM/project 3140d51llvm/test/tools/llvm-reduce bitcode-uselistorder.ll

llvm-reduce: Remove unsupported from bitcode uselistorder test (#134185)

This was disabled due to flakiness but I'm currently unable to
reproduce.

I'm nervous the original issue still exists. However, I downgraded the
tripped
assert in 8c18c25b1b22ea710edb40a4f167a6a8bfe6ff9d to a warning since
the same
assert can trigger for illegitimate reasons.

Fixes #64157
DeltaFile
+0-3llvm/test/tools/llvm-reduce/bitcode-uselistorder.ll
+0-31 files

LLVM/project 4e40c7clldb/tools/debugserver/source/MacOSX/arm64 DNBArchImplARM64.cpp

[lldb][debugserver] Save and restore the SVE/SME register state (#134184)

debugserver isn't saving and restoring the SVE/SME register state around
inferior function calls.

Making arbitrary function calls while in Streaming SVE mode is generally
a poor idea because a NEON instruction can be hit and crash the
expression execution, which is how I missed this, but they should be
handled correctly if the user knows it is safe to do.

rdar://146886210
DeltaFile
+40-7lldb/tools/debugserver/source/MacOSX/arm64/DNBArchImplARM64.cpp
+40-71 files

LLVM/project 6a46c6cllvm/lib/Analysis ValueTracking.cpp, llvm/lib/CodeGen/SelectionDAG SelectionDAG.cpp

Ensure KnownBits passed when calculating from range md has right size (#132985)

KnownBits passed to computeKnownBitsFromRangeMetadata must have the same
bit width as the range metadata bit width. Otherwise the calculated
results will be incorrect.

---------

Signed-off-by: John Lu <John.Lu at amd.com>
DeltaFile
+6-0llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
+4-0llvm/lib/Analysis/ValueTracking.cpp
+10-02 files

LLVM/project dcc2182clang/lib/Sema SemaConcept.cpp SemaLambda.cpp, clang/test/SemaTemplate concepts-lambda.cpp

[Clang] Fix a lambda pattern comparison mismatch after ecc7e6ce4 (#133863)

In ecc7e6ce4, we tried to inspect the `LambdaScopeInfo` on stack to
recover the instantiating lambda captures. However, there was a mismatch
in how we compared the pattern declarations of lambdas: the constraint
instantiation used a tailored `getPatternFunctionDecl()` which is
localized in SemaLambda that finds the very primal template declaration
of a lambda, while `FunctionDecl::getTemplateInstantiationPattern` finds
the latest template pattern of a lambda. This difference causes issues
when lambdas are nested, as we always want the primary template
declaration.

This corrects that by moving `Sema::addInstantiatedCapturesToScope` from
SemaConcept to SemaLambda, allowing it to use the localized version of
`getPatternFunctionDecl`.

It is also worth exploring to coalesce the implementation of
`getPatternFunctionDecl` with
`FunctionDecl::getTemplateInstantiationPattern`. But I’m leaving that

    [4 lines not shown]
DeltaFile
+0-69clang/lib/Sema/SemaConcept.cpp
+68-0clang/lib/Sema/SemaLambda.cpp
+15-0clang/test/SemaTemplate/concepts-lambda.cpp
+83-693 files

LLVM/project 4986a79llvm/test/TableGen MacroFusion.td, llvm/utils/TableGen/Common PredicateExpander.cpp

[TableGen] Emit `llvm::is_contained` for `CheckOpcode` predicate (#134057)

When the list is large, using `llvm::is_contained` is of higher
performance than a sequence of comparisons. When the list is small,
the `llvm::is_contained` can be inlined and unrolled, which has the
same effect as using a sequence of comparisons.

And the generated code is more readable.
DeltaFile
+9-14llvm/utils/TableGen/Common/PredicateExpander.cpp
+2-2llvm/test/TableGen/MacroFusion.td
+11-162 files

LLVM/project 4fe0d74clang/lib/Format UnwrappedLineParser.cpp, clang/unittests/Format TokenAnnotatorTest.cpp

[clang-format] Fix a bug in annotating braces (#134039)

Fix #133873
DeltaFile
+5-1clang/lib/Format/UnwrappedLineParser.cpp
+5-0clang/unittests/Format/TokenAnnotatorTest.cpp
+10-12 files

LLVM/project 94dbe5emlir/include/mlir/Dialect/Tosa/IR TosaOps.td

[mlir][tosa] Remove extra whitespace in the PadOp example (#134113)

Trivial cleanup change.

Signed-off-by: Jerry Ge <jerry.ge at arm.com>
DeltaFile
+2-2mlir/include/mlir/Dialect/Tosa/IR/TosaOps.td
+2-21 files

LLVM/project f9b3bfallvm/include/llvm/Transforms/Utils FunctionImportUtils.h, llvm/lib/Transforms/IPO FunctionImport.cpp

[ctxprof] Support for "move" semantics for the contextual root
DeltaFile
+28-1llvm/lib/Transforms/Utils/FunctionImportUtils.cpp
+5-20llvm/include/llvm/Transforms/Utils/FunctionImportUtils.h
+19-3llvm/test/ThinLTO/X86/ctxprof-separate-module.ll
+18-0llvm/lib/Transforms/IPO/FunctionImport.cpp
+70-244 files

LLVM/project aab3649llvm/include/llvm/Transforms/Utils FunctionImportUtils.h, llvm/lib/Transforms/IPO FunctionImport.cpp

[ctxprof] Support for "move" semantics for the contextual root
DeltaFile
+28-1llvm/lib/Transforms/Utils/FunctionImportUtils.cpp
+5-20llvm/include/llvm/Transforms/Utils/FunctionImportUtils.h
+19-3llvm/test/ThinLTO/X86/ctxprof-separate-module.ll
+17-0llvm/lib/Transforms/IPO/FunctionImport.cpp
+69-244 files

LLVM/project 18c43d0lldb/test/Shell/DAP TestHelp.test TestOptions.test, lldb/tools/lldb-dap lldb-dap.cpp Options.td

[lldb-dap] Add a -v/--version command line argument (#134114)

Add a -v/--version command line argument to print the version of both
the lldb-dap binary and the liblldb it's linked against.

This is motivated by me trying to figure out which lldb-dap I had in my
PATH.
DeltaFile
+12-0lldb/tools/lldb-dap/lldb-dap.cpp
+8-0lldb/test/Shell/DAP/TestHelp.test
+0-8lldb/test/Shell/DAP/TestOptions.test
+6-0lldb/tools/lldb-dap/Options.td
+3-0lldb/test/Shell/DAP/TestVersion.test
+29-85 files

LLVM/project 357d66allvm/lib/Transforms/IPO FunctionImport.cpp, llvm/test/ThinLTO/X86 ctxprof-separate-module.ll

[ctxprof] Don't import roots elsewhere
DeltaFile
+21-3llvm/test/ThinLTO/X86/ctxprof-separate-module.ll
+17-0llvm/lib/Transforms/IPO/FunctionImport.cpp
+38-32 files

LLVM/project ff0c2fbllvm/test/CodeGen/LoongArch/lasx/ir-instruction absd.ll, llvm/test/CodeGen/LoongArch/lsx/ir-instruction absd.ll

[LoongArch] Pre-commit tests for vector absolute difference (#132898)

DeltaFile
+557-0llvm/test/CodeGen/LoongArch/lsx/ir-instruction/absd.ll
+557-0llvm/test/CodeGen/LoongArch/lasx/ir-instruction/absd.ll
+1,114-02 files

LLVM/project d59b2c4llvm/lib/Transforms/IPO FunctionImport.cpp

[ctxprof][nfc] Make `computeImportForFunction` a member of `ModuleImportsManager` (#134011)

DeltaFile
+14-12llvm/lib/Transforms/IPO/FunctionImport.cpp
+14-121 files

LLVM/project 7a534bfllvm/lib/Transforms/IPO FunctionImport.cpp

[ctxprof][nfc] Make `computeImportForFunction` a member of `ModuleImportsManager`
DeltaFile
+14-12llvm/lib/Transforms/IPO/FunctionImport.cpp
+14-121 files

LLVM/project 02467f9llvm/lib/Transforms/IPO FunctionImport.cpp, llvm/test/ThinLTO/X86 ctxprof-separate-module.ll

[ctxprof] Option to move a whole tree to its own module (#133992)

Modules may contain a mix of functions that participate or don't participate in callgraphs covered by a contextual profile. We currently have been importing all the functions under a context root in the module defining that root, but if the other functions there are covered by flat profiles, the result is difficult to reason about.

This patch allows moving everything under a context root (and that root) in its own module. For now, we expect a module with a filename matching the GUID of the function be present in the set of modules known by the linker. This mechanism can be improved in a later patch.

Subsequent patches will handle implementing "move" instead of "import" semantics for the root function (because we want to make sure only one version of the root exists - so the optimizations we perform are actually the ones being observed at runtime).
DeltaFile
+56-0llvm/test/ThinLTO/X86/ctxprof-separate-module.ll
+27-5llvm/lib/Transforms/IPO/FunctionImport.cpp
+83-52 files

LLVM/project ccb54b3llvm/test/tools/llvm-reduce bitcode-uselistorder.ll

llvm-reduce: Remove unsupported from bitcode uselistorder test

This was disabled due to flakiness but I'm currently unable to reproduce.

I'm nervous the original issue still exists. However, I downgraded the tripped
assert in 8c18c25b1b22ea710edb40a4f167a6a8bfe6ff9d to a warning since the same
assert can trigger for illegitimate reasons.

Fixes #64157
DeltaFile
+0-3llvm/test/tools/llvm-reduce/bitcode-uselistorder.ll
+0-31 files

LLVM/project fb7135eclang/docs ReleaseNotes.rst, clang/lib/Sema SemaDeclCXX.cpp

[Clang] fixed clang frontend crash with friend class declaration and overload == (#133878)

DeltaFile
+12-0clang/test/CXX/class/class.compare/class.compare.default/p1.cpp
+1-2clang/lib/Sema/SemaDeclCXX.cpp
+1-0clang/docs/ReleaseNotes.rst
+14-23 files

LLVM/project 749c20bllvm/utils/lit/tests test-output.py test-data.py, llvm/utils/lit/tests/Inputs/test-data dummy_format.py metrics.ini

[LIT] Add a test for lit.Test.toMetricValue. NFC
DeltaFile
+8-2llvm/utils/lit/tests/Inputs/test-data/dummy_format.py
+2-1llvm/utils/lit/tests/Inputs/test-data/metrics.ini
+2-1llvm/utils/lit/tests/test-output.py
+1-0llvm/utils/lit/tests/test-data.py
+13-44 files

LLVM/project e5809f0llvm/tools CMakeLists.txt

[LLVM] Only build the GPU loader utility if it has LLVM-libc (#134141)

Summary:
There were some discussions about this being included by default. I need
to fix this up and codify the use of LLVM libc inside of LLVM. For now,
just turn it off unless the user requested the `libc` GPU stuff. This
matches the old behavior.
DeltaFile
+1-1llvm/tools/CMakeLists.txt
+1-11 files