LLVM / project - FreshBSD

LLVM/project f958a73 — llvm/test/Transforms/InstCombine call-guard.ll fast-math.ll

2024-05-09 03:43:00 UTC by Nikita Popov on ⎇

main

[InstCombine] Fix name clashes in check lines (NFC)

These used both lower and upper case variants of the same name,
resulting in malformed check lines when regenerated.

Delta		File
+8	-8	llvm/test/Transforms/InstCombine/call-guard.ll
+4	-4	llvm/test/Transforms/InstCombine/fast-math.ll
+12	-12	2 files

LLVM/project 0d335f7 — llvm/lib/Transforms/InstCombine InstCombineAddSub.cpp, llvm/test/Transforms/InstCombine add.ll

2024-05-09 03:35:16 UTC by Nikita Popov on ⎇

main

[InstCombine] Handle more commuted cases in matchesSquareSum()

Delta		File
+8	-21	llvm/test/Transforms/InstCombine/add.ll
+10	-10	llvm/lib/Transforms/InstCombine/InstCombineAddSub.cpp
+18	-31	2 files

LLVM/project a39a382 — llvm/test/Transforms/InstCombine add.ll

2024-05-09 03:29:01 UTC by Nikita Popov on ⎇

main

[InstCombine] Thwart complexity-based canonicalization (NFC)

These tests did not test what they were supposed to. The transform
fails to actually handle the commuted cases.

Delta		File
+40	-19	llvm/test/Transforms/InstCombine/add.ll
+40	-19	1 files

LLVM/project bce9393 — llvm/lib/Target/AMDGPU SOPInstructions.td, llvm/test/CodeGen/AMDGPU llvm.amdgcn.s.wait.event.ll

2024-05-09 03:17:31 UTC by Jay Foad via Tom Stellard on ⎇

release/18.x

[AMDGPU] Fix GFX12 encoding of s_wait_event export_ready (#89622)

As well as flipping the sense of the bit, GFX12 moved it from bit 0 to
bit 1 in the encoded simm16 operand.

(cherry picked from commit e0a763c490d8ef58dca867e0ef834978ccf8e17d)

Delta		File
+3	-7	llvm/test/CodeGen/AMDGPU/llvm.amdgcn.s.wait.event.ll
+1	-1	llvm/lib/Target/AMDGPU/SOPInstructions.td
+4	-8	2 files

LLVM/project f5f572f — llvm/include/llvm/CodeGen MachineFrameInfo.h, llvm/lib/CodeGen/SelectionDAG SelectionDAGBuilder.cpp

2024-05-09 03:16:03 UTC by Björn Pettersson via Tom Stellard on ⎇

release/18.x

[SelectionDAG] Mark frame index as "aliased" at argument copy elison (#89712)

This is a fix for miscompiles reported in
  https://github.com/llvm/llvm-project/issues/89060

After argument copy elison the IR value for the eliminated alloca
is aliasing with the fixed stack object. This patch is making sure
that we mark the fixed stack object as being aliased with IR values
to avoid that for example schedulers are reordering accesses to
the fixed stack object. This could otherwise happen when there is a
mix of MemOperands refering the shared fixed stack slow via both
the IR value for the elided alloca, and via a fixed stack pseudo
source value (as would be the case when lowering the arguments).

(cherry picked from commit d8b253be56b3e9073b3e59123cf2da0bcde20c63)

Delta		File
+39	-0	llvm/test/CodeGen/Hexagon/arg-copy-elison.ll
+7	-0	llvm/include/llvm/CodeGen/MachineFrameInfo.h
+2	-1	llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
+48	-1	3 files

LLVM/project dfc89f8 — llvm/lib/Target/X86 X86ISelLowering.cpp, llvm/test/CodeGen/X86 pr91005.ll

2024-05-09 03:14:03 UTC by Phoebe Wang via Tom Stellard on ⎇

release/18.x

[X86][FP16] Do not create VBROADCAST_LOAD for f16 without AVX2 (#91125)

AVX doesn't provide 16-bit BROADCAST instruction.

Fixes #91005

Delta		File
+40	-0	llvm/test/CodeGen/X86/pr91005.ll
+1	-1	llvm/lib/Target/X86/X86ISelLowering.cpp
+41	-1	2 files

LLVM/project 047cd91 — llvm/lib/Target/X86 X86InstrAVX512.td X86ISelLowering.cpp, llvm/test/CodeGen/X86 pr90844.ll

2024-05-09 03:10:38 UTC by Phoebe Wang via Tom Stellard on ⎇

release/18.x

[X86][EVEX512] Add `HasEVEX512` when `NoVLX` used for 512-bit patterns (#91106)

With KNL/KNC being deprecated, we don't need to care about such no VLX
cases anymore. We may remove such patterns in the future.

Fixes #90844

(cherry picked from commit 7963d9a2b3c20561278a85b19e156e013231342c)

Delta		File
+21	-21	llvm/lib/Target/X86/X86InstrAVX512.td
+19	-0	llvm/test/CodeGen/X86/pr90844.ll
+3	-1	llvm/lib/Target/X86/X86ISelLowering.cpp
+43	-22	3 files

LLVM/project 58e44d3 — llvm/lib/Target/AMDGPU SIInstrInfo.h SIInsertWaitcnts.cpp, llvm/test/CodeGen/AMDGPU llvm.amdgcn.s.barrier.wait.ll llvm.amdgcn.s.barrier.ll

2024-05-09 03:08:59 UTC by David Stuttard via Tom Stellard on ⎇

release/18.x

[AMDGPU] Enhance s_waitcnt insertion before barrier for gfx12 (#90595)

Code to determine if a waitcnt is required before a barrier instruction
only
considered S_BARRIER.
gfx12 adds barrier_signal/wait so need to enhance the existing code to
look for
a barrier start (which is just an S_BARRIER for earlier architectures).

Delta		File
+22	-0	llvm/test/CodeGen/AMDGPU/llvm.amdgcn.s.barrier.wait.ll
+11	-0	llvm/lib/Target/AMDGPU/SIInstrInfo.h
+1	-1	llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
+2	-0	llvm/test/CodeGen/AMDGPU/llvm.amdgcn.s.barrier.ll
+36	-1	4 files

LLVM/project d1d7131 — .github/workflows release-binaries.yml set-release-binary-outputs.sh

2024-05-09 02:47:50 UTC by Tom Stellard on ⎇

release/18.x

[Workflows] Re-write release-binaries workflow (#89521)

This updates the release-binaries workflow so that the different build
stages are split across multiple jobs. This saves money by reducing the
time spent on the larger github runners and also makes it easier to
debug, because now it's possible to build a smaller release package
(with clang and lld) using only the free GitHub runners.

The workflow no longer uses the test-release.sh script but instead uses
the Release.cmake cache. This gives the workflow more flexibility and
ensures that the binary package will always be created even if the tests
fail.

This idea to split the stages comes from the "LLVM Precommit CI through
Github Actions" RFC:

https://discourse.llvm.org/t/rfc-llvm-precommit-ci-through-github-actions/76456
(cherry picked from commit abac98479b81cc0cc717bb6cdbae6f774e3b0232)

Delta		File
+190	-73	.github/workflows/release-binaries.yml
+0	-7	.github/workflows/set-release-binary-outputs.sh
+190	-80	2 files

LLVM/project f2c5a10 — clang/cmake/caches Release.cmake

2024-05-09 02:47:50 UTC by Tom Stellard on ⎇

release/18.x

[CMake][Release] Add stage2-package target (#89517)

This target will be used to generate the release binary package for
uploading to GitHub.

(cherry picked from commit a38f201f1ec70c2b1f3cf46e7f291c53bb16753e)

Delta		File
+2	-0	clang/cmake/caches/Release.cmake
+2	-0	1 files

LLVM/project 211cdc6 — .github/workflows release-binaries.yml

2024-05-09 02:47:50 UTC by Tom Stellard on ⎇

release/18.x

workflows: Fix incorrect input name in release-binaries.yml (#84604)

In aa02002491333c42060373bc84f1ff5d2c76b4ce the input name was changed
from tag to release-version, but the code was never updated.

(cherry picked from commit 8d220d109d28dac352c563ab062fb72132b7eca1)

Delta		File
+2	-2	.github/workflows/release-binaries.yml
+2	-2	1 files

LLVM/project d9661e1 — .github/workflows release-binaries.yml

2024-05-09 02:47:50 UTC by Aiden Grossman via Tom Stellard on ⎇

release/18.x

[Github] Add repository checks to release-binaries workflow (#84437)

This patch adds repository checks to the release-binaries workflow jobs.
People were observing that the job was running on a schedule in their
forks. This only happens on old forks, but those probably exist in great
number given how prolific LLVM is. This is also good practice anyways,
on top of solving the direct problem of these jobs running with the cron
schedule on people's forks.

(cherry picked from commit 9f5be5f0092a636274953389cd5771c45ac0a568)

Delta		File
+3	-0	.github/workflows/release-binaries.yml
+3	-0	1 files

LLVM/project b7e2397 — clang/cmake/caches Release.cmake, llvm/utils/release test-release.sh

2024-05-09 02:47:50 UTC by Tom Stellard on ⎇

release/18.x

[CMake][Release] Enable CMAKE_POSITION_INDEPENDENT_CODE (#90139)

Set this in the cache file directly instead of via the test-release.sh
script so that the release builds can be reproduced with just the cache
file.

(cherry picked from commit 53ff002c6f7ec64a75ab0990b1314cc6b4bb67cf)

Delta		File
+1	-2	llvm/utils/release/test-release.sh
+1	-0	clang/cmake/caches/Release.cmake
+2	-2	2 files

LLVM/project ce88e86 — clang/cmake/caches Release.cmake

2024-05-09 02:47:50 UTC by Tom Stellard on ⎇

release/18.x

[CMake][Release] Refactor cache file and use two stages for non-PGO builds (#89812)

Completely refactor the cache file to simplify it and remove unnecessary
variables. The main functional change here is that the non-PGO builds
now use two stages, so `ninja -C build stage2-package` can be used with
both PGO and non-PGO builds.

(cherry picked from commit 6473fbf2d68c8486d168f29afc35d3e8a6fabe69)

Delta		File
+71	-73	clang/cmake/caches/Release.cmake
+71	-73	1 files

LLVM/project 0ec1bc4 — .github/workflows release-binaries.yml

2024-05-09 02:47:50 UTC by Tom Stellard on ⎇

release/18.x

workflows: Fixes for building the release binaries (#83694)

Since aa02002491333c42060373bc84f1ff5d2c76b4ce we weren't installing the
correct dependencies, and since 2836d8edbfbcd461b25101ed58f93c862d65903a
we must pass a custom token to github-upload-release.py for verifying
permissions.

(cherry picked from commit 51207756b0692f325cf75560185cf0336239b3e0)

Delta		File
+6	-1	.github/workflows/release-binaries.yml
+6	-1	1 files

LLVM/project dd3aa6d — llvm CMakeLists.txt, llvm/utils/lit/lit init.py

2024-05-09 02:41:30 UTC by Tom Stellard via GitHub on ⎇

release/18.x

Bump version to 18.1.6 (#91094)

Delta		File
+1	-1	llvm/CMakeLists.txt
+1	-1	llvm/utils/lit/lit/__init__.py
+2	-2	2 files

LLVM/project b910beb — llvm/include/llvm/Object MachO.h, llvm/lib/Object MachOObjectFile.cpp

2024-05-09 01:53:15 UTC by Zixu Wang via GitHub on ⎇

main

[llvm][MachO] Fix integer truncation in rebase/bind parsing (#89337)

`Count` and `Skip` should use `uint64_t` as they are encoded/decoded
using 64-bit ULEB128.

In `*_OPCODE_DO_*_ULEB_TIMES_SKIPPING_ULEB`, `Skip` could be encoded as
a two's complement for moving `SegmentOffset` backwards. Having a 32-bit
`Skip` truncates the encoded value and leads to a malformed
`AdvanceAmount`
and invalid `SegmentOffset` that extends past valid sections.

Delta		File
+499	-0	llvm/test/Object/Inputs/MachO/bind-negative-skip.yaml
+10	-10	llvm/lib/Object/MachOObjectFile.cpp
+17	-0	llvm/test/Object/macho-bind-negative-skip.test
+8	-7	llvm/include/llvm/Object/MachO.h
+534	-17	4 files

LLVM/project ea126ae — llvm/lib/Target/PowerPC PPCISelLowering.cpp PPCAsmPrinter.cpp, llvm/test/CodeGen/PowerPC aix-shared-lib-tls-model-opt.ll aix-shared-lib-tls-model-opt-small-local-dynamic-tls.ll

2024-05-09 01:50:36 UTC by Felix (Ting Wang) via GitHub on ⎇

main

[PowerPC] Tune AIX shared library TLS model at function level (#84132)

Under some circumstance (library loaded with the main program), TLS
initial-exec model can be applied to local-dynamic access(es). We
could use some simple heuristic to decide the update at function level:
* If there is equal or less than a number of TLS local-dynamic access(es)
in the function, use TLS initial-exec model. (the threshold which default to
1 is controlled by hidden option)

Delta		File
+627	-0	llvm/test/CodeGen/PowerPC/aix-shared-lib-tls-model-opt.ll
+74	-0	llvm/test/CodeGen/PowerPC/aix-shared-lib-tls-model-opt-small-local-dynamic-tls.ll
+58	-0	llvm/lib/Target/PowerPC/PPCISelLowering.cpp
+22	-0	llvm/test/CodeGen/PowerPC/check-aix-shared-lib-tls-model-opt-Option.ll
+21	-0	llvm/test/CodeGen/PowerPC/check-aix-shared-lib-tls-model-opt-IRattribute.ll
+14	-1	llvm/lib/Target/PowerPC/PPCAsmPrinter.cpp
+816	-1	7 files not shown
+859	-3	13 files

LLVM/project 51f178d — clang/lib/StaticAnalyzer/Checkers MallocChecker.cpp, clang/test/Analysis NewDelete-atomics.cpp

2024-05-09 01:00:59 UTC by Artem Dergachev via GitHub on ⎇

main

[analyzer] MallocChecker: Recognize std::atomics in smart pointer suppression. (#90918)

Fixes #90498.

Same as 5337efc69cdd5 for atomic builtins, but for `std::atomic` this
time. This is useful because even though the actual builtin atomic is
still there, it may be buried beyond the inlining depth limit.

Also add one popular custom smart pointer class name to the name-based
heuristics, which isn't necessary to fix the bug but arguably a good
idea regardless.

Delta		File
+109	-9	clang/test/Analysis/NewDelete-atomics.cpp
+15	-4	clang/lib/StaticAnalyzer/Checkers/MallocChecker.cpp
+7	-0	clang/test/Analysis/Inputs/system-header-simulator-cxx.h
+131	-13	3 files

LLVM/project 73a0144 — bolt/test/X86 jump-table-fixed-ref-pic.test, bolt/test/X86/Inputs jump-table-fixed-ref-pic.s

2024-05-09 00:56:44 UTC by Maksim Panchenko via GitHub on ⎇

main

[BOLT] Add test case for PIC fixed indirect jump (#91547)

A compiler can generate a redundant indirection for a jump via a fixed
jump table target. Add a test case that covers such pattern that covers
PIC case. We already have non-PIC case detection.

Currently XFAIL.

Delta		File
+35	-0	bolt/test/X86/Inputs/jump-table-fixed-ref-pic.s
+9	-0	bolt/test/X86/jump-table-fixed-ref-pic.test
+44	-0	2 files

LLVM/project 62b5b61 — clang/lib/Sema SemaLookup.cpp SemaTemplate.cpp, clang/test/CXX/temp/temp.res/temp.dep/temp.dep.type p4.cpp

2024-05-09 00:49:59 UTC by Krystian Stasiowski via GitHub on ⎇

main

[Clang][Sema] Fix lookup of dependent operator= outside of complete-class contexts (#91498)

Fixes a crash caused by #90152.

Delta		File
+20	-15	clang/lib/Sema/SemaLookup.cpp
+13	-0	clang/test/CXX/temp/temp.res/temp.dep/temp.dep.type/p4.cpp
+2	-5	clang/lib/Sema/SemaTemplate.cpp
+35	-20	3 files

LLVM/project ba5170f — llvm/test/Transforms/InstCombine lshr.ll

2024-05-09 00:29:14 UTC by AtariDreams via GitHub on ⎇

main

[InstCombine] Thwart complexity-based canonicalization in shl-add test (NFC) (#91413)

Fixed test for #88193

Delta		File
+4	-2	llvm/test/Transforms/InstCombine/lshr.ll
+4	-2	1 files

LLVM/project 409ff97 — llvm/lib/Transforms/InstCombine InstCombineShifts.cpp

2024-05-09 00:26:36 UTC by AtariDreams via GitHub on ⎇

main

[InstCombine] Fix comment from #88193 (NFC) (#91427)

It is inaccurate and needs to be corrected.

Delta		File
+2	-2	llvm/lib/Transforms/InstCombine/InstCombineShifts.cpp
+2	-2	1 files

LLVM/project 1aaab33 — llvm/lib/TargetParser RISCVISAInfo.cpp

2024-05-09 00:22:18 UTC by Craig Topper via GitHub on ⎇

main

[RISCV] Don't use std::vector<std::string> for split extensions in RISCVISAInfo::parseArchString. NFC (#91538)

We can use a SmallVector<StringRef>.

Adjust the code so we check for empty strings in the loop instead of
making a copy of the vector returned from StringRef::split.

This overlaps with #91532 which also removed the std::vector, but
that PR may be more controversial.

Delta		File
+18	-31	llvm/lib/TargetParser/RISCVISAInfo.cpp
+18	-31	1 files

LLVM/project 96568f3 — llvm/docs LangRef.rst, llvm/include/llvm/Transforms/Instrumentation PGOCtxProfLowering.h

2024-05-08 23:49:08 UTC by Mircea Trofin via GitHub on ⎇

main

[llvm][ctx_profile] Add instrumentation lowering (#90821)

This adds the instrumentation lowering pass.

(Tracking Issue: #89287, RFC referenced there)

Delta		File
+326	-0	llvm/lib/Transforms/Instrumentation/PGOCtxProfLowering.cpp
+229	-0	llvm/test/Transforms/PGOProfile/ctx-instrumentation.ll
+43	-5	llvm/docs/LangRef.rst
+17	-0	llvm/test/Transforms/PGOProfile/ctx-instrumentation-invalid-roots.ll
+4	-1	llvm/include/llvm/Transforms/Instrumentation/PGOCtxProfLowering.h
+5	-0	llvm/lib/Passes/PassBuilderPipelines.cpp
+624	-6	2 files not shown
+626	-6	8 files

LLVM/project 1710c8c — flang/lib/Lower Bridge.cpp, flang/test/Lower/HLFIR custom-intrinsic.f90 binary-ops.f90

2024-05-08 23:48:14 UTC by Slava Zakharin via GitHub on ⎇

main

[flang] Lowering changes for assigning dummy_scope to hlfir.declare. (#90989)

The lowering produces fir.dummy_scope operation if the current
function has dummy arguments. Each hlfir.declare generated
for a dummy argument is then using the result of fir.dummy_scope
as its dummy_scope operand. This is only done for HLFIR.

I was not able to find a reliable way to identify dummy symbols
in `genDeclareSymbol`, so I added a set of registered dummy symbols
that is alive during the variables instantiation for the current
function. The set is initialized during the mapping of the dummy
argument symbols to their MLIR values. It is reset right after
all variables are instantiated - this is done to avoid generating
hlfir.declare operations with dummy_scope for the clones of
the dummy symbols (e.g. this happens with OpenMP privatization).

If this can be done in a cleaner way, please advise.

Delta		File
+52	-51	flang/test/Lower/HLFIR/custom-intrinsic.f90
+42	-42	flang/test/Lower/HLFIR/binary-ops.f90
+62	-11	flang/lib/Lower/Bridge.cpp
+23	-23	flang/test/Lower/HLFIR/assignment-intrinsics.f90
+21	-21	flang/test/Lower/HLFIR/designators.f90
+21	-21	flang/test/Lower/OpenMP/parallel-firstprivate-clause-scalar.f90
+221	-169	131 files not shown
+806	-696	137 files

LLVM/project 36d8b37 — llvm/test/CodeGen/RISCV imm.ll, llvm/test/CodeGen/RISCV/rv64-legal-i32 imm.ll

2024-05-08 23:20:07 UTC by Craig Topper on ⎇

main

[RISCV] Add another missed Zbs constant materialization test. NFC

This can be LI+BCLRI+BCLRI.

Delta		File
+67	-0	llvm/test/CodeGen/RISCV/imm.ll
+43	-0	llvm/test/CodeGen/RISCV/rv64-legal-i32/imm.ll
+110	-0	2 files

LLVM/project c0b5a96 — llvm/test/CodeGen/RISCV imm.ll, llvm/test/CodeGen/RISCV/rv64-legal-i32 imm.ll

2024-05-08 23:04:54 UTC by Craig Topper on ⎇

main

[RISCV] Add tests where we could use Zbs instructions in constant materialization. NFC

Delta		File
+137	-0	llvm/test/CodeGen/RISCV/rv64-legal-i32/imm.ll
+116	-0	llvm/test/CodeGen/RISCV/imm.ll
+253	-0	2 files

LLVM/project 2fb3774 — llvm/lib/Transforms/Vectorize SLPVectorizer.cpp, llvm/test/Transforms/SLPVectorizer/AArch64 gather-with-minbith-user.ll user-node-not-in-bitwidths.ll

2024-05-08 23:01:47 UTC by Arthur Eubanks on ⎇

main

Revert "[SLP]Fix PR91467: Look through scalar cast, when trying to cast to another type."

This reverts commit 2475efa91d8b4fa8f1a2d16052cb6d14be7d5dc6.

Causes crashes, see comments on https://github.com/llvm/llvm-project/commit/2475efa91d8b4fa8f1a2d16052cb6d14be7d5dc6.

Delta		File
+8	-1	llvm/test/Transforms/SLPVectorizer/AArch64/gather-with-minbith-user.ll
+6	-1	llvm/test/Transforms/SLPVectorizer/AArch64/user-node-not-in-bitwidths.ll
+1	-5	llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
+2	-1	llvm/test/Transforms/SLPVectorizer/SystemZ/minbitwidth-root-trunc.ll
+17	-8	4 files

LLVM/project 99052c4 — llvm/unittests/IR DebugInfoTest.cpp

2024-05-08 22:51:46 UTC by Augusto Noronha via GitHub on ⎇

main

[gardening][DebugInfo][NFC] Improve comment on HashingDISubprogram test (#91543)

Delta		File
+3	-2	llvm/unittests/IR/DebugInfoTest.cpp
+3	-2	1 files