LLVM / project - FreshBSD

LLVM/project fc7e74e — llvm/test/Analysis/CostModel/X86 trunc-sizelatency.ll trunc-codesize.ll

2024-05-03 11:17:18 UTC by Simon Pilgrim on ⎇

main

[CostModel][X86] getCastInstrCost - improve CostKind adjustment when splitting src/dst types

Noticed in #90883 review - for non-Throughput costs, we weren't applying the split count to the '0 or 1' cost value.

This still doesn't work well, as many of the type legalizations are hidden, so we don't have the split count. Really we need to move to a CostKindCosts-based cost table, but that's going to be a lot of work :/

Delta		File
+807	-277	llvm/test/Analysis/CostModel/X86/trunc-sizelatency.ll
+807	-277	llvm/test/Analysis/CostModel/X86/trunc-codesize.ll
+807	-277	llvm/test/Analysis/CostModel/X86/trunc-latency.ll
+290	-290	llvm/test/Analysis/CostModel/X86/shuffle-replication-i1-latency.ll
+290	-290	llvm/test/Analysis/CostModel/X86/shuffle-replication-i1-sizelatency.ll
+290	-290	llvm/test/Analysis/CostModel/X86/shuffle-replication-i1-codesize.ll
+3,291	-1,701	11 files not shown
+3,584	-1,993	17 files

LLVM/project bcdbd0b — llvm/lib/Transforms/Instrumentation DataFlowSanitizer.cpp

2024-05-03 10:58:40 UTC by Youngsuk Kim on ⎇

main

[llvm][DataFlowSanitizer] Don't pass vector by value (NFC)

Closes #89201

Delta		File
+1	-1	llvm/lib/Transforms/Instrumentation/DataFlowSanitizer.cpp
+1	-1	1 files

LLVM/project 2933ef2 — clang/lib/Driver/ToolChains HIPUtility.cpp

2024-05-03 10:45:17 UTC by Youngsuk Kim on ⎇

main

[clang][HIPUtility] Iterate by const reference (NFC)

Closes #90284

Delta		File
+2	-2	clang/lib/Driver/ToolChains/HIPUtility.cpp
+2	-2	1 files

LLVM/project 256797e — llvm/include/llvm/IR DebugProgramInstruction.h

2024-05-03 10:28:57 UTC by Orlando Cazalet-Hyams on ⎇

main

[NFC][RemoveDIs] Fix some comments in DebugProgramInstruction.h

Delta		File
+5	-7	llvm/include/llvm/IR/DebugProgramInstruction.h
+5	-7	1 files

LLVM/project 1efc191 — clang/lib/Driver/ToolChains Clang.cpp

2024-05-03 10:18:27 UTC by Youngsuk Kim on ⎇

main

[clang][Driver] Iterate with const reference (NFC)

Closes #90282

Delta		File
+1	-1	clang/lib/Driver/ToolChains/Clang.cpp
+1	-1	1 files

LLVM/project 6086f69 — clang-tools-extra/clang-tidy/cert CERTTidyModule.cpp, clang-tools-extra/docs ReleaseNotes.rst

2024-05-03 10:11:50 UTC by whisperity via GitHub on ⎇

main

[clang-tidy] Add 'cert-int09-c' alias for 'readability-enum-initial-value' (#90868)

The check's ruling exactly matches the corresponding CERT C
Recommendation and, as such, is worth a trivial alias.

Delta		File
+49	-36	clang-tools-extra/docs/clang-tidy/checks/readability/enum-initial-value.rst
+10	-0	clang-tools-extra/docs/clang-tidy/checks/cert/int09-c.rst
+4	-1	clang-tools-extra/docs/clang-tidy/checks/list.rst
+4	-0	clang-tools-extra/docs/ReleaseNotes.rst
+4	-0	clang-tools-extra/clang-tidy/cert/CERTTidyModule.cpp
+71	-37	5 files

LLVM/project fb1c2db — clang/lib/AST/Interp Interp.h Program.cpp, clang/test/AST/Interp builtin-align-cxx.cpp c.c

2024-05-03 09:56:23 UTC by David Spickett on ⎇

main

Revert "Reapply "[clang][Interp] Create full type info for dummy pointers""

This reverts commit 1aeb64c8ec7b96b2301929d8a325a6e1d9ddaa2f.

Due to failures in 32-bit Arm builds:
https://lab.llvm.org/buildbot/#/builders/245/builds/24041

Delta		File
+22	-14	clang/lib/AST/Interp/Interp.h
+9	-12	clang/lib/AST/Interp/Program.cpp
+14	-1	clang/test/AST/Interp/builtin-align-cxx.cpp
+8	-0	clang/lib/AST/Interp/Descriptor.cpp
+3	-3	clang/lib/AST/Interp/Descriptor.h
+0	-3	clang/test/AST/Interp/c.c
+56	-33	6 files

LLVM/project 4e67b50 — llvm/test/Transforms/AtomicExpand/AMDGPU expand-atomic-f32-agent.ll expand-atomic-f64-agent.ll

2024-05-03 09:50:59 UTC by Matt Arsenault on ⎇

main

AMDGPU: Add more tests for atomicrmw handling

Add agent scope copies of atomicrmw atomics tests.
Expand testing for the undo identity atomicrmw case.
Test 16-bit atomic expansions.

Delta		File
+3,717	-0	llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f32-agent.ll
+1,685	-0	llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f64-agent.ll
+859	-0	llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-v2f16-agent.ll
+859	-0	llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-v2bf16-agent.ll
+668	-0	llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-i64-agent.ll
+668	-0	llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-i32-agent.ll
+8,456	-0	3 files not shown
+8,869	-21	9 files

LLVM/project 9f9856d — llvm/test/CodeGen/AMDGPU global_atomics_i64_system.ll flat_atomics_i32_system.ll, llvm/test/Transforms/AtomicExpand/AMDGPU expand-atomic-f32-system.ll expand-atomic-f64-system.ll

2024-05-03 09:50:59 UTC by Matt Arsenault on ⎇

main

AMDGPU: Update name for amdgpu.no.remote.memory metadata

Delta		File
+218	-218	llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f32-system.ll
+140	-140	llvm/test/CodeGen/AMDGPU/global_atomics_i64_system.ll
+140	-140	llvm/test/CodeGen/AMDGPU/flat_atomics_i32_system.ll
+140	-140	llvm/test/CodeGen/AMDGPU/flat_atomics_i64_system.ll
+140	-140	llvm/test/CodeGen/AMDGPU/global_atomics_i32_system.ll
+98	-98	llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f64-system.ll
+876	-876	10 files not shown
+1,277	-1,277	16 files

LLVM/project 385f59f — llvm/include/llvm/MC MCRegisterInfo.h, llvm/lib/MCA InstrBuilder.cpp

2024-05-03 09:30:22 UTC by Rin Dobrescu via GitHub on ⎇

main

[llvm-mca] Teach MCA constant registers do not create dependencies (#89387)

Constant registers like the zero registers XZR and WZR are treated as
any other register by LLVM-MCA. This can create nonexistent dependency
chains.
Currently there is no method in MCA to query if a register is constant.
This patch fixes the issue by adding a bool Constant
variable to MCRegisterDesc that is true for constant registers. Since
constant registers do not create dependencies, it makes sense to add
this check to MCA.
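
The idea can be illustrated with a minimal sketch. It is not the MCA implementation: `RegDesc`, `DepTracker`, `recordWrite`, and `producerOf` are hypothetical names, and the only assumption carried over from the message is a per-register boolean that marks constant registers.

```cpp
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for a register description carrying a "constant" flag.
struct RegDesc {
  bool IsConstant;
};

// Hypothetical tracker: remembers the last instruction that wrote each register.
class DepTracker {
  std::vector<RegDesc> Regs;
  std::unordered_map<unsigned, int> LastWriter;

public:
  explicit DepTracker(std::vector<RegDesc> R) : Regs(std::move(R)) {}

  // Record that instruction Inst writes Reg. Writes to a constant register
  // are ignored because the register's value never changes.
  void recordWrite(unsigned Reg, int Inst) {
    if (Regs[Reg].IsConstant)
      return;
    LastWriter[Reg] = Inst;
  }

  // Return the index of the instruction a read of Reg depends on,
  // or -1 if there is none. Constant registers never have a producer.
  int producerOf(unsigned Reg) const {
    if (Regs[Reg].IsConstant)
      return -1;
    auto It = LastWriter.find(Reg);
    return It == LastWriter.end() ? -1 : It->second;
  }
};
```

With register 1 flagged as constant, a write to it followed by a read reports no producer, so no dependency edge is created between the two instructions.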

Delta		File
+76	-0	llvm/test/tools/llvm-mca/AArch64/Neoverse/V1-zero-dependency.s
+12	-12	llvm/test/tools/llvm-mca/AArch64/HiSilicon/tsv110-forwarding.s
+16	-5	llvm/lib/MCA/InstrBuilder.cpp
+6	-0	llvm/include/llvm/MC/MCRegisterInfo.h
+3	-2	llvm/utils/TableGen/RegisterInfoEmitter.cpp
+113	-19	5 files

LLVM/project b4e751e — llvm/lib/Target/AMDGPU SIISelLowering.cpp, llvm/test/CodeGen/AMDGPU llvm.set.rounding.ll

2024-05-03 09:17:18 UTC by Matt Arsenault via GitHub on ⎇

main

AMDGPU: Optimize set_rounding if input is known to fit in 2 bits (#88588)

We don't need to figure out the weird extended rounding modes or
handle offsets to keep the lookup table in 64 bits.
    
https://reviews.llvm.org/D153258

Depends #88587

Delta		File
+104	-308	llvm/test/CodeGen/AMDGPU/llvm.set.rounding.ll
+41	-21	llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+145	-329	2 files

LLVM/project 6218992 — clang/lib/Sema SemaTemplate.cpp, clang/test/SemaCXX cxx20-ctad-type-alias.cpp

2024-05-03 09:10:43 UTC by Haojian Wu on ⎇

users/hokein/fix-ctad-aggregate-base

[clang] CTAD: fix the aggregate deduction guide for alias templates.

For alias templates, the way we construct their aggregate deduction guides
does not follow the standard approach. We should do the same thing as we do
for implicit deduction guides.

Delta		File
+2	-60	clang/lib/Sema/SemaTemplate.cpp
+14	-0	clang/test/SemaCXX/cxx20-ctad-type-alias.cpp
+7	-0	clang/test/SemaTemplate/deduction-guide.cpp
+23	-60	3 files

LLVM/project 4036514 — clang/lib/Sema SemaTemplate.cpp

2024-05-03 08:53:54 UTC by Haojian Wu on ⎇

users/hokein/fix-ctad-aggregate-base

Refactor: Extract the core deduction-guide construction implementation from DeclareImplicitDeductionGuidesForTypeAlias

We move the core implementation to a dedicated function, so that it can
be reused in other places.

Delta		File
+203	-187	clang/lib/Sema/SemaTemplate.cpp
+203	-187	1 files

LLVM/project 7c64b53 — llvm/utils/gn/secondary/llvm/unittests/CodeGen/GlobalISel BUILD.gn

2024-05-03 08:50:20 UTC by LLVM GN Syncbot on ⎇

main

[gn build] Port ed299b3efd66

Delta		File
+1	-0	llvm/utils/gn/secondary/llvm/unittests/CodeGen/GlobalISel/BUILD.gn
+1	-0	1 files

LLVM/project e47d7c6 — llvm/lib/Target/AMDGPU AMDGPUInsertSingleUseVDST.cpp

2024-05-03 08:38:51 UTC by Simon Pilgrim on ⎇

main

Fix MSVC signed/unsigned mismatch warning. NFC.

Delta		File
+1	-1	llvm/lib/Target/AMDGPU/AMDGPUInsertSingleUseVDST.cpp
+1	-1	1 files

LLVM/project ed299b3 — llvm/include/llvm/CodeGen/GlobalISel GIMatchTableExecutor.h GIMatchTableExecutorImpl.h, llvm/include/llvm/Support Compiler.h

2024-05-03 08:26:54 UTC by Pierre van Houtryve via GitHub on ⎇

main

[GlobalISel] Optimize ULEB128 usage (#90565)

- Remove some cases where ULEB128 isn't needed
- Add a fastDecodeULEB128 tailored for GlobalISel which does unchecked
decoding optimized for the common case, which is 1 byte values. We
rarely have >1 byte Inst IDs, OpIdx, etc. and those are the most common
ULEB users by far.

This specific LEB128 decode function generates almost 2x fewer
instructions than the generic one.
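
A minimal sketch of an unchecked ULEB128 decoder with a single-byte fast path follows. It is not the function added by the commit: the name `fastDecodeULEB128Sketch` and its signature are hypothetical, and it assumes well-formed input whose value fits in 64 bits.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper: decodes an unsigned LEB128 value starting at p and
// stores the number of bytes consumed in size. No bounds or overflow checks.
inline uint64_t fastDecodeULEB128Sketch(const uint8_t *p, size_t &size) {
  // Fast path: a clear high bit means the value fits in this single byte.
  uint64_t value = p[0];
  if ((value & 0x80) == 0) {
    size = 1;
    return value;
  }

  // Slow path: accumulate 7 payload bits per byte until a byte with a
  // clear continuation bit is seen.
  value &= 0x7f;
  size_t i = 1;
  unsigned shift = 7;
  uint8_t byte;
  do {
    byte = p[i++];
    value |= static_cast<uint64_t>(byte & 0x7f) << shift;
    shift += 7;
  } while (byte & 0x80);

  size = i;
  return value;
}
```

Handling the one-byte case before entering the loop keeps the common path to a load, a test, and a return, which is the kind of saving the message describes.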

Delta		File
+49	-0	llvm/unittests/CodeGen/GlobalISel/GIMatchTableExecutorTest.cpp
+24	-2	llvm/include/llvm/CodeGen/GlobalISel/GIMatchTableExecutor.h
+6	-9	llvm/include/llvm/CodeGen/GlobalISel/GIMatchTableExecutorImpl.h
+8	-0	llvm/include/llvm/Support/Compiler.h
+6	-2	llvm/utils/TableGen/Common/GlobalISel/GlobalISelMatchTable.cpp
+1	-0	llvm/unittests/CodeGen/GlobalISel/CMakeLists.txt
+94	-13	6 files

LLVM/project 8480c93 — clang/docs ReleaseNotes.rst, clang/include/clang/Basic DiagnosticSemaKinds.td

2024-05-03 08:20:42 UTC by YanzuoLiu via GitHub on ⎇

main

[clang] pointer to member with qualified-id enclosed in parentheses in unevaluated context should be invalid (#89713)

Clang doesn't check whether the operand of the & operator is enclosed in
parentheses when a pointer to member is formed in an unevaluated context,
for example:

```cpp
struct foo { int val; };

int main() { decltype(&(foo::val)) ptr; }
```

`decltype(&(foo::val))` should be invalid, but clang accepts it. This PR
fixes this issue.
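
For contrast, a short sketch of the well-formed spelling: the qualified-id is written without parentheses, so `&foo::val` forms a pointer to member. The helper `readVal` is a hypothetical name added only to show the pointer being used.

```cpp
#include <type_traits>

struct foo { int val; };

// Unparenthesized qualified-id: the operand of & names a member, so the
// result has pointer-to-member type.
static_assert(std::is_same<decltype(&foo::val), int foo::*>::value,
              "&foo::val is a pointer to data member");

// Hypothetical helper: reads the member through the pointer to member.
inline int readVal(const foo &f) {
  int foo::*ptr = &foo::val;
  return f.*ptr;
}
```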

Fixes #40906.

---------

Co-authored-by: cor3ntin <corentinjabot at gmail.com>

Delta		File
+17	-0	clang/test/CXX/expr/expr.unary/expr.unary.op/p4.cpp
+16	-0	clang/lib/Sema/SemaExpr.cpp
+3	-0	clang/include/clang/Basic/DiagnosticSemaKinds.td
+2	-0	clang/docs/ReleaseNotes.rst
+38	-0	4 files

LLVM/project e4b04b3 — mlir/include/mlir/Dialect/Transform/IR TransformOps.td, mlir/include/mlir/Dialect/Transform/Interfaces TransformInterfaces.h

2024-05-03 08:15:44 UTC by Oleksandr "Alex" Zinenko via GitHub on ⎇

main

[mlir] make transform.foreach_match forward arguments (#89920)

It may be useful to have access to additional handles or parameters when
performing matches and actions in `foreach_match`, for example, to
parameterize the matcher by rank or restrict it in a non-trivial way.
Enable `foreach_match` to forward additional handles from operands to
matcher symbols and from action symbols to results.

Delta		File
+123	-37	mlir/lib/Dialect/Transform/IR/TransformOps.cpp
+110	-0	mlir/test/Dialect/Transform/foreach-match.mlir
+81	-4	mlir/test/Dialect/Transform/ops-invalid.mlir
+38	-22	mlir/include/mlir/Dialect/Transform/IR/TransformOps.td
+29	-5	mlir/lib/Dialect/Transform/Interfaces/TransformInterfaces.cpp
+13	-0	mlir/include/mlir/Dialect/Transform/Interfaces/TransformInterfaces.h
+394	-68	6 files

LLVM/project edbe6eb — llvm/lib/CodeGen/SelectionDAG LegalizeFloatTypes.cpp, llvm/lib/Target/SystemZ SystemZISelLowering.cpp

2024-05-03 08:04:12 UTC by Matt Arsenault via GitHub on ⎇

main

SystemZ: Don't promote atomic store in IR (#90899)

This is the mirror to the recent atomic load change. The same
bitcast-back-to-integer case is a small code quality regression for the
same reason. This would disappear with a bitcastable legal 128-bit type.

Delta		File
+35	-11	llvm/lib/Target/SystemZ/SystemZISelLowering.cpp
+16	-7	llvm/test/CodeGen/SystemZ/atomic-store-08.ll
+0	-1	llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp
+51	-19	3 files

LLVM/project 6535e7a — llvm/test/CodeGen/SystemZ copy-phys-reg-gr128-to-vr128.mir

2024-05-03 08:03:05 UTC by Matt Arsenault on ⎇

main

SystemZ: Remove redundant copy tests from 75f4baa70

Delta		File
+0	-31	llvm/test/CodeGen/SystemZ/copy-phys-reg-gr128-to-vr128.mir
+0	-31	1 files

LLVM/project 44648cc — llvm/lib/Target/AMDGPU AMDGPUAsmPrinter.cpp, llvm/test/CodeGen/AMDGPU pal-metadata-3.0.ll

2024-05-03 08:01:03 UTC by Carl Ritson via GitHub on ⎇

main

[AMDGPU] Always emit lds_size in PAL ELF Metadata 3.0 (#87222)

Emit lds_size for all shader types in PAL metadata.

Delta		File
+39	-0	llvm/test/CodeGen/AMDGPU/pal-metadata-3.0.ll
+4	-4	llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
+43	-4	2 files

LLVM/project 9731b77 — llvm/docs AMDGPUUsage.rst ReleaseNotes.rst, llvm/lib/Target/AMDGPU SIModeRegisterDefaults.cpp SIISelLowering.cpp

2024-05-03 07:41:27 UTC by Matt Arsenault via GitHub on ⎇

main

AMDGPU: Implement llvm.set.rounding (#88587)

Use a shift of a magic constant and some offsetting to convert from
flt_rounds values.

I don't know why the enum defines Dynamic = 7. The standard suggests -1
is the "cannot determine" value. If we could start the extended values at
4 we wouldn't need the extra compare, sub, and select.
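
The "shift of a magic constant" technique can be sketched in isolation as a lookup table packed into one integer. This is not the lowering from the commit: the 4-bit entry width, the names `kPackedTable` and `lookupPacked`, and the target-side encodings are placeholders chosen for illustration. Only the four standard FLT_ROUNDS input values (0 toward zero, 1 to nearest, 2 upward, 3 downward) are taken as given.

```cpp
#include <cstdint>

// Hypothetical packed table: entry i occupies bits [4*i, 4*i + 3].
// The output encodings below are illustrative placeholders.
constexpr uint64_t kPackedTable = (3ull << 0)    // 0: toward zero -> 3
                                  | (0ull << 4)  // 1: to nearest  -> 0
                                  | (1ull << 8)  // 2: upward      -> 1
                                  | (2ull << 12); // 3: downward    -> 2

// Select an entry by shifting the constant right and masking,
// instead of loading from a table in memory.
constexpr unsigned lookupPacked(unsigned fltRounds) {
  return static_cast<unsigned>((kPackedTable >> (fltRounds * 4)) & 0xF);
}
```

With 4-bit entries a 64-bit constant holds at most 16 entries, so inputs outside that range need an offset or a separate compare before the shift.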

https://reviews.llvm.org/D153257

Delta		File
+1,919	-0	llvm/test/CodeGen/AMDGPU/llvm.set.rounding.ll
+113	-0	llvm/lib/Target/AMDGPU/SIModeRegisterDefaults.cpp
+72	-0	llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+12	-0	llvm/lib/Target/AMDGPU/SIModeRegisterDefaults.h
+6	-0	llvm/docs/AMDGPUUsage.rst
+2	-0	llvm/docs/ReleaseNotes.rst
+2,124	-0	2 files not shown
+2,127	-0	8 files

LLVM/project 13a6fe8 — clang/include/clang/CIRFrontendAction CIRGenAction.h, clang/lib/CIR/CodeGen CIRGenModule.h

2024-05-03 07:22:53 UTC by Nathan Lanza on ⎇

users/lanza/sprcirgenmodule-buildtopleveldecl-husk

fix comments

Created using spr 1.3.5

Delta		File
+4	-4	clang/include/clang/CIRFrontendAction/CIRGenAction.h
+3	-5	clang/lib/CIR/CodeGen/CIRGenModule.h
+6	-0	clang/lib/FrontendTool/CMakeLists.txt
+1	-1	clang/lib/CIR/FrontendAction/CIRGenAction.cpp
+2	-0	clang/lib/CIR/FrontendAction/CMakeLists.txt
+16	-10	5 files

LLVM/project 70b5a22 — llvm/lib/Transforms/Utils MemoryTaggingSupport.cpp, llvm/test/Instrumentation/HWAddressSanitizer alloca.ll

2024-05-03 07:16:57 UTC by Vitaly Buka via GitHub on ⎇

main

[hwasan] Don't crash on vscale allocas (#90932)

getAllocaSizeInBytes would crash when casting the size to a
constant.

Delta		File
+18	-0	llvm/test/Instrumentation/HWAddressSanitizer/alloca.ll
+2	-0	llvm/lib/Transforms/Utils/MemoryTaggingSupport.cpp
+20	-0	2 files

LLVM/project 4300feb — clang/include/clang/CIR CIRGenerator.h, clang/include/clang/CIRFrontendAction CIRGenAction.h

2024-05-03 07:10:47 UTC by Nathan Lanza on ⎇

users/lanza/sprcirgenmodule-buildtopleveldecl-husk

get buildTopLevelDecl to run

Created using spr 1.3.5

Delta		File
+45	-2	clang/lib/CIR/FrontendAction/CIRGenAction.cpp
+31	-0	clang/include/clang/CIR/CIRGenerator.h
+24	-3	clang/lib/CIR/CodeGen/CIRGenModule.h
+20	-0	clang/lib/CIR/CodeGen/CIRGenerator.cpp
+16	-0	clang/lib/CIR/CodeGen/CIRGenModule.cpp
+3	-4	clang/include/clang/CIRFrontendAction/CIRGenAction.h
+139	-9	7 files not shown
+158	-15	13 files

LLVM/project e450f98 — lldb/source/Utility Scalar.cpp, lldb/test/API/python_api/type TestTypeList.py main.cpp

2024-05-03 07:07:20 UTC by Pavel Labath via GitHub on ⎇

main

[lldb] Fix Scalar::GetData for non-multiple-of-8-bits values (#90846)

It was aligning the byte size down. Now it aligns up. This manifested
itself as SBTypeStaticField::GetConstantValue returning a zero-sized
value for `bool` fields (because clang represents bool as a 1-bit
value).

I've changed the code for float Scalars as well, although I'm not aware
of floating point values that are not multiples of 8 bits.
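
The difference between aligning down and aligning up can be shown with a small sketch. The helper names are hypothetical and this is not the lldb code, only the arithmetic the message describes.

```cpp
#include <cstddef>

// Hypothetical helper: truncating division drops any partial byte.
constexpr std::size_t byteSizeRoundedDown(std::size_t bits) {
  return bits / 8;
}

// Hypothetical helper: adding 7 before dividing rounds a partial byte up.
constexpr std::size_t byteSizeRoundedUp(std::size_t bits) {
  return (bits + 7) / 8;
}

// A 1-bit value needs zero bytes when rounding down and one when rounding up.
static_assert(byteSizeRoundedDown(1) == 0, "1 bit rounds down to 0 bytes");
static_assert(byteSizeRoundedUp(1) == 1, "1 bit rounds up to 1 byte");
```

For bit widths that are already multiples of 8 the two helpers agree, so only values such as a 1-bit `bool` are affected.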

Delta		File
+30	-0	lldb/unittests/Utility/ScalarTest.cpp
+13	-0	lldb/test/API/python_api/type/TestTypeList.py
+2	-2	lldb/source/Utility/Scalar.cpp
+1	-0	lldb/test/API/python_api/type/main.cpp
+46	-2	4 files

LLVM/project 0ddf974 — flang/test/Lower/OpenMP default-clause.f90, llvm/lib/Transforms/AggressiveInstCombine AggressiveInstCombine.cpp

2024-05-03 07:06:56 UTC by Vitaly Buka on ⎇

users/vitalybuka/spr/hwasan-dont-crash-on-vscale-allocas

rebase

Created using spr 1.3.4

Delta		File
+243	-23	llvm/lib/Transforms/AggressiveInstCombine/AggressiveInstCombine.cpp
+0	-219	llvm/test/Transforms/AggressiveInstCombine/strcmp.ll
+216	-0	llvm/test/Transforms/AggressiveInstCombine/strncmp-1.ll
+147	-0	llvm/test/Transforms/AggressiveInstCombine/strncmp-2.ll
+59	-20	mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
+51	-4	flang/test/Lower/OpenMP/default-clause.f90
+716	-266	16 files not shown
+855	-314	22 files

LLVM/project b03e7a5 — llvm/test/Instrumentation/HWAddressSanitizer alloca.ll

2024-05-03 07:02:36 UTC by Vitaly Buka via GitHub on ⎇

users/vitalybuka/spr/hwasan-dont-crash-on-vscale-allocas

[HWASAN] Regenerate a test (#90943)

Delta		File
+81	-81	llvm/test/Instrumentation/HWAddressSanitizer/alloca.ll
+81	-81	1 files

LLVM/project 922ab70 — llvm/lib/Frontend/OpenMP OMPIRBuilder.cpp, mlir/lib/Target/LLVMIR/Dialect/OpenMP OpenMPToLLVMIRTranslation.cpp

2024-05-03 06:59:01 UTC by Kareem Ergawy via GitHub on ⎇

users/vitalybuka/spr/hwasan-dont-crash-on-vscale-allocas

[MLIR][OpenMP] Extend omp.private materialization support: `dealloc` (#90841)

Extends current support for delayed privatization during translation to
LLVM IR. This adds support for materializing the `dealloc` region in
`omp.private` ops when this region contains clean-up/deallocation logic
that needs to be executed at the end of the parallel region.

This changes the `OMPIRBuilder` slightly to execute the finalization
callback **after** the privatization callback. This allows us to collect
information about privatized variables on the MLIR and LLVM sides so
that we can properly emit deallocation logic.

Delta		File
+59	-20	mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
+53	-0	mlir/test/Target/LLVMIR/openmp-omp.private-dealloc.mlir
+13	-13	llvm/lib/Frontend/OpenMP/OMPIRBuilder.cpp
+125	-33	3 files

LLVM/project f8fedfb — lldb/packages/Python/lldbsuite/test/make Makefile.rules

2024-05-03 06:30:49 UTC by Pavel Labath on ⎇

main

[lldb] Fix TestSharedLibStrippedSymbols for #90622

`ifeq` needs to be at the beginning of a line, otherwise it's
interpreted as part of the recipe.

Delta		File
+2	-2	lldb/packages/Python/lldbsuite/test/make/Makefile.rules
+2	-2	1 files