llvm-project

Author	SHA1	Message	Date
Ellis Hoag	9cc5a4bf66	Remove llvm::shouldOptForSize() from Utils.h (#112630 ) Remove `llvm::shouldOptForSize()` from `Utils.h` since we can use `llvm::shouldOptimizeForSize()` from `SizeOpts.h` instead. Depends on https://github.com/llvm/llvm-project/pull/112626	2024-10-29 14:23:47 -05:00
Afanasyev Ivan	4e1b9d34f9	[mir-strip-debug] Fix debug location info strip for bundled instructions (#113676 ) Fix bug that `mir-strip-debug` pass does not remove debug location from bundled instructions. Problem arises during testing that debug info does not affect optimization passes output (`llvm-lit` with ` -Dllc="llc -debugify-and-strip-all-safe"`), when pass operates on MIR with bundled instructions + memory operands. Let mir test check looks like: ``` CHECK-NEXT: BUNDLE { CHECK-NEXT: $r3 = LD $r1, $r2 :: (load (s64) from %ir.a, !tbaa !2) CHECK-NEXT: } ``` So as `mir-strip-debug` pass does not process bundled instructions, running `llc -debugify-and-strip-all-safe` on the test will produce the following output: ``` BUNDLE { $r3 = LD $r1, $r2, debug-location !DILocation(line: 3, column: 1, scope: <0x608cb2b99b10>) :: (load (s64) from %ir.a, !tbaa !2) } ``` And test will fail, but it shouldn't. Seems like the root cause is that `mir-strip-debug` pass should remove debug location from bundled instructions.	2024-10-29 10:26:15 -07:00
Matt Arsenault	88e23eb2cf	DAG: Fix legalization of vector addrspacecasts (#113964 )	2024-10-29 08:08:50 -05:00
Jay Foad	2443549b85	[IR] Remove some uses of StructType::setBody. NFC. (#113685 ) It is simple to create the struct body up front, now that we have transitioned to opaque pointers.	2024-10-29 11:44:53 +00:00
Benjamin Maxwell	c3260c65e8	[IR] Add `llvm.sincos` intrinsic (#109825 ) This adds the `llvm.sincos` intrinsic, legalization, and lowering. The `llvm.sincos` intrinsic takes a floating-point value and returns both the sine and cosine (as a struct). ``` declare { float, float } @llvm.sincos.f32(float %Val) declare { double, double } @llvm.sincos.f64(double %Val) declare { x86_fp80, x86_fp80 } @llvm.sincos.f80(x86_fp80 %Val) declare { fp128, fp128 } @llvm.sincos.f128(fp128 %Val) declare { ppc_fp128, ppc_fp128 } @llvm.sincos.ppcf128(ppc_fp128 %Val) declare { <4 x float>, <4 x float> } @llvm.sincos.v4f32(<4 x float> %Val) ``` The lowering is built on top of the existing FSINCOS ISD node, with additional type legalization to allow for f16, f128, and vector values.	2024-10-29 10:52:20 +00:00
Matt Arsenault	1ceccbb0dd	VirtRegRewriter: Add implicit register defs for live out undef lanes (#112679 ) If an undef subregister def is live into another block, we need to maintain a physreg def to track the liveness of those lanes. This would manifest a verifier error after branch folding, when the cloned tail block use no longer had a def. We need to detect interference with other assigned intervals to avoid clobbering the undef lanes defined in other intervals, since the undef def didn't count as interference. This is pretty ugly and adds a new dependency on LiveRegMatrix, keeping it live for one more pass. It also adds a lot of implicit operand spam (we really should have a better representation for this). There is a missing verifier check for this situation. Added an xfailed test that demonstrates this. We may also be able to revert the changes in 47d3cbcf842a036c20c3f1c74255cdfc213f41c2. It might be better to insert an IMPLICIT_DEF before the instruction rather than using the implicit-def operand. Fixes #98474	2024-10-28 17:33:53 -07:00
Ellis Hoag	6ab26eab4f	Check hasOptSize() in shouldOptimizeForSize() (#112626 )	2024-10-28 09:45:03 -07:00
Simon Pilgrim	056cf936a7	[DAG] Fold (and X, (bswap/bitreverse (not Y))) -> (and X, (not (bswap/bitreverse Y))) (#112547 ) On ANDNOT capable targets we can always do this profitably, without ANDNOT we only attempt this if we don't introduce an additional NOT Fixes #112425	2024-10-28 11:52:44 +00:00
Jack Styles	933a56674e	[PAuthLR] Add Missing Break Statement for MachineOperand Switch Statement (#113883 ) There was a missing break, which led to an unannotated fallthrough when merging #112171. This has caused sanitizer builds to fail. This adds the missing break in the switch statement to ensure that the fallthrough does not occur.	2024-10-28 09:08:48 +00:00
Jack Styles	86f76c3b17	[AArch64][Libunwind] Add Support for FEAT_PAuthLR DWARF Instruction (#112171 ) As part of FEAT_PAuthLR, a new DWARF Frame Instruction was introduced, `DW_CFA_AARCH64_negate_ra_state_with_pc`. This instructs Libunwind that the PC has been used with the signing instruction. This change includes three commits - Libunwind support for the newly introduced DWARF Instruction - CodeGen Support for the DWARF Instructions - Reversing the changes made in #96377. Due to `DW_CFA_AARCH64_negate_ra_state_with_pc`'s requirements to be placed immediately after the signing instruction, this would mean the CFI Instruction location was not consistent with the generated location when not using FEAT_PAuthLR. The commit reverses the changes and makes the location consistent across the different branch protection options. While this does have a code size effect, this is a negligible one. For the ABI information, see here: `853286c7ab/aadwarf64/aadwarf64.rst (id23)`	2024-10-28 08:22:38 +00:00
Aiden Grossman	7c9cf0c6f0	[SHT_LLVM_BB_ADDR_MAP][AsmPrinter] Emit error on bad option combinatons This patch makes it so that specifying all or none for -pgo-analysis-map along with an explicit option causes an error as this set of options does not really make sense.	2024-10-26 08:15:34 +00:00
Aiden Grossman	38caf282ab	[SHT_LLVM_BB_ADDR_MAP][AsmPrinter] Add none and all options to PGO Map (#111221 ) This patch adds none and all options to the -pgo-analysis-map flag, which do basically what they say on the tin. The none option is added to enable forcing the pgo-analysis-map by overriding an earlier invocation of the flag. The all option is just added for convenience.	2024-10-25 15:39:52 -07:00
Gaëtan Bossu	a0c318938a	[CodeGen][NFC] Properly split MachineLICM and EarlyMachineLICM (#113573 ) Both are based on MachineLICMBase, and the functionality there is "switched" based on a PreRegAlloc flag. This commit is simply about trusting the original value of that flag, defined by the `MachineLICM` and `EarlyMachineLICM` classes. The `PreRegAlloc` flag used to be overwritten it based on MRI.isSSA(), which is un-reliable due to how it is inferred by the MIRParser. I see that we can now define isSSA in MIR (thanks @gargaroff ), meaning the fix isn’t really needed anymore, but redefining that flag still feels wrong. Note that I'm looking into upstreaming more changes to MachineLICM, see [the discourse thread](https://discourse.llvm.org/t/extending-post-regalloc-machinelicm/82725).	2024-10-25 11:19:22 -07:00
Tex Riddell	c03d09ce3e	[aarch64] atan2 intrinsic lowering (p5) (#112611 ) This change is part of this proposal: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 - `VecFuncs.def`: define intrinsic to sleef/armpl mapping - `LegalizerHelper.cpp`: add missing fewerElementsVector handling for the new atan2 intrinsic - `AArch64ISelLowering.cpp`: Add arch64 specializations for lowering like neon instructions - `AArch64LegalizerInfo.cpp`: Legalize atan2. Part 5 for Implement the atan2 HLSL Function #70096.	2024-10-24 17:53:12 -07:00
Daniel Hoekwater	1b8cff9a52	Reland "CFIFixup] Factor CFI remember/restore insertion into a helper (NFC)" (#113387 ) The original patch (ac1a01f) dereferenced an end iterator, breaking some tests (e.g. https://lab.llvm.org/buildbot/#/builders/17/builds/3116). This updated patch only accesses the iterator when it's valid.	2024-10-24 17:07:27 -04:00
Dimitry Andric	4bce21480f	Ensure !NDEBUG with LLVM_ENABLE_ABI_BREAKING_CHECKS does not segfault (#113588 ) In SelectionDAG, `TargetTransformInfo::hasBranchDivergence()` can be called when both `NDEBUG` and `LLVM_ENABLE_ABI_BREAKING_CHECKS` are enabled. In that case, the class member `TTI` is still initialized to `nullptr`, causing a segfault. Fix this by ensuring that all the calls to `hasBranchDivergence` and `VerifyDAGDivergence` only occur when `NDEBUG` is disabled, and `LLVM_ENABLE_ABI_BREAKING_CHECKS` is enabled.	2024-10-24 19:30:38 +02:00
Zaara Syeda	f3131c99bf	[GlobalMerge] Aggressively merge constants to reduce TOC entries (#111756 ) Symbols that get mapped into the read-only section are loaded as part of the text segment and will always need a TOC entry to be addressable. Add an option to aggressively merge these read only globals to reduce TOC usage.	2024-10-24 10:16:39 -04:00
Nuno Lopes	509af087cc	replace 2 placeholder uses of undef with poison [NFC]	2024-10-24 09:01:25 +01:00
Kazu Hirata	141574bacb	[llvm] Remove redundant calls to std::unique_ptr<T>::get (NFC) (#113415 )	2024-10-23 10:44:09 -07:00
Vladimir Radosavljevic	401d123a1f	[MCP] Optimize copies when src is used during backward propagation (#111130 ) Before this patch, redundant COPY couldn't be removed for the following case: ``` $R0 = OP ... ... // Read of %R0 $R1 = COPY killed $R0 ``` This patch adds support for tracking the users of the source register during backward propagation, so that we can remove the redundant COPY in the above case and optimize it to: ``` $R1 = OP ... ... // Replace all uses of %R0 with $R1 ```	2024-10-23 13:37:02 +02:00
Akshat Oke	c4c60c0db9	[CodeGen][NewPM] Port OptimizePHIs to NPM (#113433 )	2024-10-23 16:55:21 +05:30
Augusto Noronha	8234f8ae26	[DebugInfo] Emit linkage name into DWARF for types for Swift (#112802 ) Store Swift mangled names in DW_AT_linkage_name. The Swift compiler emits only the type mangled name in debug information, and LLDB uses those mangled names as keys to look up size, alignment, fields, etc from either reflection metadata or Swift modules. Additionally, emit types linkage names for types into the accelerator table if they exist and they're different from the display name.	2024-10-22 16:47:58 -07:00
Heejin Ahn	5c92f2331c	[WebAssembly] Fix MIR printing of reference types (#113028 ) When printing a memory operand in MIR, this line `d37bc32a65/llvm/lib/CodeGen/MachineOperand.cpp (L1247)` calls this `d37bc32a65/llvm/include/llvm/Support/Alignment.h (L238)` which assumes `Rhs` (the size in this case) is positive. But Wasm reference types' size is set to 0: `d37bc32a65/llvm/include/llvm/CodeGen/ValueTypes.td (L326-L328)` `getSize() > 0` condition was added with the Wasm reference types support in `46667a1003`, and it looks it was removed in #84751. This revives the condition so that Wasm reference types will not crash the MIR printer.	2024-10-22 13:48:00 -07:00
Daniel Hoekwater	f66bc4d3f1	Revert "Reland [CFIFixup] Factor CFI remember/restore insertion into a helper (NFC)" (#113340 ) Reverts llvm/llvm-project#113328 This change breaks a number of builds (e.g https://lab.llvm.org/buildbot/#/builders/25/builds/3504), for some reason. Reverting to do some troubleshooting.	2024-10-22 12:50:15 -04:00
Daniel Hoekwater	ac1a01f533	Reland [CFIFixup] Factor CFI remember/restore insertion into a helper (NFC) (#113328 ) The previous submission looked like it triggered build failure https://lab.llvm.org/buildbot/#/builders/17/builds/3116, but this appears to be a spurious failure due to a flaky test.	2024-10-22 10:57:47 -04:00
James Chesterman	11c818816d	[AArch64] Improve index selection for histograms (#111150 ) Removes unnecessary extends on the indices passed into histogram instructions. It also removes the instruction when the mask is zero.	2024-10-22 11:14:00 +01:00
Akshat Oke	4e32d7236b	[NewPM][CodeGen] Port LiveRegMatrix to NPM (#109938 )	2024-10-22 15:28:04 +05:30
Akshat Oke	93802815ab	[NewPM][CodeGen] Port VirtRegMap to NPM (#109936 )	2024-10-22 15:15:56 +05:30
Ellis Hoag	e6ada7162e	[regalloc][basic] Change spill weight for optsize funcs (#112960 ) Change the spill weight calculations for `optsize` functions to remove the block frequency multiplier. For those functions, we do not want to consider the runtime cost of spilling, only the codesize cost. I built a large app with the basic and greedy (default) register allocator enabled. \| Regalloc Type \| Uncompressed Size Delta \| Compressed Size Delta \| \| - \| - \| - \| \| Basic \| -303.8 KiB (-0.23%) \| -232.0 KiB (-0.39%) \| \| Greedy \| 159.1 KiB (0.12%) \| 130.1 KiB (0.22%) \| Since I only saw a size win with the basic register allocator, I decided to only change the behavior for that type.	2024-10-21 11:10:50 -07:00
Michael Maitland	6bac41496e	[RISCV][GISEL] Legalize G_INSERT_SUBVECTOR (#108859 ) This code is heavily based on the SelectionDAG lowerINSERT_SUBVECTOR code.	2024-10-21 08:49:13 -04:00
Simon Pilgrim	f0b3b6d15b	[DAG] isConstantIntBuildVectorOrConstantInt - peek through bitcasts (#112710 ) (REAPPLIED) Alter both isConstantIntBuildVectorOrConstantInt + isConstantFPBuildVectorOrConstantFP to return a bool instead of the underlying SDNode, and adjust usage to account for this. Update isConstantIntBuildVectorOrConstantInt to peek though bitcasts when attempting to find a constant, in particular this improves canonicalization of constants to the RHS on commutable instructions. X86 is the beneficiary here as it often bitcasts rematerializable 0/-1 vector constants as vXi32 and bitcasts to the requested type Minor cleanup that helps with #107423 Reapplied after regression fix ba1255def64a9c3c68d97ace051eec76f546eeb0	2024-10-20 14:23:21 +01:00
Simon Pilgrim	ba1255def6	[DAG] Use FoldConstantArithmetic to constant fold (and (ext (and V, c1)), c2) -> (and (ext V), (and c1, (ext c2))) Noticed while triaging the regression from #112710 noticed by @mstorsjo - don't rely on isConstantIntBuildVectorOrConstantInt+getNode to guarantee constant folding (if it fails to constant fold it will infinite loop), use FoldConstantArithmetic instead.	2024-10-20 13:05:23 +01:00
Martin Storsjö	b26df3e463	Revert "[DAG] isConstantIntBuildVectorOrConstantInt - peek through bitcasts (#112710 )" This reverts commit a630771b28f4b252e2754776b8f3ab416133951a. This caused compilation to hang for Windows/ARM, see https://github.com/llvm/llvm-project/pull/112710 for details.	2024-10-20 00:49:16 +03:00
Simon Pilgrim	93ec08d629	[DAG] Move SIGN_EXTEND_INREG constant folding inside FoldConstantArithmetic Update visitSIGN_EXTEND_INREG to call FoldConstantArithmetic instead of getNode.	2024-10-19 20:57:07 +01:00
Thorsten Schütt	d8b17f2fb6	[GlobalISel] Combine G_UNMERGE_VALUES with anyext and build vector (#112370 ) G_UNMERGE_VALUES (G_ANYEXT (G_BUILD_VECTOR)) ag G_UNMERGE_VALUES llvm/test/CodeGen/AArch64/GlobalISel \| grep ANYEXT [ANYEXT] is build vector or shuffle vector Prior art: https://reviews.llvm.org/D87117 https://reviews.llvm.org/D87166 https://reviews.llvm.org/D87174 https://reviews.llvm.org/D87427 ; CHECK-NEXT: [[BUILD_VECTOR2:%[0-9]+]]:_(<8 x s8>) = G_BUILD_VECTOR [[C2]](s8), [[C2]](s8), [[C2]](s8), [[C2]](s8), [[DEF1]](s8), [[DEF1]](s8), [[DEF1]](s8), [[DEF1]](s8) ; CHECK-NEXT: [[ANYEXT1:%[0-9]+]]:_(<8 x s16>) = G_ANYEXT [[BUILD_VECTOR2]](<8 x s8>) ; CHECK-NEXT: [[UV10:%[0-9]+]]:_(<4 x s16>), [[UV11:%[0-9]+]]:_(<4 x s16>) = G_UNMERGE_VALUES [[ANYEXT1]](<8 x s16>) Test: llvm/test/CodeGen/AArch64/GlobalISel/combine-unmerge.mir	2024-10-19 09:41:43 +02:00
Frank Schlimbach	d5746d73ce	eliminating g++ warnings (#105520 ) Eliminating g++ warnings. Mostly declaring "[[maybe_unused]]", adding return statements where missing and fixing casts. @rengolin --------- Co-authored-by: Benjamin Maxwell <macdue@dueutil.tech> Co-authored-by: Renato Golin <rengolin@systemcall.eu>	2024-10-18 21:20:47 +01:00
Jinsong Ji	6c60ead15a	[NFC] Fix Werror=extra warning related to mismatched enum type (#112808 ) This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Fix warnings: llvm-project/llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp: In member function ‘void llvm::AsmPrinter::emitJumpTableSizesSection(const llvm::MachineJumpTableInfo*, const llvm::Function&) const’: llvm-project/llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp:2852:31: error: enumerated and non-enumerated type in conditional expression [-Werror=extra] 2852 \| int Flags = F.hasComdat() ? ELF::SHF_GROUP : 0; \| ~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~	2024-10-18 13:07:37 -04:00
Jay Foad	b7bc1d07d3	[CodeGen] Fix return type of PHI_iterator::getIncomingValue. NFC. This is supposed to match ValT aka Register.	2024-10-18 14:33:45 +01:00
Simon Pilgrim	e1330d96a0	[DAG] visitFMA/FDIV - avoid SDLoc duplication. NFC.	2024-10-18 11:57:41 +01:00
Simon Pilgrim	5c37316b54	[DAG] visitFMA/FMAD - use FoldConstantArithmetic to add missing vector constant folding support	2024-10-18 11:12:06 +01:00
Simon Pilgrim	a630771b28	[DAG] isConstantIntBuildVectorOrConstantInt - peek through bitcasts (#112710 ) Alter both isConstantIntBuildVectorOrConstantInt + isConstantFPBuildVectorOrConstantFP to return a bool instead of the underlying SDNode, and adjust usage to account for this. Update isConstantIntBuildVectorOrConstantInt to peek though bitcasts when attempting to find a constant, in particular this improves canonicalization of constants to the RHS on commutable instructions. X86 is the beneficiary here as it often bitcasts rematerializable 0/-1 vector constants as vXi32 and bitcasts to the requested type Minor cleanup that helps with #107423	2024-10-18 10:52:55 +01:00
Simon Pilgrim	3ec1b1a4dd	[DAG] visitFP_EXTEND - use FoldConstantArithmetic to attempt to constant fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us.	2024-10-18 10:10:44 +01:00
Simon Pilgrim	3a1df05ca9	[DAG] visitFP_ROUND - use FoldConstantArithmetic to attempt to constant fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us.	2024-10-18 10:10:43 +01:00
Simon Pilgrim	7a43be1690	[DAG] visitXROUND - use FoldConstantArithmetic to attempt to constant fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us.	2024-10-18 10:10:43 +01:00
Simon Pilgrim	c72992bf89	[DAG] visitABS - use FoldConstantArithmetic to attempt to constant fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us. Cleanup for #112682	2024-10-18 10:10:43 +01:00
Keith Packard	44b020a381	[PowerPC][ISelLowering] Support -mstack-protector-guard=tls (#110928 ) Add support for using a thread-local variable with a specified offset for holding the stack guard canary value. This supports both 32- and 64- bit PowerPC targets. This mirrors changes from #108942 but targeting PowerPC instead of RISCV. Because both of these PRs modify the same driver functions, this series is stack on top of the RISC-V one. --------- Signed-off-by: Keith Packard <keithp@keithp.com>	2024-10-17 19:06:47 -07:00
Simon Pilgrim	256bbdb3f6	[DAG] visitFCEIL/FTRUNC/FFLOOR/FNEG - use FoldConstantArithmetic to attempt to constant fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us. Cleanup for #112682	2024-10-17 16:53:44 +01:00
Jay Foad	85c17e4092	[LLVM] Make more use of IRBuilder::CreateIntrinsic. NFC. (#112706 ) Convert many instances of: Fn = Intrinsic::getOrInsertDeclaration(...); CreateCall(Fn, ...) to the equivalent CreateIntrinsic call.	2024-10-17 16:20:43 +01:00
gxlayer	4a2bd78f5b	[ARM] Fix -mno-omit-leaf-frame-pointer flag doesn't works on 32-bit ARM (#109628 ) The -mno-omit-leaf-frame-pointer flag works on 32-bit ARM architectures and addresses the bug reported in #108019	2024-10-17 20:25:06 +08:00
Simon Pilgrim	cf046c8717	[DAG] visitSIGN_EXTEND_INREG - avoid SDLoc duplication. NFC.	2024-10-17 12:51:11 +01:00

1 2 3 4 5 ...

36651 Commits