llvm-project

Author	SHA1	Message	Date
Paul Walker	235cea720c	[NFC][LLVM] Refactor rounding mode detection of constrained fp intrinsic IDs (#90854 ) I've refactored the code to genericise the implementation to better allow for target specific constrained fp intrinsics.	2024-05-07 11:23:55 +01:00
Quentin Colombet	6ce04747cf	[SDISel] Teach the type legalizer about ADDRSPACECAST (#90969 ) Vectorized ADDRSPACECASTs were not supported by the type legalizer. This patch adds the support for: - splitting the vector result: <2 x ptr> => 2 x <1 x ptr> - scalarization: <1 x ptr> => ptr - widening: <3 x ptr> => <4 x ptr> This is all exercised by the added NVPTX tests.	2024-05-07 11:08:33 +02:00
Thorsten Schütt	b42f553af5	[GlobalIsel] Combine extract vector element (#90339 ) look through shuffle vectors	2024-05-07 07:12:58 +02:00
Simon Pilgrim	522b4bfe5b	[DAG] Fold bitreverse(shl/srl(bitreverse(x),y)) -> srl/shl(x,y) (#89897 ) Noticed while investigating GFNI per-element vector shifts (we can form SHL but not SRL/SRA) Alive2: https://alive2.llvm.org/ce/z/fSH-rf	2024-05-06 11:13:05 +01:00
Phoebe Wang	02dfbbff19	[SelectionDAG] Make ARITH_FENCE support half and bfloat type (#90836 )	2024-05-05 13:08:34 +08:00
David Blaikie	004485690e	Revert "llvm/lib/CodeGen/TargetSchedule.cpp:132:12: warning: Assert statement modifies 'NIter'" (#91079 ) Reverts llvm/llvm-project#90982 NIter was only declared in !NDEBUG, and only used for assertions - so it was correct that it was incremented inside the assertion. (& in fact now the non-asserts build fails, because the variable is incremented even though it isn't declared)	2024-05-04 11:43:08 -07:00
akshaykumars614	18d1df4633	llvm/lib/CodeGen/TargetSchedule.cpp:132:12: warning: Assert statement modifies 'NIter' (#90982 ) Modified the assert statement	2024-05-04 14:16:02 -04:00
Simon Pilgrim	caacf8685a	[DAG] Fold freeze(shuffle(x,y,m)) -> shuffle(freeze(x),freeze(y),m) (#90952 ) If the shuffle mask contains no undef elements, then we can move the freeze through a shuffle node. This requires special case handling to create a new ShuffleVectorSDNode. Includes VECTOR_SHUFFLE support for isGuaranteedNotToBeUndefOrPoison / canCreateUndefOrPoison.	2024-05-04 12:03:10 +01:00
Craig Topper	3563af6c06	[DAGCombiner] In mergeTruncStore, make sure we aren't storing shifted in bits. (#90939 ) When looking through a right shift, we need to make sure that all of the bits we are using from the shift come from the shift input and not the sign or zero bits that are shifted in. Fixes #90936.	2024-05-03 09:59:33 -07:00
Matt Arsenault	edbe6ebb4d	SystemZ: Don't promote atomic store in IR (#90899 ) This is the mirror to the recent atomic load change. The same bitcast-back-to-integer case is a small code quality regression for the same reason. This would disappear with a bitcastable legal 128-bit type.	2024-05-03 10:04:12 +02:00
Youngsuk Kim	9d4575c910	[llvm] Make lambda take const reference to prevent unneeded copy (NFC) Closes #89198	2024-05-02 15:34:03 -05:00
Matt Arsenault	b6d24cb018	DAG: Implement softening for fp atomic load (#90839 )	2024-05-02 13:38:37 +02:00
Matt Arsenault	d9fc5babb9	DAG: Implement softening for fp atomic store (#90840 ) This will prevent SystemZ test regressions in a future change, tested by #90826	2024-05-02 12:08:52 +02:00
zxc12523	171aeb20ad	[DAG] SelectionDAG.computeKnownBits - add NSW/NUW flags support to ISD::SHL handling (#89877 ) fix #89414	2024-05-02 10:31:56 +01:00
Nikita Popov	d484c4d350	[InterleavedLoadCombine] Bail out on non-byte-sized vector element type (#90705 ) Vectors are always tightly packed, and elements of non-byte-sized type usually do not have a well-defined (byte) offset. Fixes https://github.com/llvm/llvm-project/issues/90695.	2024-05-02 09:38:09 +09:00
David Tellenbach	cf2f32c97f	[MIR] Serialize MachineFrameInfo::isCalleeSavedInfoValid() (#90561 ) In case of functions without a stack frame no "stack" field is serialized into MIR which leads to isCalleeSavedInfoValid being false when reading a MIR file back in. To fix this we should serialize MachineFrameInfo::isCalleeSavedInfoValid() into MIR.	2024-05-01 10:07:51 -07:00
Matt Arsenault	39e24bdd8e	MachineLICM: Allow hoisting REG_SEQUENCE (#90638 )	2024-05-01 16:52:04 +02:00
Jake Egan	8cde1cfc60	[AIX] Add git revision to .file string (#88164 ) If `LLVM_APPEND_VC_REV` is on, add the git revision to the `.file` string. The revision can be set with `LLVM_FORCE_VC_REVISION`. Before: `.file "git_revision.cpp",,"LLVM version 19.0.0git"` After: `.file "git_revision.cpp",,"LLVM version 19.0.0git (LLVM_REVISION)"`	2024-04-30 20:37:35 -04:00
Craig Topper	a03eeb0e98	[SelectionDAG][X86] Add a NoWrap flag to SelectionDAG::isAddLike. NFC (#90681 ) If this flag is set, Xor will not be considered AddLike. If an Xor were treated as an Add it may wrap. If we can prove there would be no carry out and thus no wrap, the Xor would be turned into a disjoint Or by DAGCombine. Use this new flag to fix a bug in X86 where an Xor is incorrectly being treated as an NUWAdd. Fixes #90668.	2024-04-30 16:52:56 -07:00
Amara Emerson	19f4d68252	[GlobalISel] Fix store merging incorrectly classifying an unknown index expr as 0. (#90375 ) During analysis, we incorrectly leave the offset part of an address info struct as zero, when in actual fact we failed to decompose it into base + offset. This results in incorrectly assuming that the address is adjacent to another store addr. To fix this we wrap the offset in an optional<> so we can distinguish between real zero and unknown. Fixes issue #90242	2024-04-30 14:42:14 -07:00
Craig Topper	267329d7e0	[LegalizeDAG] Simplify interface to PromoteReduction. NFC Return an SDValue instead of pushing to the Results vector. Let the caller do the push.	2024-04-30 09:48:41 -07:00
Simon Pilgrim	91c52b966a	[DAG] Pull out repeated SDLoc() from SHL/SRL/SRA combines. NFC. We were always calling SDLoc(N) at the top of each visitSHL/SRL/SRA for the FoldConstantArithmetic call, so just reuse this as much as possible.	2024-04-30 17:30:43 +01:00
Min-Yih Hsu	539f626ecd	[VP][RISCV] Add vp.cttz.elts intrinsic and its RISC-V codegen (#90502 ) This intrinsic is the VP version of `experimental.cttz.elts`.	2024-04-30 09:27:10 -07:00
Matt Arsenault	114a59d4d3	MachineLICM: Remove unnecessary isReg checks COPY operands are always registers.	2024-04-30 17:44:45 +02:00
Luke Lau	5e03c0af47	[DAGCombiner] Fix mayAlias not accounting for scalable MMOs with offsets (#90573 ) In #70452 DAGCombiner::mayAlias was taught to handle scalable sizes, but when it checks via AA->isNoAlias it didn't take into account the case where the size is scalable but there was an offset too. For the fixed length case the offset was just accounted for by adding to the LocationSize, but for the scalable case there doesn't seem to be a way to represent both a scalable and fixed part in it. So this patch works around it by bailing if there is an offset. Fixes #90559	2024-04-30 20:20:40 +08:00
Craig Topper	705636a113	[SelectionDAG][RISCV] Move VP_REDUCE* legalization to LegalizeDAG.cpp. (#90522 ) LegalizeVectorType is responsible for legalizing nodes that perform an operation on each element and may need to scalarize. This is not true for nodes like VP_REDUCE.*, BUILD_VECTOR, SHUFFLE_VECTOR, EXTRACT_SUBVECTOR, etc. This patch drops any nodes with a scalar result from LegalizeVectorOps and handles them in LegalizeDAG instead. This required moving the reduction promotion to LegalizeDAG. I have removed the support for integer promotion as it was incorrect for integer min/max reductions. Since it was untested, it was best to assert on it until it was really needed. There are a couple of regressions that can be fixed with a small DAG combine which I will do as a follow up.	2024-04-29 22:44:24 -07:00
paperchalice	6ea0c0a283	[NewPM][CodeGen] Add `MachineFunctionAnalysis` (#88610 ) In new pass system, `MachineFunction` could be an analysis result again, machine module pass can now fetch them from analysis manager. `MachineModuleInfo` no longer owns them. Remove `FreeMachineFunctionPass`, replaced by `InvalidateAnalysisPass<MachineFunctionAnalysis>`. Now `FreeMachineFunction` is replaced by `InvalidateAnalysisPass<MachineFunctionAnalysis>`, the workaround in `MachineFunctionPassManager` is no longer needed, there is no difference between `unittests/MIR/PassBuilderCallbacksTest.cpp` and `unittests/IR/PassBuilderCallbacksTest.cpp`.	2024-04-30 09:54:48 +08:00
Kazu Hirata	ae7ce1c6e7	[CodeGen] Remove extraneous ArrayRef (NFC) (#90423 ) We don't need to explicitly create an instance of ArrayRef here because getIndexedOffsetInType takes ArrayRef, and ArrayRef can be implicitly constructed from a C array.	2024-04-29 16:30:57 -07:00
Bjorn Pettersson	55c6bda01e	Revert "Revert "[SelectionDAG] Handle more opcodes in canCreateUndefOrPoison (#84921 )" and more..." This reverts commit 16bd10a38730fed27a3bf111076b8ef7a7e7b3ee. Re-applies: b3c55b707110084a9f50a16aade34c3be6fa18da - "[SelectionDAG] Handle more opcodes in canCreateUndefOrPoison (#84921)" 8e2f6495c0bac1dd6ee32b6a0d24152c9c343624 - "[DAGCombiner] Do not always fold FREEZE over BUILD_VECTOR (#85932)" 73472c5996716cda0dbb3ddb788304e0e7e6a323 - "[SelectionDAG] Treat CopyFromReg as freezing the value (#85932)" with a fix in DAGCombiner::visitFREEZE.	2024-04-29 13:08:52 +02:00
chuongg3	bf57d2e57c	[AArch64][GlobalISel] Enable computeNumSignBits for G_XOR, G_AND, G_OR (#89896 )	2024-04-29 10:53:30 +01:00
Maciej Gabka	bfc0317153	Move several vector intrinsics out of experimental namespace (#88748 ) This patch is moving out following intrinsics: * vector.interleave2/deinterleave2 * vector.reverse * vector.splice from the experimental namespace. All these intrinsics exist in LLVM for more than a year now, and are widely used, so should not be considered as experimental.	2024-04-29 10:16:45 +01:00
David Spickett	16bd10a387	Revert "[SelectionDAG] Handle more opcodes in canCreateUndefOrPoison (#84921 )" and more... This reverts: b3c55b707110084a9f50a16aade34c3be6fa18da - "[SelectionDAG] Handle more opcodes in canCreateUndefOrPoison (#84921)" (because it updates a test case that I don't know how to resolve the conflict for) 8e2f6495c0bac1dd6ee32b6a0d24152c9c343624 - "[DAGCombiner] Do not always fold FREEZE over BUILD_VECTOR (#85932)" 73472c5996716cda0dbb3ddb788304e0e7e6a323 - "[SelectionDAG] Treat CopyFromReg as freezing the value (#85932)" Due to a test suite failure on AArch64 when compiling for SVE. https://lab.llvm.org/buildbot/#/builders/197/builds/13955 clang: ../llvm/llvm/include/llvm/CodeGen/ValueTypes.h:307: MVT llvm::EVT::getSimpleVT() const: Assertion `isSimple() && "Expected a SimpleValueType!"' failed.	2024-04-29 09:47:41 +01:00
Yingwei Zheng	ab12bba0aa	[CGP] Drop poison-generating flags after hoisting (#90382 ) See the following case: ``` define i8 @src1(i8 %x) { entry: %cmp = icmp eq i8 %x, -1 br i1 %cmp, label %exit, label %if.then if.then: %inc = add nuw nsw i8 %x, 1 br label %exit exit: %retval = phi i8 [ %inc, %if.then ], [ -1, %entry ] ret i8 %retval } define i8 @tgt1(i8 %x) { entry: %inc = add nuw nsw i8 %x, 1 %0 = icmp eq i8 %inc, 0 br i1 %0, label %exit, label %if.then if.then: ; preds = %entry br label %exit exit: ; preds = %if.then, %entry %retval = phi i8 [ %inc, %if.then ], [ -1, %entry ] ret i8 %retval } ``` `optimizeBranch` converts `icmp eq X, -1` into cmp to zero on RISC-V and hoists the add into the entry block. Poison-generating flags should be dropped as they don't still hold. Proof: https://alive2.llvm.org/ce/z/sP7mvK Fixes https://github.com/llvm/llvm-project/issues/90380	2024-04-29 15:51:49 +08:00
Qiu Chaofan	4a8f2f2e1a	[Legalizer] Expand fmaximum and fminimum (#67301 ) According to langref, llvm.maximum/minimum has -0.0 < +0.0 semantics and propagates NaN. Expand the nodes on targets not supporting the operation, by adding extra check for NaN and using is_fpclass to check zero signs.	2024-04-29 15:09:54 +08:00
Björn Pettersson	b3c55b7071	[SelectionDAG] Handle more opcodes in canCreateUndefOrPoison (#84921 ) [SelectionDAG] Handle more opcodes in canCreateUndefOrPoison Handle SELECT_CC similarly as SETCC. Handle these operations that only propagate poison/undef based on the input operands: SADDSAT, UADDSAT, SSUBSAT, USUBSAT, MULHU, MULHS, SMIN, SMAX, UMIN, UMAX These operations may create poison based on shift amount and exact flag being violated: SRL, SRA One goal here is to allow pushing freeze through these operations when allowed, as well as letting analyses such as isGuaranteedNotToBeUndefOrPoison to not break on such operations. Since some problems have been observed with pushing freeze through SRA/SRL we block that explicitly in DAGCombiner::visitFreeze now. That way we can still model SRA/SRL properly in SelectionDAG::canCreateUndefOrPoison, e.g. when used by isGuaranteedNotToBeUndefOrPoison, even if we do not want to push freeze through those instructions.	2024-04-29 07:56:49 +02:00
Thorsten Schütt	bc349cea7a	[GlobalIsel] combine insert vector element (#89363 ) preliminary steps poison symbols	2024-04-27 08:39:35 +02:00
Matt Arsenault	405c018c71	DAG: Simplify demanded bits for truncating atomic_store (#90113 ) It's really unfortunate that STORE and ATOMIC_STORE are separate opcodes. This duplicates a basic simplify demanded for the truncating case. This avoids some AMDGPU lit regressions in a future patch. I'm not sure how to craft a test that exposes this without first introducing the regressions by promoting half to i16.	2024-04-26 15:21:44 +02:00
Simon Pilgrim	55d85c84ac	[DAG] visitORCommutative - fold build_pair(not(x),not(y)) -> not(build_pair(x,y)) style patterns (#90050 ) (Sorry, not an actual build_pair node just a similar pattern). For cases where we're concatenating 2 integers into a double width integer, see if both integer sources are NOT patterns. We could take this further and handle all logic ops with a constant operands, but I just wanted to handle the case reported on #89533 initially. Fixes #89533	2024-04-26 14:11:03 +01:00
Bjorn Pettersson	8e2f6495c0	[DAGCombiner] Do not always fold FREEZE over BUILD_VECTOR (#85932 ) Avoid turning a BUILD_VECTOR that can be recognized as "all zeros", "all ones" or "constant" into something that depends on freeze(undef), as that would destroy those properties. Instead we replace undef by 0/-1 in such vectors, making it possible to fold away the freeze. We typically use -1 if the BUILD_VECTOR would identify as "all ones", and otherwise we use the value 0.	2024-04-26 13:41:21 +02:00
Bjorn Pettersson	73472c5996	[SelectionDAG] Treat CopyFromReg as freezing the value (#85932 ) The description of CopyFromReg in ISDOpcodes.h says that the input value is defined outside the scope of the current SelectionDAG. I think that means that we basically can treat it as a FREEZE in the sense that it can be seen as neither being undef nor poison. Being able to fold freeze(CopyFromReg) into CopyFromReg seems useful to avoid regressions if we start to introduce freeze instruction in DAGCombiner/foldBoolSelectToLogic, e.g. to solve https://github.com/llvm/llvm-project/issues/84653 Things _not_ dealt with in this patch: - Depending on calling convention an input argument can be passed also on the stack and not in a register. If it is allowed to treat an argument received in a register as not being poison, then I think we want to treat arguments received on the stack the same way. But then we need to attribute load instructions, or add explicit FREEZE when lowering formal arguments. - A common pattern is that there is an AssertZext or AssertSext just after CopyFromReg. I think that if we treat CopyFromReg as never being poison, then it should be allowed to fold (freeze(AssertZext(CopyFromReg))) -> AssertZext(CopyFromReg))	2024-04-26 13:41:21 +02:00
Matt Arsenault	f1112ebe07	AMDGPU: Do not bitcast atomic load in IR (#90060 ) These hooks should be removed. This is a trivial legalization transform the legalizer needs to support. The IR just complicates things, and it was losing metadata. Implement the DAG promotion support, and switch AMDGPU over to using it. Really we'd be a lot better off merging ATOMIC_LOAD and LOAD like GlobalISel does.	2024-04-26 12:20:40 +02:00
Fangrui Song	5a12f2867a	LLVM_FALLTHROUGH => [[fallthrough]]. NFC	2024-04-25 17:50:59 -07:00
Philip Reames	f4e3daa562	[DAG] Early exit for flags in canCreateUndefOrPoison [nfc] (#89834 ) This matches the style used in the Analysis version of this routine, and makes it less likely we'll miss a poison generating flag in future changes. Unlike IR, the check for poison generating flags doesn't need to switch over opcode since all nodes have the SDFlags storage.	2024-04-25 09:12:59 -07:00
Alex Bradbury	1c8410a67d	[CodeGenPrepare] Preserve flags (such as nsw/nuw) in SinkCast (#89904 ) As demonstrated in the test change, when deciding to sink a trunc we were losing its flags. This patch moves to cloning the original instruction instead.	2024-04-25 15:05:07 +01:00
Simon Pilgrim	d51a17f684	[DAG] visitORCommutative - pull out repeated SDLoc(). NFC.	2024-04-25 14:23:36 +01:00
AtariDreams	13188bcd9f	[GlobalISel]: Simplify udiv lowering by determining known zeros (#89678 )	2024-04-24 22:14:02 +02:00
Craig Topper	c5dcb5239e	[SelectionDAG] Move GlobalAddressSDNode and AddrSpaceCastSDNode constructors into header. NFC These constructors are no more complicated than any of the other *SDNode constructors that are already in the header.	2024-04-24 13:11:57 -07:00
Craig Topper	fc538b070d	[SelectionDAG] Pass SDVTList instead of VTs to SDNode constructors. NFC (#89880 ) All of these constructors were creating a SDVTList using an EVT created by SDNode::getValueTypeList. This EVT needs to live at least as long as the SDNode that uses it. To do this, SDNode::getValueTypeList contains several function scoped static variables that hold the memory for the EVT. So the EVT lives until global destructors run. This is problematic since an EVT contains a Type* that points to memory allocated by an LLVMContext. If multiple LLVMContexts are used that don't have overlapping lifetimes, we can end up with stale or incorrect pointers cached in the EVTs owned by SDNode::getValueTypeList. I want to try to make the EVTs be owned by SelectionDAG instead. This is already done for SDVTLists with more than 1 VT. The single value case is a very old optimization that should be re-evaluated. In order to do this, I need the SDVTLists to be created by SelectionDAG rather than by the SDNode itself. This patch doesn't change how the allocation is done yet. It just moves the code around. This patch does reduce the number of calls to getVTList since we now share with the call needed for the SDNode FoldingSet. Part of fixing #88233.	2024-04-24 12:31:14 -07:00
Matt Arsenault	a45eb62877	AtomicExpand: Fix dropping a syncscope when bitcasting atomicrmw	2024-04-24 19:09:34 +02:00
Matt Arsenault	50082d64e6	DAG: Fix widening of fptrunc_round vectors (#89918 )	2024-04-24 16:21:40 +02:00
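
Note on commit 522b4bfe5b: the fold bitreverse(shl/srl(bitreverse(x),y)) -> srl/shl(x,y) can be sanity-checked by brute force on a small bit width. The sketch below is not LLVM code and is independent of the Alive2 proof linked in that commit; the helper names `bitreverse` and `check_fold` are hypothetical, and the check assumes fixed-width unsigned values with shift amounts strictly less than the width.

```python
def bitreverse(x: int, width: int = 8) -> int:
    """Reverse the low `width` bits of x."""
    result = 0
    for i in range(width):
        if x & (1 << i):
            result |= 1 << (width - 1 - i)
    return result


def check_fold(width: int = 8) -> bool:
    """Exhaustively check both directions of the fold for one bit width."""
    mask = (1 << width) - 1
    for x in range(1 << width):
        rev = bitreverse(x, width)
        for y in range(width):  # only in-range shift amounts
            shl = (rev << y) & mask  # truncating left shift
            srl = rev >> y           # logical right shift
            # bitreverse(shl(bitreverse(x), y)) should equal srl(x, y)
            if bitreverse(shl, width) != x >> y:
                return False
            # bitreverse(srl(bitreverse(x), y)) should equal shl(x, y)
            if bitreverse(srl, width) != (x << y) & mask:
                return False
    return True
```

Calling `check_fold(8)` walks every 8-bit value and every shift amount from 0 to 7. It only covers the logical shifts named in the commit title, not arithmetic right shift.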


35711 Commits