llvm-project

Author	SHA1	Message	Date
Noah Goldstein	9170e38575	Add support for `nneg` flag with `uitofp` As noted when #82404 was pushed (canonicalizing `sitofp` -> `uitofp`), different signedness on fp casts can have dramatic performance implications on different backends. So, it makes sense to create a reliable means for the backend to pick its cast signedness if either is correct. Further, this allows us to start canonicalizing `sitofp` -> `uitofp`, which may ease middle-end analysis. Closes #86141	2024-04-09 18:12:33 -05:00
elhewaty	7d3924cee3	[IR] Add nowrap flags for trunc instruction (#85592 ) This patch adds the nuw (no unsigned wrap) and nsw (no signed wrap) poison-generating flags to the trunc instruction. Discourse thread: https://discourse.llvm.org/t/rfc-add-nowrap-flags-to-trunc/77453	2024-03-29 14:08:49 +08:00
Orlando Cazalet-Hyams	954a048d0d	[RemoveDIs] Fix SimplifyCFG behaviour to match existing behaviour (#82981 ) llvm.dbg.labels are deleted in SpeculativelyExecuteBB so DPLabels should be too. Modify existing test to check this (NB I couldn't find a dedicated debug-info test that checks this behaviour).	2024-02-26 11:44:54 +00:00
Simon Pilgrim	9978f6a10f	[CostModel][X86] Reduce the extra costs for ICMP complex predicates when an operand is constant In most cases, SETCC lowering will be able to simplify/commute the comparison by adjusting the constant. TODO: We still need to adjust ExtraCost based on CostKind Fixes #80122	2024-02-21 16:19:39 +00:00
Simon Pilgrim	c16d0d14de	[SimplifyCFG] Add test coverage for #80122	2024-02-21 16:19:39 +00:00
Orlando Cazalet-Hyams	c302909760	[RemoveDIs] Fix DPValue hoisting in hoistSuccIdenticalTerminatorToSwitchOrIf (#80822 ) Follow up to #79476 - that patch added a call to hoistLockstepIdenticalDPValues which hoists identical DPValues in lockstep, matching dbg intrinsic hoisting behaviour. The code deleted in this patch, which unconditionally hoists DPValues, should have been deleted in that patch. Update test with --try-experimental-debuginfo-iterators to check the behaviour.	2024-02-06 10:45:31 +00:00
Orlando Cazalet-Hyams	ddd95b15d1	[RemoveDIs] Handle DPValues in hoistCommonCodeFromSuccessors (#79476 ) Hoist DPValues attached to each instruction being considered for hoisting if they are identical in lock-step. This includes the final instructions which are considered but not hoisted, because the corresponding dbg.values would appear before those instructions and thus be hoisted if identical. Identical debug records hoisted: llvm/test/Transforms/SimplifyCFG/hoist-dbgvalue.ll Non-identical debug records not hoisted: llvm/test/Transforms/SimplifyCFG/X86/pr39187-g.ll Debug records attached to first not-hoisted instructions are hoisted: llvm/test/Transforms/SimplifyCFG/hoist-dbgvalue-inlined.ll	2024-02-05 10:58:46 +00:00
DianQK	a58dcc5e08	Reland "[SimplifyCFG] Improve the precision of `PtrValueMayBeModified`" This relands commit f890f010f6a70addbd885acd0c8d1b9578b6246f. The result value of `getelementptr inbounds (TY, null, not zero)` is a poison value. We can think of it as undefined behavior.	2024-01-25 06:42:14 +08:00
DianQK	a0c1b5bdda	Reland "[SimplifyCFG] Check if the return instruction causes undefined behavior" This relands commit b6a0be8ce3114d0c57e7a7d6c3c222986ca506ad. Returning undef for a noundef return value is undefined behavior. Example: ``` define noundef i32 @test_ret_noundef(i1 %cond) { entry: br i1 %cond, label %bb1, label %bb2 bb1: br label %bb2 bb2: %r = phi i32 [ undef, %entry ], [ 1, %bb1 ] ret i32 %r } ```	2024-01-25 06:42:14 +08:00
alexfh	2d5cc1c9b3	Revert "[SimplifyCFG] `switch`: Do Not Transform the Default Case if the Condition is Too Wide" (#78469 ) Reverts llvm/llvm-project#77831, which depends on #76669, which seriously regresses compilation time / memory usage see https://github.com/llvm/llvm-project/pull/76669#issuecomment-1889271710.	2024-01-17 19:04:34 +01:00
Qiongsi Wu	39bb790b90	[SimplifyCFG] `switch`: Do Not Transform the Default Case if the Condition is Too Wide (#77831 ) https://github.com/llvm/llvm-project/pull/76669 taught SimplifyCFG to handle switches when `default` has only one case. When the `switch`'s condition is wider than 64 bit, the current implementation can calculate the wrong default value. This PR skips cases where the condition is too wide.	2024-01-12 08:54:35 -05:00
Yingwei Zheng	45be680b1a	[SimplifyCFG] Emit `rotl` directly in `ReduceSwitchRange` (#77603 ) This patch emits `ROTL(Cond, BitWidth - Shift)` directly in `ReduceSwitchRange`. This should give better codegen because `SimplifyDemandedBits` will break the rotation patterns in the original form. See also https://github.com/llvm/llvm-project/pull/73441 and the IR diff https://github.com/dtcxzyw/llvm-opt-benchmark/pull/115/files. This patch should cover most of cases handled by #73441.	2024-01-10 22:57:17 +08:00
Quentin Dian	7d81e07271	[SimplifyCFG] When only one case value is missing, replace default with that case (#76669 ) When the default branch is the last case, we can transform that branch into a concrete branch with an unreachable default branch. ```llvm target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128" target triple = "x86_64-unknown-linux-gnu" define i64 @src(i64 %0) { %2 = urem i64 %0, 4 switch i64 %2, label %5 [ i64 1, label %3 i64 2, label %3 i64 3, label %4 ] 3: ; preds = %1, %1 br label %5 4: ; preds = %1 br label %5 5: ; preds = %1, %4, %3 %.0 = phi i64 [ 2, %4 ], [ 1, %3 ], [ 0, %1 ] ret i64 %.0 } define i64 @tgt(i64 %0) { %2 = urem i64 %0, 4 switch i64 %2, label %unreachable [ i64 0, label %5 i64 1, label %3 i64 2, label %3 i64 3, label %4 ] unreachable: ; preds = %1 unreachable 3: ; preds = %1, %1 br label %5 4: ; preds = %1 br label %5 5: ; preds = %1, %4, %3 %.0 = phi i64 [ 2, %4 ], [ 1, %3 ], [ 0, %1 ] ret i64 %.0 } ``` Alive2: https://alive2.llvm.org/ce/z/Y-PGXv After transform to a lookup table, I believe `tgt` is better code. The final instructions are as follows: ```asm src: # @src and edi, 3 lea rax, [rdi - 1] cmp rax, 2 ja .LBB0_1 mov rax, qword ptr [8*rdi + .Lswitch.table.src-8] ret .LBB0_1: xor eax, eax ret tgt: # @tgt and edi, 3 mov rax, qword ptr [8*rdi + .Lswitch.table.tgt] ret .Lswitch.table.src: .quad 1 # 0x1 .quad 1 # 0x1 .quad 2 # 0x2 .Lswitch.table.tgt: .quad 0 # 0x0 .quad 1 # 0x1 .quad 1 # 0x1 .quad 2 # 0x2 ``` Godbolt: https://llvm.godbolt.org/z/borME8znd Closes #73446.	2024-01-03 09:22:13 +08:00
DianQK	463dad107f	[SimplifyCFG] Regenerate test checks (NFC) Use `UTC_ARGS: --version 4`.	2024-01-01 20:43:03 +08:00
Nikita Popov	d77067d08a	[ValueTracking] Add dominating condition support in computeKnownBits() (#73662 ) This adds support for using dominating conditions in computeKnownBits() when called from InstCombine. The implementation uses a DomConditionCache, which stores which branches may provide information that is relevant for a given value. DomConditionCache is similar to AssumptionCache, but does not try to do any kind of automatic tracking. Relevant branches have to be explicitly registered and invalidated values explicitly removed. The necessary tracking is done inside InstCombine. The reason why this doesn't just do exactly the same thing as AssumptionCache is that a lot more transforms touch branches and branch conditions than assumptions. AssumptionCache is an immutable analysis and mostly gets away with this because only a handful of places have to register additional assumptions (mostly as a result of cloning). This is very much not the case for branches. This change regresses compile-time by about 0.2%. It also improves stage2-O0-g builds by about 0.2%, which indicates that this change results in additional optimizations inside clang itself. Fixes https://github.com/llvm/llvm-project/issues/74242.	2023-12-06 14:17:18 +01:00
Jeremy Morse	d2d9dc8eb4	[DebugInfo][RemoveDIs] Make debugify pass convert to/from RemoveDIs mode (#73251 ) Debugify is extremely useful as a testing and debugging tool, and a good number of LLVM-IR transform tests use it. We need it to support "new" non-instruction debug-info to get test coverage, but it's not important enough to completely convert right now (and it'd be a large undertaking). Thus: convert to/from dbg.value/DPValue mode on entry and exit of the pass, which gives us the functionality without any further work. The cost is compile-time, but again this is only happening during tests. Tested by: the large set of debugify tests enabled here. Note the InstCombine test (cast-mul-select.ll) that hasn't been fully enabled: this is because there's a debug-info sinking piece of code there that hasn't been instrumented.	2023-11-29 13:19:50 +00:00
Craig Topper	d9962c400f	[IR] Add disjoint flag for Or instructions. (#72583 ) This flag indicates that every bit is known to be zero in at least one of the inputs. This allows the Or to be treated as an Add since there is no possibility of a carry from any bit. If the flag is present and this property does not hold, the result is poison. This makes it easier to reverse the InstCombine transform that turns Add into Or. This is inspired by a comment here https://github.com/llvm/llvm-project/pull/71955#discussion_r1391614578 Discourse thread https://discourse.llvm.org/t/rfc-add-or-disjoint-flag/75036	2023-11-24 08:49:19 -08:00
Jeremy Morse	59fab22642	[DebugInfo][RemoveDIs] Support cloning and remapping DPValues (#72546 ) This patch adds support for CloneBasicBlock duplicating the DPValues attached to instructions, and adds facilities to remap them into their new context. The plumbing to achieve this is fairly straightforward and mechanical. I've also added illustrative uses to LoopUnrollRuntime, SimpleLoopUnswitch and SimplifyCFG. The former only updates for the epilogue right now so I've added CHECK lines just for the end of an unrolled loop (further updates coming later). SimpleLoopUnswitch had no debug-info tests so I've added a new one. The two modified parts of SimplifyCFG are covered by the two modified SimplifyCFG tests. These are scenarios where we have to do extra cloning for copying of DPValues because they're no longer instructions, and remap them too.	2023-11-24 15:17:32 +00:00
Jeremy Morse	84f6e1d71c	[DebugInfo] Clone dbg.values in SimplifyCFG like normal instructions (#72526 ) The code in the CloneInstructionsIntoPredec... function modified by this patch has a long history that dates back to 2011, see d715ec82b4ad12c59. There, when folding branches, all dbg.value intrinsics seen when folding would be saved and then re-inserted at the end of whatever was folded. Over the last 12 years this behaviour has been preserved. However, IMO it's bad behaviour. If we have: inst1 dbg.value1 inst2 dbg.value2 And we fold that sequence into a different block, then we would want the instructions and variable assignments to appear in the same order. However because of this old behaviour, the dbg.values are sunk, and we get: inst1 inst2 dbg.value1 dbg.value2 This clustering of dbg.values can make assignments to the same variable invisible, as well as reducing the coverage of other assignments. This patch relaxes the CloneInstructions... function and allows it to clone and update dbg.values in-place, causing them to appear in the original order in the destination block. I've added some extra dbg.values to the updated test: without the changes to the pass, the dbg.values sink into a blob ahead of the select. The RemoveDIs code can't cope with this right now so I've removed the "--try..." flag, restored in a commit to land in a couple of hours. (Metadata changes to make the LLVM-IR parser not drop the debug-info for it being out of date. The RemoveDIs related RUN line has been removed because it was spuriously passing due to the debug-info being dropped).	2023-11-24 13:30:34 +00:00
Jeremy Morse	eaffcc85ea	[DebugInfo][RemoveDIs] Make dropping variable locations explicit (#72399 ) In present-day debug-info, when you delete all instructions, you delete all their debug-info with it because debug-info is stored in instructions. With debug-info stored in DPValue objects however, deleting instructions causes DPValue objects to clump together into a large blob of debug-info that hangs around in the block, as nothing has explicitly deleted it. To restore this behaviour, scatter calls to dropDbgValues around in places that used to delete chunks of dbg.values, for example during stripDebugInfo and in the code that deletes everything after an Unreachable instruction. DCE is another example. The tests with --try... added to them are new scenarios where we can now correctly replicate the "normal" debug-info behaviour. Alas, there's no explicit test for the opt -strip-debug option though (in dbg.value mode or DPValue mode).	2023-11-21 00:01:00 +00:00
Jeremy Morse	f42482def2	[DebugInfo][RemoveDIs] Don't convert debug-intrinsics to Unreachable (#72380 ) It might seem obvious, but it's not a good idea to convert a debug-intrinsic instruction into an UnreachableInst, as this means things operate differently with and without the -g option. However this can happen due to the "mutate the next instruction" API calls we make. With RemoveDIs eliminating debug intrinsics, this behaviour is at risk of changing, hence this patch ensures we only ever mutate the next _non_ debuginfo instruction into an Unreachable. The tests instrumented with the --try... flag all exercise this, I've added some metadata to a SCCP test to ensure it's exercised.	2023-11-20 20:53:24 +00:00
Jeremy Morse	80d3a4c39f	[DebugInfo][RemoveDIs] Add local-utility plumbing for DPValues (#72276 ) This patch re-implements a variety of debug-info maintenance functions to use DPValues instead of DbgValueInst's: supporting the "new" non-intrinsic representation of debug-info. As per [0], we need to have parallel implementations of various utilities for a time, and these are the most fundamental utilities used throughout the compiler. I've added --try-experimental-debuginfo-iterators to a variety of RUN lines: this is a flag that turns on "new debug-info" if it's built into LLVM, and not otherwise. This should ensure that we have the same behaviour for the same IR inputs, but using a different internal representation. For the most part these changes affect SROA/Mem2Reg promotion of dbg.declares into dbg.value intrinsics (now DPValues), we're leaving dbg.declares as instructions until later in the day. There's also some salvaging changes made. I believe the tests that I've added cover almost all the code being updated here. The only thing I'm not confident about is SimplifyCFG, which calls rewriteDebugUsers down a variety of code paths. Those changes can't immediately get full coverage as an additional patch is needed that updates handling of Unreachable instructions, will upload that shortly. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939/9	2023-11-20 16:56:31 +00:00
Daniil	424c4249cc	[SimplifyCFG] Add optimization for switches of powers of two (#70977 ) Optimization reduces the range for switches whose cases are positive powers of two by replacing each case with count_trailing_zero(case). Resolves #70756	2023-11-18 15:14:14 +08:00
Valery Pykhtin	b93ff3e5ce	[SimplifyCFG] Fix uint32_t overflow in cbranch to cbranch merge prevention check. (#72329 ) This fixes https://github.com/llvm/llvm-project/issues/72323. Resulted from `f054947c0d`	2023-11-15 12:07:58 +01:00
Valery Pykhtin	f054947c0d	[SimplifyCFG] Prevent merging cbranch to cbranch if the branch probability from the first to second is too low. (#69375 ) The AMDGPU target has faced a situation that can be illustrated with the following testcase: define void @dont_merge_cbranches(i32 %V) { %divergent_cond = icmp ne i32 %V, 0 %uniform_cond = call i1 @uniform_result(i1 %divergent_cond) br i1 %uniform_cond, label %bb2, label %exit, !prof !0 bb2: br i1 %divergent_cond, label %bb3, label %exit bb3: call void @bar( ) br label %exit exit: ret void } !0 = !{!"branch_weights", i32 1, i32 100000} SimplifyCFG merges the branches on %uniform_cond and %divergent_cond, which is undesirable because the first branch to bb2 is taken extremely rarely and the second branch is expensive. The merged branch becomes as expensive as the second. This patch prevents such merging if the branch to the second branch is unlikely to be taken.	2023-11-13 15:37:55 +01:00
Allen	7ec86f4d68	[SimplifyCFG] Fix the compile crash for invalid upper bound value (#71351 ) Fix the crash introduced by the last landed PR70542. Note: For '%add = add nuw i32 %x, 1', we can only infer that the LowerBound is 1, but the UpperBound wraps to 0 in computeConstantRange, so we can't assume the UpperBound is a valid bound when its value is 0. Fixes https://github.com/llvm/llvm-project/issues/71329. Reviewed By: zmodem, nikic	2023-11-09 12:33:24 +08:00
Hans Wennborg	05ed92127c	Revert "Reland [SimplifyCFG] Delete the unnecessary range check for small mask operation (#70542 )" This caused https://github.com/llvm/llvm-project/issues/71329 > Fix the compile crash when the default result has no result for > https://github.com/llvm/llvm-project/pull/65835 > > Fixes https://github.com/llvm/llvm-project/issues/65120 > Reviewed By: zmodem, nikic This reverts commit 7c4180a36a905b7ed46c09df77af1b65e356f92a.	2023-11-07 10:53:22 +01:00
Allen	7c4180a36a	Reland [SimplifyCFG] Delete the unnecessary range check for small mask operation (#70542 ) Fix the compile crash when the default result has no result for https://github.com/llvm/llvm-project/pull/65835 Fixes https://github.com/llvm/llvm-project/issues/65120 Reviewed By: zmodem, nikic	2023-11-03 09:12:29 +08:00
Nikita Popov	ed3f06b9b3	[IR] Add zext nneg flag (#67982 ) Add an nneg flag to the zext instruction, which specifies that the argument is non-negative. Otherwise, the result is a poison value. The primary use-case for the flag is to preserve information when sext gets replaced with zext due to range-based canonicalization. The nneg flag allows us to convert the zext back into an sext later. This is useful for some optimizations (e.g. a signed icmp can fold with sext but not zext), as well as some targets (e.g. RISCV prefers sext over zext). Discourse thread: https://discourse.llvm.org/t/rfc-add-zext-nneg-flag/73914 This patch is based on https://reviews.llvm.org/D156444 by @Panagiotis156, with some implementation simplifications and additional tests. --------- Co-authored-by: Panagiotis K <karouzakispan@gmail.com>	2023-10-30 09:04:04 +01:00
XChy	fc6bdb8549	[SimplifyCFG] Reland transform for redirecting phis between unmergeable BB and SuccBB (#68473 ) Reland #67275 with #68953 resolved.	2023-10-28 17:10:20 +08:00
Allen	851338b126	Revert "[SimplifyCFG] Delete the unnecessary range check for small mask operation" (#70324 ) This reverts commit 5e07481d4240b5e8fd85f9b92df30849606c2af0.	2023-10-26 20:39:24 +08:00
zhongyunde 00443407	5e07481d42	[SimplifyCFG] Delete the unnecessary range check for small mask operation When the small mask value is less than 64, we can eliminate the check for the upper limit of the range by enlarging the lookup table size to the maximum index value. (The final table size then grows to the next power-of-two value.) ``` bool f(unsigned x) { switch (x % 8) { case 0: return 1; case 1: return 0; case 2: return 0; case 3: return 1; case 4: return 1; case 5: return 0; case 6: return 1; // This would remove the range check: case 7: return 0; } return 0; } ``` Use WouldFitInRegister instead of fitsInLegalInteger to support more result types besides bool. Fixes https://github.com/llvm/llvm-project/issues/65120 Reviewed By: zmodem, nikic, RKSimon	2023-10-26 19:01:22 +08:00
zhongyunde 00443407	925f4622dc	[SimplifyCFG] Precommit tests for PR65835	2023-10-26 18:57:15 +08:00
Muhammad Omair Javaid	431969ede1	Revert "[SimplifyCFG] Transform for redirecting phis between unmergeable BB and SuccBB (#67275 )" This reverts commit fc86d031fec5e47c6811efd3a871742ad244afdd. This change breaks LLVM buildbot clang-aarch64-sve-vls-2stage https://lab.llvm.org/buildbot/#/builders/176/builds/5474 I am going to revert this patch as the bot has been failing for more than a day without a fix.	2023-09-26 15:47:16 +05:00
XChy	fc86d031fe	[SimplifyCFG] Transform for redirecting phis between unmergeable BB and SuccBB (#67275 ) This patch extends function TryToSimplifyUncondBranchFromEmptyBlock to handle the similar cases below. ```llvm define i8 @src(i8 noundef %arg) { start: switch i8 %arg, label %unreachable [ i8 0, label %case012 i8 1, label %case1 i8 2, label %case2 i8 3, label %end ] unreachable: unreachable case1: br label %case012 case2: br label %case012 case012: %phi1 = phi i8 [ 3, %case2 ], [ 2, %case1 ], [ 1, %start ] br label %end end: %phi2 = phi i8 [ %phi1, %case012 ], [ 4, %start ] ret i8 %phi2 } ``` The phis here should be merged into one phi, so that we can better optimize it: ```llvm define i8 @tgt(i8 noundef %arg) { start: switch i8 %arg, label %unreachable [ i8 0, label %end i8 1, label %case1 i8 2, label %case2 i8 3, label %case3 ] unreachable: unreachable case1: br label %end case2: br label %end case3: br label %end end: %phi = phi i8 [ 4, %case3 ], [ 3, %case2 ], [ 2, %case1 ], [ 1, %start ] ret i8 %phi } ``` Proof: [normal](https://alive2.llvm.org/ce/z/vAWi88) [multiple stages](https://alive2.llvm.org/ce/z/DDBQqp) [multiple stages 2](https://alive2.llvm.org/ce/z/nGkeqN) [multiple phi combinations](https://alive2.llvm.org/ce/z/VQeEdp) And lookup table optimization should convert it into add %arg 1. This patch just matches similar CFG structures and merges the phis in the different cases. Maybe such a transform can be applied to other situations besides switch, but I'm not sure whether it's better than not merging. Therefore, I only try it in switch. Related issue: #63876 [Migrated](https://reviews.llvm.org/D155940)	2023-09-25 10:13:45 +08:00
DianQK	d200bd1a7d	Reland "[SimplifyCFG] Hoist common instructions on switch" (#67077 ) This relands commit 96ea48ff5dcba46af350f5300eafd7f7394ba606.	2023-09-22 18:29:59 +08:00
Fangrui Song	9f4c9b90c9	Revert D155711 "[SimplifyCFG] Hoist common instructions on Switch." This reverts commit 96ea48ff5dcba46af350f5300eafd7f7394ba606. The change may cause Verifier.cpp error "musttail call must precede a ret with an optional bitcast"	2023-09-20 11:49:20 -07:00
DianQK	96ea48ff5d	[SimplifyCFG] Hoist common instructions on Switch. Sinking common instructions is not always performance friendly. We need to implement hoisting of common instructions on switch instructions to solve the following problem: ``` define i1 @foo(i64 %a, i64 %b, i64 %c, i64 %d) { start: %test = icmp eq i64 %a, %d br i1 %test, label %switch_bb, label %exit switch_bb: ; preds = %start switch i64 %a, label %bb0 [ i64 1, label %bb1 i64 2, label %bb2 ] bb0: ; preds = %switch_bb %0 = icmp eq i64 %b, %c br label %exit bb1: ; preds = %switch_bb %1 = icmp eq i64 %b, %c br label %exit bb2: ; preds = %switch_bb %2 = icmp eq i64 %b, %c br label %exit exit: ; preds = %bb2, %bb1, %bb0, %start %result = phi i1 [ false, %start ], [ %0, %bb0 ], [ %1, %bb1 ], [ %2, %bb2 ] ret i1 %result } ``` The pre-commit test is D156617. Reviewed By: XChy, nikic Differential Revision: https://reviews.llvm.org/D155711	2023-09-20 07:21:49 +08:00
DianQK	40b0ab287f	[SimplifyCFG] Pre-commit test for extending HoistThenElseCodeToIf. Pre-commit test for D155711. Differential Revision: https://reviews.llvm.org/D156617	2023-09-20 07:21:48 +08:00
Kohei Asano	fef8249220	[SimplifyCFG] handle monotonic wrapped case for D150943 (#65882 )	2023-09-14 21:26:11 +09:00
Nikita Popov	69bd66b3ce	[Tests] Remove some and/or constant expressions in tests (NFC) In preparation for their removal in D158081.	2023-08-21 12:05:32 +02:00
Florian Hahn	b7a95ad467	[SimplifyCFG] Don't sink loads/stores with swifterror pointers. swifterror pointers can only be used as pointer operands of load & store instructions (and as swifterror argument of a call). Sinking loads or stores with swifterror pointer operands would require introducing a select of the pointer operands, which isn't allowed. Check for this condition in canSinkInstructions. Reviewed By: aschwaighofer Differential Revision: https://reviews.llvm.org/D158083	2023-08-17 09:59:07 +01:00
Florian Hahn	5816d2ab28	[SimplifyCFG] Add tests for sinking load/store with swifterror operand. Add test coverage for sinking/hoisting loads/stores with swifterror pointers. Currently this isn't handled correctly by SimplifyCFG and causes a verifier error.	2023-08-16 14:51:29 +01:00
Matt Arsenault	25bc999d1f	Intrinsics: Add type overload to stacksave and stackstore This allows use with non-0 address space stacks. llvm_ptr_ty should never be used. This could use some more percolation up through mlir, but this is enough to fix existing tests. https://reviews.llvm.org/D156666	2023-08-09 18:33:11 -04:00
Johannes Doerfert	fa367d159a	[IR] Mark `llvm.assume` as `memory(inaccessiblemem: write)` It was `inaccessiblemem: readwrite` before, no need for the read. No real benefit is expected but it can help debugging and other efforts. Differential Revision: https://reviews.llvm.org/D156478	2023-07-31 13:44:52 -07:00
Johannes Doerfert	6fa8244eb6	[IR] Mark `llvm.trap` as `memory(inaccessiblemem: write)` Traps will not read/write the program state but they need an effect for preservation, similar to `llvm.assume`. We really want a new memory kind for that (see TODO), but for now `inaccessiblemem: write` is better than any possible effect. Differential Revision: https://reviews.llvm.org/D156476	2023-07-31 13:44:52 -07:00
DianQK	6d55f6d818	[SimplifyCFG] Regenerate test checks (NFC) Remove the following warning. ``` WARNING: Change IR value name 'tmp4' or use --prefix-filecheck-ir-name to prevent possible conflict with scripted FileCheck name. ```	2023-07-30 19:04:06 +08:00
Teresa Johnson	5986559caa	[SimplifyCFG] Guard branch folding by speculate blocks flag Guard FoldBranchToCommonDest in SimplifyCFG with the SpeculateBlocks flag as it can also speculate instructions. This was split out of D155997. Differential Revision: https://reviews.llvm.org/D156194	2023-07-25 06:46:19 -07:00
Nikita Popov	edb2fc6dab	[llvm] Remove explicit -opaque-pointers flag from tests (NFC) Opaque pointers mode is enabled by default, no need to explicitly enable it.	2023-07-12 14:35:55 +02:00
Nikita Popov	bb3763e497	Revert "[SimplifyCFG] Allow dropping block that only contains ephemeral values" This reverts commit 20f0c68fd83a0147a8ec1722bd2e848180610288. https://reviews.llvm.org/D153966#4464594 reports an optimization regression in Rust. Additionally this change has caused an unexpected 0.3% compile-time regression.	2023-06-30 21:24:05 +02:00


1133 Commits