llvm-project

Author	SHA1	Message	Date
Martin Sebor	5ccfd5f6d4	[SimplifyLibCalls] Optimize memchr() with known char+str and unknown length If both the character and string are known, but the length potentially isn't, we can optimize the memchr() call to a select of either the known position of the character or null. Split off from https://reviews.llvm.org/D122836.	2022-04-04 11:01:33 +02:00
Martin Sebor	d18991debf	[SimplifyLibCalls] Fold memchr() with size 1 If the memchr() size is 1, then we can convert the call into a single-byte comparison. This works even if both the string and the character are unknown. Split off from https://reviews.llvm.org/D122836.	2022-04-04 10:41:20 +02:00
Martin Sebor	0f08875744	[InstCombine] Add additional memchr test (NFC) And fix some test names / comments.	2022-04-04 10:41:20 +02:00
Hirochika Matsumoto	f138a9964b	Reapply "[InstSimplify][NFC] Add baseline tests for folds of icmp with ctpop" This change was previously reverted because I forgot rerunning update_test_checks.py and tests were not actually baseline. Extracted from: https://reviews.llvm.org/D122757	2022-04-03 22:07:04 +09:00
Dávid Bolvanský	f02a0a69af	[NFCI] Fixed missing colon in CHECK directives	2022-04-03 11:52:38 +02:00
Alexander Shaposhnikov	6cf10b7e6e	[InstCombine] Fold srem(X, PowerOf2) == C into (X & Mask) == C for positive C This diff extends InstCombinerImpl::foldICmpSRemConstant to handle the cases srem(X, PowerOf2) == C and srem(X, PowerOf2) != C for positive C. This addresses the issue https://github.com/llvm/llvm-project/issues/54650 Differential revision: https://reviews.llvm.org/D122942 Test plan: make check-all	2022-04-03 03:57:05 +00:00
Alexander Shaposhnikov	911cfcd7f5	[InstCombine][NFC] Add baseline tests for folds of srem(X, PowerOf2) == C Extracted from: https://reviews.llvm.org/D122942 Test plan: make check-all	2022-04-03 03:26:47 +00:00
Sanjay Patel	5f8c2b884d	[InstCombine] limit icmp fold with sub if other sub user is a phi This is a hacky fix for: https://github.com/llvm/llvm-project/issues/54558 As discussed there, codegen regressed when we opened up this transform to allow extra uses ( 61580d0949fd3465 ), and it's not clear how to undo the transforms at the later stage of compilation. As noted in the code comments, there's a set of remaining folds that are still limited to one-use, so we can try harder to refine and expand the limitations on these folds, but it's likely to be an up-and-down battle as we find and overcome similar regressions. Differential Revision: https://reviews.llvm.org/D122909	2022-04-02 19:23:42 -04:00
Sanjay Patel	97ac0cd6c4	[InstCombine] fold fcmp with lossy casted constant (2nd try) This is a retry of 9397bdc67eb2 - that was reverted until we had a clang warning in place to alert users about a possible mistake in source. The warning was added with ab982eace6e4. This is noted as a missing clang warning in #54222, but it is also a missing optimization opportunity. Alive2 proofs: https://alive2.llvm.org/ce/z/Q8drDq https://alive2.llvm.org/ce/z/pE6LRt I don't see a single conversion for all predicates using "getFCmpCode" logic, so other predicates are left as a TODO item.	2022-04-02 19:23:01 -04:00
Roman Lebedev	308ca349cb	[InstCombine] Fold `(X \| C2) ^ C1 --> (X & ~C2) ^ (C1^C2)` These two are equivalent, and i think the `and` form is more-ish canonical. General proof: https://alive2.llvm.org/ce/z/RrF5s6 If constant on the (outer) `xor` is an `undef`, the whole lane is dead: https://alive2.llvm.org/ce/z/mu4Sh2 However, if the constant on the (inner) `or` is an `undef`, we must sanitize it first: https://alive2.llvm.org/ce/z/MHYJL7 I guess, producing a zero `and`-mask is optimal in that case. alive-tv is happy about the entirety of `xor-of-or.ll`.	2022-04-03 00:12:56 +03:00
Roman Lebedev	3ae08dac8f	[NFC][InstCombine] Autogenerate check lines in a test affected by the future change	2022-04-03 00:12:56 +03:00
Roman Lebedev	b3fca02a6d	[NFC][InstCombine] Add some tests for `(X \| C2) ^ C1` pattern	2022-04-03 00:12:48 +03:00
Hirochika Matsumoto	f65c78a094	Revert "[InstSimplify][NFC] Add baseline tests for folds of icmp with ctpop" This reverts commit b48abeea44ac3c7860b13b863210116e8db1d978. Accidentally added already optimized tests, not baseline tests.	2022-04-03 02:27:59 +09:00
Hirochika Matsumoto	b48abeea44	[InstSimplify][NFC] Add baseline tests for folds of icmp with ctpop Extracted from: https://reviews.llvm.org/D122757	2022-04-03 02:19:24 +09:00
Sanjay Patel	2c6f78dc2c	[InstCombine] add tests for icmp with sub with multiple uses; NFC Issue #54558	2022-04-01 13:39:24 -04:00
Martin Sebor	884d7c60f3	[InstCombine] Add additional tests for strlen/strnlen (NFC) Taken from D122686.	2022-04-01 16:58:38 +02:00
Martin Sebor	371d2ed3f3	[InstCombine] Add additional memchr tests (NFC)	2022-04-01 12:16:03 +02:00
Hirochika Matsumoto	a3cffc1150	[InstCombine] Fold (ctpop(X) == 1) \| (X == 0) into ctpop(X) < 2 https://alive2.llvm.org/ce/z/94yRMN Fixes #54177 Differential Revision: https://reviews.llvm.org/D122077	2022-03-29 11:30:06 -04:00
Johannes Doerfert	bb0b23174e	[InstCombineCalls] Optimize call of bitcast even w/ parameter attributes Before we gave up if a call through bitcast had parameter attributes. Interestingly, we allowed attributes for the return value already. We now handle both the same way, namely, we drop the ones that are incompatible with the new type and keep the rest. This cannot cause "more UB" than initially present. Differential Revision: https://reviews.llvm.org/D119967	2022-03-28 20:57:52 -05:00
chenglin.bi	9a53793ab8	[InstCombine] Fold two select patterns into and-or select (~a \| c), a, b -> and a, (or c, b) https://alive2.llvm.org/ce/z/bnDobs select (~c & b), a, b -> and b, (or a, c) https://alive2.llvm.org/ce/z/k2jJHJ Differential Revision: https://reviews.llvm.org/D122152	2022-03-28 16:07:55 -04:00
chenglin.bi	7cc48026bd	[InstCombine] add baseline tests for logical and/or folds; NFC Extracted from D122152	2022-03-27 09:55:55 -04:00
Hirochika Matsumoto	ebaa28e075	[InstCombine] add baseline tests for fold of ctpop + icmp; NFC Extracted from D122077.	2022-03-27 09:11:20 -04:00
Simon Pilgrim	6a094a6264	[InstCombine] SimplifyDemandedUseBits - remove ashr node if we only demand known sign bits We already do this for SelectionDAG, but we're missing it here. Noticed while re-triaging PR21929 Differential Revision: https://reviews.llvm.org/D122340	2022-03-25 15:39:08 +00:00
Johannes Doerfert	a81fff8afd	Reapply "[Intrinsics] Add `nocallback` to the default intrinsic attributes" This reverts commit c5f789050daab25aad6770790987e2b7c0395936 and reapplies 7aea3ea8c3b33c9bb338d5d6c0e4832be1d09ac3 with additional test changes.	2022-03-25 09:36:50 -05:00
Sanjay Patel	65d4354149	[InstCombine] add more tests for nsw propagation; NFC Follow-up as suggested with b6efd2510a1efaf2 - show a couple of examples of general subtraction (the earlier patch was all negations).	2022-03-24 14:40:59 -04:00
Sanjay Patel	5dbb53b1b4	[InstCombine] merge shuffled vector negate and multiply Add the "(0 - X) --> (X * -1)" reverse identity to the list of alternate form binops. We need a little hack to make the existing logic work because it does not expect to move constants from op0 to op1, but the code comment hopefully makes that clear. I don't think there are any other identities like that. Fixes #54364 Differential Revision: https://reviews.llvm.org/D122390	2022-03-24 10:25:16 -04:00
Sanjay Patel	b6efd2510a	[InstCombine] add tests for nsw propagation; NFC These are based on tests that were included in the abandoned D122299. Comments indicate what should or should not happen if we change behavior in Negator.	2022-03-23 15:31:25 -04:00
chenglin.bi	52f323d0f1	[InstCombine] Fold abs of known negative operand when source is sub When abs source comes from (x - y), check if a "x > y" dominating condition exists. Fixes #54132 Differential Revision: https://reviews.llvm.org/D122013	2022-03-23 15:21:33 -04:00
Simon Pilgrim	b75399a5e2	[InstCombine] Add some initial SimplifyDemandedBits tests for removal of ashr with sufficient signbits We have this in SelectionDAG but it's missing in InstCombine Based off PR21929 test case	2022-03-23 19:07:10 +00:00
Sanjay Patel	0fcff69bcb	[InstCombine] try to narrow shifted bswap-of-zext (2nd try) The first attempt at this missed a validity check. This version includes a test of the narrow source type for modulo-16-bits. Original commit message: This is the IR counterpart to 370ebc9d9a573d6 which provided a bswap narrowing fix for issue #53867. Here we can be more general (although I'm not sure yet what would happen for illegal types in codegen - too rare to worry about?): https://alive2.llvm.org/ce/z/3-CPfo This will be more effective if we have moved the shift after the bswap as proposed in D122010, but it is independent of that patch. Differential Revision: https://reviews.llvm.org/D122166	2022-03-23 11:28:37 -04:00
Sanjay Patel	87f3ebd505	[InstCombine] add test for bogus bswap; NFC This is reduced from a crash caused by D122166.	2022-03-23 11:28:37 -04:00
Sanjay Patel	af5dfc190f	[InstCombine] add tests for shuffle of mismatched binops; NFC	2022-03-23 07:51:09 -04:00
Nathan Chancellor	4e0008dcbe	Revert "[InstCombine] try to narrow shifted bswap-of-zext" This reverts commit 9e9bda2e8f5b88715bad767a4b7740df32b040d2. This causes a backend error when building the Linux kernel for arm64. See https://reviews.llvm.org/D122166 for a simplified reproducer.	2022-03-22 17:32:33 -07:00
Philip Reames	7abefc4222	[instcombine] Fold away memset/memmove from otherwise unused alloca The motivation for this is that while both memcpyopt and dse will catch this case, both are limited by MSSA's walk back threshold when finding clobbers. As such, if you have a memcpy of an otherwise dead alloca placed towards the end of a long basic block with lots of other memory instructions, it would be missed. This is a bit undesirable for such an "obviously" useless bit of code. As noted in comments, we should probably generalize instcombine's escape analysis peephole (see visitAllocInst) to allow read xor write. Doing that would subsume this code in a more general way, but is also a more involved change. For the moment, I went with the easiest fix.	2022-03-22 13:48:48 -07:00
Philip Reames	57d02900b5	[test,instcombine] Precommit test for upcoming transform	2022-03-22 13:21:09 -07:00
Sanjay Patel	c4d74a93f6	[InstCombine] add test for abs with dominating condition; NFC There's a potential miscompile or missed optimization with propagating 'nsw' in the transform proposed in D122013, so we need at least one more test for coverage.	2022-03-22 10:42:52 -04:00
chenglin.bi	01a2ba5dfb	[InstCombine] add tests for abs with dominating condition; NFC Baseline tests for D122013 (issue #54132).	2022-03-22 10:37:12 -04:00
Sanjay Patel	60820e53ec	[InstCombine] try to canonicalize logical shift after bswap When shifting by a byte-multiple: bswap (shl X, C) --> lshr (bswap X), C bswap (lshr X, C) --> shl (bswap X), C This is an IR implementation of a transform suggested in D120648. The "swaps cancel" test models the motivating optimization from that proposal. Alive2 checks (as noted in the other review, we could use knownbits to handle shift-by-variable-amount, but that can be an enhancement patch): https://alive2.llvm.org/ce/z/pXUaRf https://alive2.llvm.org/ce/z/ZnaMLf Differential Revision: https://reviews.llvm.org/D122010	2022-03-22 09:10:55 -04:00
Sanjay Patel	9e9bda2e8f	[InstCombine] try to narrow shifted bswap-of-zext This is the IR counterpart to 370ebc9d9a573d6 which provided a bswap narrowing fix for issue #53867. Here we can be more general (although I'm not sure yet what would happen for illegal types in codegen - too rare to worry about?): https://alive2.llvm.org/ce/z/3-CPfo This will be more effective if we have moved the shift after the bswap as proposed in D122010, but it is independent of that patch. Differential Revision: https://reviews.llvm.org/D122166	2022-03-22 08:22:30 -04:00
Sanjay Patel	c4f31d1da5	[InstCombine] add tests for shift-of-bswap; NFC	2022-03-22 08:22:30 -04:00
serge-sans-paille	39b02d49cc	[instcombine] Support and test __builtin_object_size interaction with __strdup and __strndup Differential Revision: https://reviews.llvm.org/D122005	2022-03-21 11:30:51 +01:00
Sanjay Patel	1f001b25f1	[InstCombine] add tests for bswap with shifted operand; NFC	2022-03-18 11:22:15 -04:00
Andrew Wei	0af3e6a22d	[InstCombine] Sink instructions with multiple users in a successor block. This patch tries to sink instructions when they are only used in a successor block. This is a further enhancement patch based on Anna's commit: D109700, which allows sinking an instruction having multiple uses in a single user. In this patch, sink instructions with multiple users in a single successor block will be supported. It could fix a known issue from rust: https://github.com/rust-lang/rust/issues/51346#issuecomment-394443610 Reviewed By: nikic, reames Differential Revision: https://reviews.llvm.org/D121585	2022-03-18 11:53:45 +08:00
Andrew Wei	f241d43b40	[NFC][ InstCombine] precommit test for D121585 Based on original tests from D121585.	2022-03-18 00:37:21 +08:00
Matt Devereau	a9e08bc7c1	[AArch64][SVE] InstCombine llvm.aarch64.sve.sel to select InstCombine llvm.aarch64.sve.sel to select. This allows an existing instCombine added in 20b0fa91c9ee to fire. Differential Revision: https://reviews.llvm.org/D121792	2022-03-17 16:20:48 +00:00
Nikita Popov	4010a7a5d0	Reapply [InstCombine] Support switch in phi to cond fold Reapply with an explicit check for multi-edges, as the expected behavior of multi-edge dominance is unclear (D120811). ----- For conditional branches, we know the value is i1 0 or i1 1 along the outgoing edges. For switches we can apply exactly the same optimization, just with the known values determined by the switch cases.	2022-03-17 10:03:09 +01:00
Sanjay Patel	598721f866	[InstCombine] try harder to propagate 'nsz' through fneg-of-select This can be viewed as swapping the select arms: https://alive2.llvm.org/ce/z/jUvFMJ ...so we don't have the 'nsz' problem with the more general fold. This unlocks other folds for the motivating fabs example. This was discussed in issue #38828.	2022-03-15 11:05:29 -04:00
Sanjay Patel	2d3593e668	[InstCombine] add tests for fneg-of-select with FMF; NFC	2022-03-15 11:05:29 -04:00
Simon Pilgrim	7e4cf582cf	[InstCombine] Add general constant support to eq/ne icmp(add(X,C1),add(Y,C2)) -> icmp(add(X,C1-C2),Y) fold A further extension for Issue #32161 For eq/ne comparisons - the sign mismatch and bounds constraints are redundant, so if the that fold fails, fallback and just fold the constants directly. https://alive2.llvm.org/ce/z/cdodNQ The loop rotation test change looks mostly benign - the backend doesn't seem to suffer? https://gcc.godbolt.org/z/dErMY78To Differential Revision: https://reviews.llvm.org/D121551	2022-03-15 14:17:38 +00:00
Craig Topper	ce78e68261	[InstCombine] Fold select based logic of fcmps with same operands when FMF is present. If we have a logical and/or in select form and the true/false operand is an fcmp with poison generating FMF, we won't be able to fold it to an and/or instruction. This prevents us from optimizing the case where it is a logical operation of two fcmps with identical operands. This patch adds explicit checks for this case that doesn't rely on converting to and/or to do the optimization. It reuses the existing foldLogicOfFCmps, but adds a new flag to disable the other combine that is inside that function. FMF flags from the two FCmps are intersected using the logic added in D121243. The FIXME has been updated to indicate that we can only use a union for the non-select form. This allows us to optimize cases like this from compare-fp-3.c in the gcc torture suite with fast math. void test1 (float x, float y) { if ((x==y) && (x!=y)) link_error0(); } Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D121323	2022-03-14 14:45:07 -07:00

1 2 3 4 5 ...

6602 Commits