llvm-project

Author	SHA1	Message	Date
Mikael Holmen	ce0a750fe4	[AggressiveInstCombine] Ignore debug instructions when load combining (#70200 ) We previously included debug instructions when counting instructions when looking for loads to combine. This meant that the presence of debug instructions could affect optimization, as shown in the updated testcase. This fixes #69925.	2023-10-26 09:58:54 +02:00
Craig Topper	e9e458418f	[AggressiveInstCombine] Improve line breaks in comment. NFC The comments contain IR where some instructions don't fit in 80 columns. The extra part of the line was placed in front of the next IR instruction instead of on its own line.	2023-08-25 10:08:23 -07:00
Alexander Kornienko	0b779b0daa	Revert "[AggressiveInstCombine] Fold strcmp for short string literals" This reverts commit 5dde755188e34c0ba5304365612904476c8adfda, cbfcf90152de5392a36d0a0241eef25f5e159eef and 8981520b19f2d2fe3d2bc80cf26318ee6b5b7473 due to a miscompile introduced in 8981520b19f2d2fe3d2bc80cf26318ee6b5b7473 (see https://reviews.llvm.org/D154725#4568845 for details) Differential Revision: https://reviews.llvm.org/D157430	2023-08-08 22:53:45 +02:00
Maksim Kita	5dde755188	[AggressiveInstCombine][NFC] Fix typo AggressiveInstCombine fix typo in expandStrcmp method. Differential Revision: https://reviews.llvm.org/D156556	2023-08-07 21:51:44 +03:00
David Green	aa97f6b494	[AIC] Fix the sext cost operands in tryToFPToSat As pointed out in D125755 the operand of a call to getCastInstrCost had the Src and Dst the wrong way around. Differential Revision: https://reviews.llvm.org/D154841	2023-08-07 09:33:18 +01:00
Maksim Kita	cbfcf90152	[AggressiveInstCombine] Fold strcmp for short string literals with size 2 Fold strcmp for short string literals with size 2. Depends D155742. Differential Revision: https://reviews.llvm.org/D155743	2023-07-27 18:45:21 +03:00
Maksim Kita	8981520b19	[AggressiveInstCombine] Fold strcmp for short string literals Fold strcmp() against 1-char string literals. This designates AggressiveInstCombine as the pass for libcalls simplifications that may need to change the control flow graph. Fixes https://github.com/llvm/llvm-project/issues/58003. Differential Revision: https://reviews.llvm.org/D154725	2023-07-19 17:12:27 +02:00
Matt Arsenault	6640df94f9	ValueTracking: Remove CannotBeOrderedLessThanZero Replace the last user of CannotBeOrderedLessThanZero with new version. Makes assumes work in this case.	2023-07-11 20:42:18 -04:00
Youngsuk Kim	d22a236ae7	[llvm] Replace use of Type::getPointerTo() (NFC) Partial progress towards replacing in-tree uses of `Type::getPointerTo()`. If `getPointerTo()` is used solely to support an unnecessary bitcast, remove the bitcast. Reviewed By: barannikov88, nikic Differential Revision: https://reviews.llvm.org/D153307	2023-06-23 22:32:29 -04:00
bipmis	cbc50ba12e	[AggressiveInstCombine] Handle the nested GEP/BitCast scenario in Load Merge. This seems to be an issue currently where there are nested/chained GEP/BitCast Pointers. The patch generates a new GEP for the wider load to avoid dominance problems. Differential Revision: https://reviews.llvm.org/D150864	2023-05-24 10:36:11 +01:00
Matt Arsenault	86d0b524f3	ValueTracking: Expand signature of isKnownNeverInfinity/NaN This is in preparation for replacing the implementation with a wrapper around computeKnownFPClass.	2023-05-16 20:42:58 +01:00
khei4	39a0677784	[AggressiveInstCombine] folding load for constant global patterened arrays and structs by GEP-indices Differential Revision: https://reviews.llvm.org/D146622 Fixes https://github.com/llvm/llvm-project/issues/61615 Reviewed By: nikic	2023-05-12 19:02:28 +09:00
Jordan Rupprecht	e08c397a88	Revert "[AggressiveInstCombine] folding load for constant global patterened arrays and structs by GEP-indices Differential Revision: https://reviews.llvm.org/D146622 Fixes https://github.com/llvm/llvm-project/issues/61615 " This reverts commit 0574a4be879e07b48ba9be8d63eebba49a04dfe8. It causes a compiler crash due to a div by zero.	2023-05-09 10:38:46 -07:00
khei4	0574a4be87	[AggressiveInstCombine] folding load for constant global patterened arrays and structs by GEP-indices Differential Revision: https://reviews.llvm.org/D146622 Fixes https://github.com/llvm/llvm-project/issues/61615	2023-05-09 23:22:21 +09:00
Arthur Eubanks	ed443d81d1	[AggressiveInstCombine] Only fold consecutive shifts of loads with constant shift amounts This is what the code assumed but never actually checked. Fixes https://github.com/llvm/llvm-project/issues/62509. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D149896	2023-05-04 13:52:25 -07:00
khei4	cde1c3c014	[AggressiveInstCombine] use m_Deferred on funnel shift(NFC) Differential Revision: https://reviews.llvm.org/D146798 Reviewed By: nikic	2023-03-24 21:49:02 +09:00
khei4	434b0badb5	[AggressiveInstCombine] folding load for constant global patterened arrays and structs by alignment Differential Revision: https://reviews.llvm.org/D144445 Reviewed By: nikic fix: wrong arrow	2023-03-23 23:31:22 +09:00
chenglin.bi	76df706bca	Revert "[LogicCombine 1/?] Implement a general way to simplify logical operations." This reverts commit 97dcbea63e11d566cff0cd3a758cf1114cf1f633.	2023-03-14 09:00:06 +08:00
chenglin.bi	97dcbea63e	[LogicCombine 1/?] Implement a general way to simplify logical operations. This patch involves boolean ring to simplify logical operations. We can treat `&` as ring multiplication and `^` as ring addition. So we need to canonicalize all other operations to `` `+`. Like: ``` a & b -> a b a ^ b -> a + b ~a -> a + 1 a \| b -> a * b + a + b c ? a : b -> c * a + (c + 1) * b ``` In the code, we use a mask set to represent an expression. Every value that is not comes from logical operations could be a bit in the mask. The mask itself is a multiplication chain. The mask set is an addiction chain. We can calculate two expressions based on boolean algebras. For now, the initial patch only enabled on and/or/xor, Later we can enhance the code step by step. Reference: https://en.wikipedia.org/wiki/Boolean_ring Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D142803	2023-03-02 20:46:16 +08:00
Paul Walker	f53234cbfd	[AggressiveInstCombine] Fix invalid TypeSize conversion when combining loads. Much of foldLoadsRecursive relies on knowing the size of loaded data, which is not possible for scalable vector types. However, the logic of combining two small loads into one bigger load does not apply for vector types so rather than converting the algorithm to use TypeSize I've simply added an early exit for vectors. Fixes #59510 Differential Revision: https://reviews.llvm.org/D140106	2022-12-17 15:34:27 +00:00
bipmis	e9393789a9	[AggressiveInstCombine] Handle the insert point of the merged load correctly. This patch updates the load insert point of the merged load in AggressiveInstCombine(). This is done to handle the reported test breaks by handling Alias Analysis correctly. Differential Revision: https://reviews.llvm.org/D137201	2022-11-29 10:53:51 +00:00
Stanislav Mekhanoshin	bcaf31ec3f	[AMDGPU] Allow finer grain control of an unaligned access speed A target can return if a misaligned access is 'fast' as defined by the target or not. In reality there can be different levels of 'fast' and 'slow'. This patch changes the boolean 'Fast' argument of the allowsMisalignedMemoryAccesses family of functions to an unsigned representing its speed. A target can still define it as it wants and the direct translation of the current code uses 0 and 1 for current false and true. This makes the change an NFC. Subsequent patch will start using an actual value of speed in the load/store vectorizer to compare if a vectorized access going to be not just fast, but not slower than before. Differential Revision: https://reviews.llvm.org/D124217	2022-11-17 09:23:53 -08:00
Arthur Eubanks	70dc3b811e	[AggressiveInstCombine] Remove legacy PM pass As part of legacy PM optimization pipeline removal. This shouldn't be used in codegen pipelines so it should be ok to remove. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D137116	2022-11-15 14:35:15 -08:00
bipmis	150fc73dda	[AggressiveInstCombine] Avoid load merge/widen if stores are present b/w loads This patch is to address the test cases in which the load has to be inserted at a right point. This happens when there is a store b/w the loads. This patch reverts the loads merge in all cases when stores are present b/w loads and will eventually be replaced with proper fix and test cases. Differential Revision: https://reviews.llvm.org/D137333	2022-11-03 14:32:07 +00:00
bipmis	38f3e44997	[AggressiveInstCombine] Load merge the reverse load pattern of consecutive loads. This patch extends the load merge/widen in AggressiveInstCombine() to handle reverse load patterns. Differential Revision: https://reviews.llvm.org/D135137	2022-10-19 11:22:58 +01:00
David Stuttard	d1d7d2235c	[AggressiveInstCombine] Fix cases where non-opaque pointers are used In the case of non-opaque pointers, when combining consecutive loads, need to bitcast the pointer source to the combined type size, otherwise asserts are triggered. Differential Revision: https://reviews.llvm.org/D135249	2022-10-05 13:42:46 +01:00
bipmis	3b49a9fcf6	[AggressiveInstCombine] Combine consecutive loads which are being merged to form a wider load. The patch simplifies some of the patterns as below 1. (ZExt(L1) << shift1) \| (ZExt(L2) << shift2) -> ZExt(L3) << shift1 2. (ZExt(L1) << shift1) \| ZExt(L2) -> ZExt(L3) The pattern is indicative of the fact that the loads are being merged to a wider load and the only use of this pattern is with a wider load. In this case for a non-atomic/non-volatile loads reduce the pattern to a combined load which would improve the cost of inlining, unrolling, vectorization etc. Fix the error reported on reverse load merge. Differential Revision: https://reviews.llvm.org/D127392	2022-09-28 17:32:47 +01:00
Dmitri Gribenko	954d3cd2c6	Revert "[AggressiveInstCombine] Combine consecutive loads which are being merged to form a wider load." This reverts commit 3c70c8c1df66500f67f77596b1e76cf0a8447ee5. After this commit, during the 3-stage bootstrap the second-stage Clang crashes.	2022-09-23 19:21:09 +02:00
bipmis	3c70c8c1df	[AggressiveInstCombine] Combine consecutive loads which are being merged to form a wider load. The patch simplifies some of the patterns as below 1. (ZExt(L1) << shift1) \| (ZExt(L2) << shift2) -> ZExt(L3) << shift1 2. (ZExt(L1) << shift1) \| ZExt(L2) -> ZExt(L3) The pattern is indicative of the fact that the loads are being merged to a wider load and the only use of this pattern is with a wider load. In this case for a non-atomic/non-volatile loads reduce the pattern to a combined load which would improve the cost of inlining, unrolling, vectorization etc. Differential Revision: https://reviews.llvm.org/D127392	2022-09-23 10:19:50 +01:00
Djordje Todorovic	f0f8b46863	Recommit "[AggressiveInstCombine] Lower Table Based CTTZ The bug reported on the [0] has been fixed. The issue was we have not checked if the global variables that represent cttz tables was constant. There is a new negative test added in negative-lower-table-based-cttz.ll that represents this. [0] https://reviews.llvm.org/rGdf868edee561eb973edd85ec9df41c67aa0bff6b	2022-09-20 13:12:47 +02:00
Djordje Todorovic	b080d0bae8	Revert ""Recommit "[AggressiveInstCombine] Lower Table Based CTTZ""" This reverts commit df868edee561eb973edd85ec9df41c67aa0bff6b, as it introduces a bug found by Alive2 (more on the rGdf868edee561).	2022-09-12 08:23:07 +02:00
Djordje Todorovic	df868edee5	"Recommit "[AggressiveInstCombine] Lower Table Based CTTZ"" This reverts commit 053841c5624ca7eacd108a26071d8a1cefe1bebd. We faced a use-after-free after pushing the D113291, since the foldSqrt() has a call to eraseFromParent(). The function should be at the end of the main loop that folds the patterns. This patch fixes that.	2022-09-09 10:29:39 +02:00
Djordje Todorovic	7aec9ddcfd	Revert "Recommit "[AggressiveInstCombine] Lower Table Based CTTZ"" This reverts commit f87993915768772d113bfd524347ce4341b843cf.	2022-09-08 17:01:16 +02:00
Djordje Todorovic	f879939157	Recommit "[AggressiveInstCombine] Lower Table Based CTTZ"	2022-09-08 16:36:46 +02:00
Richard Smith	053841c562	Revert "[AggressiveInstCombine] Lower Table Based CTTZ" This reverts commit fec01ee3f5244bb9a04bc4310fc892c56c5b6bab. According to asan, this patch introduces a heap use after free.	2022-09-02 16:19:09 -07:00
Djordje Todorovic	fec01ee3f5	[AggressiveInstCombine] Lower Table Based CTTZ This patch introduces recognition of table-based ctz implementation during the AggressiveInstCombine. This fixes the [0]. [0] https://bugs.llvm.org/show_bug.cgi?id=46434 Differential Revision: https://reviews.llvm.org/D113291	2022-09-02 17:26:55 +02:00
Sanjay Patel	e079bf6558	[AggressiveInstCombine] check sqrt operand to allow more libcall->intrinsic transforms This should fix issue #56383 (at least when compiled with -O3 because this pass is only run at -O3 currently).	2022-07-27 11:36:13 -04:00
Sanjay Patel	e3205b8765	[AggressiveInstCombine] convert sqrt libcalls with "nnan" to sqrt intrinsics This is an alternate to D129155 that uses TTI.haveFastSqrt() to avoid a potential miscompile for programs with reads of errno. Moving the transform to AggressiveInstCombine provides access to TTI. If a sqrt call has "nnan", that implies that the input argument is never negative because sqrt of {negative number} --> NAN. If the argument is never negative and the call can be lowered without a libcall, then we can assume that errno accesses are unchanged after lowering, so the call can be translated to the LLVM intrinsic (which is expected to become inline code). This affects codegen for targets like x86 that have sqrt instructions, but still have to conservatively assume that a libcall may be needed to set errno as shown in issue #52620 and issue #56383. This patch won't solve those examples - we will need to extend this to use CannotBeOrderedLessThanZero or similar, enhance that analysis for new operators, and/or deal with llvm.assume too. Differential Revision: https://reviews.llvm.org/D129167	2022-07-26 15:50:14 -04:00
David Green	4a5cb957a1	[AggressiveInstcombine] Conditionally fold saturated fptosi to llvm.fptosi.sat This adds a fold for aggressive instcombine that converts smin(smax(fptosi(x))) into a llvm.fptosi.sat, providing that the saturation constants are correct and the cost of the llvm.fptosi.sat is lower. Unfortunately, a llvm.fptosi.sat cannot always be converted back to a smin/smax/fptosi. The llvm.fptosi.sat intrinsic is more defined that the original, which produces poison if the original fptosi was out of range. The llvm.fptosi.sat will saturate any value, so needs to be expanded to a fptosi(fpmin(fpmax(x))), which can be worse for codegeneration depending on the target. So this change thais conditional on the backend reporting that the llvm.fptosi.sat is cheaper that the original smin+smax+fptost. This is a change to the way that AggressiveInstrcombine has worked in the past. Instead of just being a canonicalization pass, that canonicalization can be dependant on the target in certain specific cases. Differential Revision: https://reviews.llvm.org/D125755	2022-06-10 09:36:09 +01:00
serge-sans-paille	59630917d6	Cleanup includes: Transform/Scalar Estimated impact on preprocessor output line: before: 1062981579 after: 1062494547 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120817	2022-03-03 07:56:34 +01:00
Kazu Hirata	9ed6800ef9	[Transforms] Use default member initialization in MaskOps (NFC)	2022-02-05 21:39:21 -08:00
Kazu Hirata	843d1eda18	[llvm] Use llvm::reverse (NFC)	2021-11-06 19:31:18 -07:00
Chris Lattner	735f46715d	[APInt] Normalize naming on keep constructors / predicate methods. This renames the primary methods for creating a zero value to `getZero` instead of `getNullValue` and renames predicates like `isAllOnesValue` to simply `isAllOnes`. This achieves two things: 1) This starts standardizing predicates across the LLVM codebase, following (in this case) ConstantInt. The word "Value" doesn't convey anything of merit, and is missing in some of the other things. 2) Calling an integer "null" doesn't make any sense. The original sin here is mine and I've regretted it for years. This moves us to calling it "zero" instead, which is correct! APInt is widely used and I don't think anyone is keen to take massive source breakage on anything so core, at least not all in one go. As such, this doesn't actually delete any entrypoints, it "soft deprecates" them with a comment. Included in this patch are changes to a bunch of the codebase, but there are more. We should normalize SelectionDAG and other APIs as well, which would make the API change more mechanical. Differential Revision: https://reviews.llvm.org/D109483	2021-09-09 09:50:24 -07:00
Anton Afanasyev	d1f9b21677	[AggressiveInstCombine] Add `AssumptionCache` to aggressive instcombine Add support for @llvm.assume() to TruncInstCombine allowing optimizations based on these intrinsics while computing known bits.	2021-09-07 16:45:00 +03:00
Arthur Eubanks	6b9524a05b	[NewPM] Don't mark AA analyses as preserved Currently all AA analyses marked as preserved are stateless, not taking into account their dependent analyses. So there's no need to mark them as preserved, they won't be invalidated unless their analyses are. SCEVAAResults was the one exception to this, it was treated like a typical analysis result. Make it like the others and don't invalidate unless SCEV is invalidated. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D102032	2021-05-18 13:49:03 -07:00
Simon Pilgrim	88c5b50060	[AggressiveInstCombine] Generalize foldGuardedRotateToFunnelShift to generic funnel shifts (REAPPLIED) The fold currently only handles rotation patterns, but with the maturation of backend funnel shift handling we can now realistically handle all funnel shift patterns. This should allow us to begin resolving PR46896 et al. Ensure we block poison in a funnel shift value - similar to rG0fe91ad463fea9d08cbcd640a62aa9ca2d8d05e0 Reapplied with fix for PR48068 - we weren't checking that the shift values could be hoisted from their basicblocks. Differential Revision: https://reviews.llvm.org/D90625	2020-12-21 15:22:27 +00:00
Martin Storsjö	36cf1e7d0e	Revert "[AggressiveInstCombine] Generalize foldGuardedRotateToFunnelShift to generic funnel shifts" This reverts commit 59b22e495c15d2830f41381a327f5d6bf49ff416. That commit broke building for ARM and AArch64, reproducible like this: $ cat apedec-reduced.c a; b(e) { int c; unsigned d = f(); c = d >> 32 - e; return c; } g() { int h = i(); if (a) h = h << a \| b(a); return h; } $ clang -target aarch64-linux-gnu -w -c -O3 apedec-reduced.c clang: ../lib/Transforms/InstCombine/InstructionCombining.cpp:3656: bool llvm::InstCombinerImpl::run(): Assertion `DT.dominates(BB, UserParent) && "Dominance relation broken?"' failed. Same thing for e.g. an armv7-linux-gnueabihf target.	2020-11-04 08:39:32 +02:00
Simon Pilgrim	59b22e495c	[AggressiveInstCombine] Generalize foldGuardedRotateToFunnelShift to generic funnel shifts The fold currently only handles rotation patterns, but with the maturation of backend funnel shift handling we can now realistically handle all funnel shift patterns. This should allow us to begin resolving PR46896 et al. Differential Revision: https://reviews.llvm.org/D90625	2020-11-03 10:49:49 +00:00
Simon Pilgrim	55f15f99cb	[AggressiveInstCombine] foldGuardedRotateToFunnelShift - generalize rotation to funnel shift matcher. Replace matchRotate with a more general matchFunnelShift - at the moment this is still just used for rotation patterns.	2020-11-02 17:09:17 +00:00
Simon Pilgrim	fadd152317	[AggressiveInstCombine] foldAnyOrAllBitsSet - add uniform vector support Replace m_ConstantInt with m_APInt to support uniform vectors (with no undef elements) Adding non-undef support would involve some refactoring of the MaskOps struct but this might still be worth it.	2020-10-15 11:02:35 +01:00

1 2

74 Commits