llvm-project

Author	SHA1	Message	Date
Konstantina Mitropoulou	135e5216ba	[NewGVN] Set parent to the temporal instructions that are generated during phi-of-ops optimization (#66314 ) - Test for future commit in NewGVN - [NewGVN] Set parent to the temporal instructions that are generated during phi-of-ops optimization	2023-09-18 10:13:44 -07:00
Nikita Popov	c7aacbb5b6	[ArgPromotion] Update allocsize indices after promotion Promotion can add/remove arguments. We need to update the indices in the allocsize attribute accordingly. Fixes https://github.com/llvm/llvm-project/issues/66103.	2023-09-18 16:15:16 +02:00
Kohei Asano	baf031a853	[MemCpyOpt] fix miscompile for non-dominated use of src alloca for stack-move optimization (#66618 ) Stack-move optimization, the optimization that merges src and dest alloca of the full-size copy, replaces all uses of the dest alloca with src alloca. For safety, we needed to check all uses of the dest alloca locations are dominated by src alloca, to be replaced. This PR adds the check for that. Fixes #65225	2023-09-18 21:29:10 +09:00
Ben Shi	87143ff9f2	[VectorCombine] Fix a spot in commit 068357d9b09cd635b1c2f126d119ce9afecb28f7 My previous commit leads to a crash in "Builders/sanitizer-x86_64-linux-fast" as https://lab.llvm.org/buildbot/#/builders/5/builds/36746. And this patch fixes it.	2023-09-18 15:01:47 +08:00
Ben Shi	068357d9b0	[VectorCombine] Enable transform 'scalarizeLoadExtract' for scalable vector types (#65443 ) The transform 'scalarizeLoadExtract' can be applied to scalable vector types if the index is less than the minimum number of elements. The check whether the index is less than the minimum number of elements locates at line 1175~1180. 'scalarizeLoadExtract' will call 'canScalarizeAccess' and check the returned result if this transform is safe. At the beginning of the function 'canScalarizeAccess', the index will be checked 1. If it is less than the number of elements of a fixed vector type. 2. If it is less than the minimum number of elements of a scalable vector type. Otherwise 'canScalarizeAccess' will return unsafe and this transform will be prevented.	2023-09-18 10:49:18 +08:00
Fangrui Song	b4d4146db3	[WholeProgramDevirt] Use llvm:: qualifier to implement declared functions. NFC	2023-09-17 19:31:42 -07:00
Yingwei Zheng	1679b20cd0	[InstCombine] Fix transforms of two select patterns (#65845 ) This patch fixes transforms of `select (~a \| c), a, b` and `select (c & b), a, b` as discussed in [D158983](https://reviews.llvm.org/D158983). Alive2: https://alive2.llvm.org/ce/z/ft6TDw	2023-09-18 01:28:37 +08:00
Florian Hahn	1d1cba44ea	[VPlan] Remove stray indent when printing scalar steps recipe. VPScalarIVStepsRecipe will now be printed as vp<%6> = SCALAR-STEPS vp<%3>, ir<1> instead of vp<%6> = SCALAR-STEPS vp<%3>, ir<1>	2023-09-17 10:15:52 +01:00
Antonio Frighetto	ce5b88bf10	[InstCombine] Handle constant arms in `select` of `srem` fold Extend folding for `2^n` euclidean division remainder operations on signed integers by handling the specific instance in which one `select` arm has already been replaced by 1. Reported-By: HypheX Fixes: https://github.com/llvm/llvm-project/issues/66417.	2023-09-16 12:22:46 +02:00
Yingwei Zheng	5163319ee2	[InstCombine] Use `ConstantInt::getBool` instead of `Constant::getIntegerValue`. NFC. See also https://reviews.llvm.org/D156238#inline-1546774	2023-09-16 17:41:10 +08:00
Alexey Bataev	434aa2fe56	[SLP]Improve canreuseExtracts for reordering analysis. Improve the analysis in canReuseExtracts for the reodering to better reorder extracts for ExtractSubvector pattern.	2023-09-15 12:09:45 -07:00
Zequan Wu	32db121b29	[Coverage] Allow Clang coverage to be used with debug info correlation. Debug info correlation is an option in InstrProfiling pass, which is used by both IR instrumentation and front-end instrumentation. So, Clang coverage can also benefits the binary size saving from it. Reviewed By: ellis Differential Revision: https://reviews.llvm.org/D157913	2023-09-15 13:47:23 -04:00
Anton Korobeynikov	51d5d7bbae	Extend `retcon.once` coroutines lowering to optionally produce a normal result (#66333 ) One of the main user of these kind of coroutines is swift. There yield-once (`retcon.once`) coroutines are used to temporary "expose" pointers to internal fields of various objects creating borrow scopes. However, in some cases it might be useful also to allow these coroutines to produce a normal result, but there is no convenient way to represent this (as compared to switched-resume kind of coroutines where C++ `co_return` is transformed to a member / callback call on promise object). The extension is simple: we allow continuation function to have a non-void result and accept optional extra arguments via a special `llvm.coro.end.result` intrinsic that would essentially forward them as normal results.	2023-09-15 09:54:38 -07:00
Alexey Bataev	b9ad72ba05	[SLP]Fix PR66176: SLP incorrectly reorders select operands. On the very first iteration for the reductions, when trying to build reduction for boolean logic operations, no need to compare LHS/RHS with the Reduction(VectorizedTree), need to compare with actual parameters of the reduction operations.	2023-09-15 03:57:36 -07:00
Nikita Popov	18e77760ce	[GVN] Also remove phi nodes from VN table (PR65447) Followup to D158849: We also need to remove the phi node from the VN table, which is not handled by removeInstruction(). Fixes https://github.com/llvm/llvm-project/issues/65447.	2023-09-15 11:50:34 +02:00
Nikita Popov	07460b6666	[MemCpyOpt] Avoid infinite loop in processMemSetMemCpyDependence (PR54983) This adds an additional transform to drop zero-size memcpys, also in the case where the size is only zero after instruction simplification. The motivation is the case from PR54983 where the size is non-trivially zero, and processMemSetMemCpyDependence() keeps trying to reduce the memset size by zero bytes. This fix it's not really principled. It only works on the premise that if InstSimplify doesn't realize the size is zero, then AA also won't. The principled approach would be to instead add a isKnownNonZero() guard to the processMemSetMemCpyDependence() transform, but I suspect that would render that optimization mostly useless (at least it breaks all the existing test coverage -- worth noting that the constant size case is also handled by DSE, so I think this transform is primarily about the dynamic size case). Fixes https://github.com/llvm/llvm-project/issues/54983. Fixes https://github.com/llvm/llvm-project/issues/64886. Differential Revision: https://reviews.llvm.org/D124078	2023-09-15 09:10:15 +02:00
Nikita Popov	7c229f6e85	[GVN] Invalidate MDA when deduplicating phi nodes Duplicate phi nodes were being directly removed, without invalidating MDA. This could result in a new phi node being allocated at the same address, incorrectly reusing a cache entry. Fix this by optionally allowing EliminateDuplicatePHINodes() to collect phi nodes to remove into a vector, which allows GVN to handle removal itself. Fixes https://github.com/llvm/llvm-project/issues/64598. Differential Revision: https://reviews.llvm.org/D158849	2023-09-15 07:04:32 +02:00
Marc Auberer	1f313034cb	[InstCombine] Remove unnecessary one-use-check (#66419 ) This removes a oneUse check, that is actually unnecessary. Alive2: https://alive2.llvm.org/ce/z/qEkUEf Original patch: https://reviews.llvm.org/D159380	2023-09-15 06:46:30 +02:00
Alexey Bataev	c15c1e5dd5	[SLP]Do not account non-instructions for external use. If the non-instruction gets vectorized, no need to account its extract cost, it won't be removed and replaced by extractelement instruction.	2023-09-14 12:40:33 -07:00
Justin Bogner	71e3642619	[Transforms][DXIL] Wire up a basic DXILUpgrade pass (#66275 ) This pass will upgrade DXIL-style llvm constructs (which are mostly metadata) into the representations we use in LLVM for the same concepts. For now we just strip the valver metadata, which we don't need. Later changes will make this pass more useful, and then we should be able to wire it into clang and possibly the DirectX backend's AsmParser.	2023-09-14 11:02:31 -07:00
Matt Arsenault	07acfe3a4d	ADT: Replace FPClassTest fabs with inverse_fabs and unknown_sign (#66390 )	2023-09-14 19:46:53 +03:00
Björn Pettersson	a0ce4384a6	[LICM] Simplify isLoadInvariantInLoop given opaque pointers (#65597 ) Since we no longer support typed pointers in LLVM IR, the PtrASXTy in isLoadInvariantInLoop was set to be equal to Addr->getType() (an opaque ptr in the same address space). That made the loop looking through bitcasts redundant.	2023-09-14 16:53:34 +02:00
Paul Walker	c7d65e4466	[IR] Enable load/store/alloca for arrays of scalable vectors. Differential Revision: https://reviews.llvm.org/D158517	2023-09-14 13:49:01 +00:00
Kohei Asano	fef8249220	[SimplifyCFG] handle monotonic wrapped case for D150943 (#65882 )	2023-09-14 21:26:11 +09:00
khei4	7f3610ac69	Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization" This reverts commit efe8aa2e618122e8050af10cc5d6ad83f24ef557. Differential Revision: https://reviews.llvm.org/D155406	2023-09-14 19:42:36 +09:00
Nikita Popov	1fc73cacb2	[InstCombine] Propagate nsw flag when negating When pushing a sub nsw 0, %x negation into an expression, try to preserve the nsw flag for the cases where this is possible. Do this by passing the flag through recursive Negator::negate() calls. Proofs: https://alive2.llvm.org/ce/z/oRPNcY Differential Revision: https://reviews.llvm.org/D158510	2023-09-14 09:09:45 +02:00
Shilei Tian	22e1df7f5b	[LLVM][OpenMPOpt] Fix a crash when associated function is nullptr (#66274 ) The associated function can be a nullptr if it is an indirect call. This causes a crash in `CheckCallee` which always assumes the callee is a valid pointer. Fix #66904.	2023-09-13 20:22:59 -04:00
Noah Goldstein	2a904f456a	[InstCombine] Rename some shadow variables; NFC Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D159325	2023-09-13 15:50:18 -05:00
Noah Goldstein	119194ada6	[InstCombine] Transform `(icmp ult/uge (and X, Y), X)` -> `(icmp ne/eq (and X, Y), X)` eq/ne are generally easier to reason about elsewhere. ult -> ne: https://alive2.llvm.org/ce/z/5wxXGt uge -> eq: https://alive2.llvm.org/ce/z/Dw6kqG Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D145425	2023-09-13 15:50:17 -05:00
Sergey Kachkov	4b14148d24	[GVN] Skip debug instructions in findDominatingValue function (#65977 ) findDominatingValue has a search limit, and when it is reached, optimization is not applied. This patch fixes the issue that this limit also takes into account debug intrinsics, so the result of optimization can depend from the presence of debug info.	2023-09-13 11:23:26 +03:00
Matthias Braun	168c288af1	JumpThreading: Propagate branch weights in tryToUnfoldSelectInCurrBB (#66116 ) Propagate "branch_weights" metadata whe turning a select into a conditional branch in tryToUnfoldSelectInCurrBB	2023-09-12 13:36:49 -07:00
AdityaK	f061b13175	Statically analyze likely and unlikely blocks based on metadata The builtin_expect(), and C++20's likely, unlikely attributes assign branch_weights to annotated branches. This patch adds the the ability to query branch !prof metadata and improve static analysis based on that. Fixes: https://github.com/llvm/llvm-project/issues/64998 Reviewers: tejohnson, efriedma Differential Revision: https://reviews.llvm.org/D159336	2023-09-12 11:28:15 -07:00
Aleksandr Popov	ec0f678744	[GuardWidening] Fix widening possibility check (#66064 ) In the 0e0ff8573de69286536e4f49098226eda0c4c7f5 was introduced inconsistency between condition widening and checking if it's possible to widen. We check the possibility to hoist checks parsed from the condition, but hoist entire condition. This patch returns testing that a condition can be hoisted rather than the checks parsed from that condition. Co-authored-by: Aleksander Popov <apopov@azul.com>	2023-09-12 14:49:14 +02:00
zhanglimin	ec42c78cc4	[sanitizer][msan] VarArgHelper for loongarch64 This patch adds support for variadic argument for loongarch64, which is based on MIPS64. And `check-msan` all pass. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D158587	2023-09-12 09:51:18 +08:00
Konstantina Mitropoulou	798f2465f3	[NewGVN] Decrement UseCount only if SSA copy has one use Committing on behalf of @vladimirradosavljevic (Vladimir Radosavljevic) Differential Revision : https: // reviews.llvm.org/D157267	2023-09-11 18:08:14 -07:00
Matthias Braun	b30c9c9378	LoopUnrollRuntime: Add weights to all branches Make sure every conditional branch constructed by `LoopUnrollRuntime` code sets branch weights. - Add new 1:127 weights for the conditional jumps checking whether the whole (unrolled) loop should be skipped in the generated prolog or epilog code. - Remove `updateLatchBranchWeightsForRemainderLoop` function and just add weights immediately when constructing the relevant branches. This leads to simpler code and makes the code more obvious as every call to `CreateCondBr` now has a `BranchWeights` parameter. - Rework formula for epilogue latch weights, to assume equal distribution of remainders and remove `assert` (as I was able to reach this code when forcing small unroll factors on the commandline). Differential Revision: https://reviews.llvm.org/D158642	2023-09-11 14:23:29 -07:00
Jeremy Morse	e54277fa10	[NFC][RemoveDIs] Use iterators over inst-pointers when using IRBuilder This patch adds a two-argument SetInsertPoint method to IRBuilder that takes a block/iterator instead of an instruction, and updates many call sites to use it. The motivating reason for doing this is given here [0], we'd like to pass around more information about the position of debug-info in the iterator object. That necessitates passing iterators around most of the time. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939 Differential Revision: https://reviews.llvm.org/D152468	2023-09-11 20:01:19 +01:00
Alexey Bataev	9a90457a76	[SLP][NFC]Use ArrayReffor operands directly instead of entry/operand number, NFC.	2023-09-11 11:16:13 -07:00
Matthias Braun	5d7f84ee17	LoopRotate: Add code to update branch weights This adds code to the loop rotation transformation to ensure that the computed block execution counts for the loop bodies are the same before and after the transformation. This isn't always true in practice, but I believe this is because of numeric inaccuracies in the BlockFrequency computation. The invariants this is modeled on and heuristic choice of 0-trip loop amount is explained in a lenghty comment in the new `updateBranchWeights()` function. Differential Revision: https://reviews.llvm.org/D157462	2023-09-11 10:38:06 -07:00
Jeremy Morse	1d82c765ef	[NFC][RemoveDIs] Provide an iterator-taking split-block method As per the stack of patches this is attached to, allow users of BasicBlock::splitBasicBlock to provide an iterator for a position, instead of just an instruction pointer. This is to fit with my proposal for how to get rid of debug intrinsics [0]. There are other call-sites that would need to change, but this is sufficient for a stage2clang self host and some other C++ projects to build identical binaries, in the context of the whole remove-DIs project. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939 Differential Revision: https://reviews.llvm.org/D152545	2023-09-11 17:50:47 +01:00
Jeremy Morse	d529943a27	[NFC][RemoveDIs] Prefer iterators over inst-pointers in InstCombine As per my proposal for how to eliminate debug intrinsics [0], for various places in InstCombine prefer to insert using an instruction iterator rather than an instruction pointer. This is so that we can eventually pass more information in the iterator class. These call-sites where I've changed the spelling are those that necessary to build a stage2clang to produce an identical binary in the coming no-debug-intrinsics mode. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939 Differential Revision: https://reviews.llvm.org/D152543	2023-09-11 15:04:51 +01:00
Jeremy Morse	6942c64e81	[NFC][RemoveDIs] Prefer iterator-insertion over instructions Continuing the patch series to get rid of debug intrinsics [0], instruction insertion needs to be done with iterators rather than instruction pointers, so that we can communicate information in the iterator class. This patch adds an iterator-taking insertBefore method and converts various call sites to take iterators. These are all sites where such debug-info needs to be preserved so that a stage2 clang can be built identically; it's likely that many more will need to be changed in the future. At this stage, this is just changing the spelling of a few operations, which will eventually become signifiant once the debug-info bearing iterator is used. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939 Differential Revision: https://reviews.llvm.org/D152537	2023-09-11 11:48:45 +01:00
Johannes Doerfert	d47cf2bff3	[OpenMPOpt] Allow indirect calls in AAKernelInfoCallSite (#65836 ) The Attributor has gained support for indirect calls but it is opt-in. This patch makes AAKernelInfoCallSite able to handle multiple potential callees.	2023-09-10 19:02:09 -07:00
Yingwei Zheng	44e5afdb91	[InstCombine] Generalize foldICmpWithMinMax This patch generalizes the fold of `icmp pred min/max(X, Y), Z` to address the issue https://github.com/llvm/llvm-project/issues/62898. For example, we can fold `smin(X, Y) < Z` into `X < Z` when `Y > Z` is implied by constant folds/invariants/dom conditions. Alive2 (with `--disable-undef-input` due to the limitation of --smt-to=10000): https://alive2.llvm.org/ce/z/rB7qLc You can run the standalone translation validation tool `alive-tv` locally to verify these transformations. ``` alive-tv transforms.ll --smt-to=600000 --exit-on-error ``` Reviewed By: goldstein.w.n Differential Revision: https://reviews.llvm.org/D156238	2023-09-11 02:26:48 +08:00
Yingwei Zheng	780b046bd0	[InstCombine] Use m_c_And/m_c_Or instead of duplicate logic. NFC. See also https://reviews.llvm.org/D153148#inline-1535588	2023-09-10 23:34:23 +08:00
Tyler Lanphear	52f6f418c7	[GlobalOpt] Handle DL.getAllocaAddrSpace() != 0 (#65847 ) Fix crash on RAUW due to locals and globals having different address spaces. This is the intent of the original code, but it assumes the alloca address space is 0. This patch fixes the code to check that the global's address space matches `DL.getAllocaAddrSpace()` instead. Fixes #65155	2023-09-09 10:12:42 -07:00
Dhruv Chawla	e13e808283	[SROA] Limit the number of allowed slices when trying to split allocas This patch adds a hidden CLI option "--sroa-max-alloca-slices", which is an integer that controls the maximum number of alloca slices SROA can consider before bailing out. This is useful because it may not be profitable to split memcpys into (possibly tens of) thousands of loads/stores. This also prevents an issue with exponential compile time explosion in passes like DSE and MemCpyOpt caused by excessive alloca splitting. Fixes https://github.com/rust-lang/rust/issues/88580. Differential Revision: https://reviews.llvm.org/D159354	2023-09-09 11:00:47 +05:30
Alexey Bataev	5bab59de44	[SLP]Try to vectorize scalars, being vectorized already, but does not need to be scheduled. If the scalar does not need to be scheduled and it was vectorized already in one of the vector nodes, we still can try to vectorize it in another node. Just does not need account its cost in the scalar total cost, as it will be handled in the main vectorized node. Differential Revision: https://reviews.llvm.org/D159205	2023-09-08 13:34:12 -07:00
Shilei Tian	499f691be1	Revert "Reapply "[Attributor] Enable AAAddressSpace for OpenMPOpt (#65544 )""" This reverts commit c5525a6e8fb7f7c2ce7126ac5b17aaff01ac407f. AMD BB is not happy again.	2023-09-08 15:46:23 -04:00
Shilei Tian	c5525a6e8f	Reapply "[Attributor] Enable AAAddressSpace for OpenMPOpt (#65544 )"" This reverts commit e592c2dcf5b7d2da6c2564f5d9990aa34079bad4 that reverts e91e3cf.	2023-09-08 15:39:16 -04:00

1 2 3 4 5 ...

34599 Commits