llvm-project

Author	SHA1	Message	Date
Alexey Bataev	9b5f62685a	[SLP]Fix cost of the broadcast buildvector/gather. Need to include the cost of the initial insertelement to the cost of the broadcasts. Also, need to adjust the cost of the gather/buildvector if the element is inserted into poison/undef vector. Differential Revision: https://reviews.llvm.org/D140498	2023-01-06 09:25:05 -08:00
Nikita Popov	c60149b49e	Revert "[Dominator] Add findNearestCommonDominator() for Instructions (NFC)" This reverts commit 7f0de9573f758f5f9108795850337a5acbd17eef. This is missing handling for !isReachableFromEntry() blocks, which may be relevant for some callers. Revert for now.	2023-01-06 17:36:01 +01:00
Nikita Popov	7f0de9573f	[Dominator] Add findNearestCommonDominator() for Instructions (NFC) This is a recurring pattern: We want to find the nearest common dominator (instruction) for two instructions, but currently only provide an API for the nearest common dominator of two basic blocks. Add an overload that accepts and return instructions.	2023-01-06 17:06:25 +01:00
Guillaume Chatelet	87b6b347fc	Revert D141134 "[NFC] Only expose getXXXSize functions in TypeSize" The patch should be discussed further. This reverts commit dd56e1c92b0e6e6be249f2d2dd40894e0417223f.	2023-01-06 15:27:50 +00:00
Guillaume Chatelet	dd56e1c92b	[NFC] Only expose getXXXSize functions in TypeSize Currently 'TypeSize' exposes two functions that serve the same purpose: - getFixedSize / getFixedValue - getKnownMinSize / getKnownMinValue source : `bf82070ea4/llvm/include/llvm/Support/TypeSize.h (L337-L338)` This patch offers to remove one of the two and stick to a single function in the code base. Differential Revision: https://reviews.llvm.org/D141134	2023-01-06 15:24:52 +00:00
Nikita Popov	b8576086c7	[StackLifetime] Fix sign compare warning (NFC)	2023-01-06 16:11:11 +01:00
Nikita Popov	a6a526ec54	[IR] Add AllocaInst::getAllocationSize() (NFC) When fetching allocation sizes, we almost always want to have the size in bytes, but we were only providing an InBits API. Also add the corresponding byte-based conjugate to save some *8 and /8 juggling everywhere.	2023-01-06 15:36:16 +01:00
Keno Fischer	1436a9232b	[LVI] Look through negations when evaluating conditions This teaches LVI (and thus CVP) to extract range information from branches whose condition is negated using (`xor %c, true`). On the implementation side, we switch the cache to additionally track whether we're looking for the inverted value or not and otherwise using the existing support for computing inverted conditions. I think the biggest question here is why this negation shows up here at all. After all, it should always be possible for some other pass to fold such a negation into a branch, comparison or some other logical operation. Indeed, instcombine does just that. However, these negations can be otherwise fairly persistent, e.g. instsimplify is not able to exchange branch conditions from negations. In addition, jumpthreading, which sits at the same point in default pass pipeline also handles this pattern, which adds further evidence that we might expect these negations to not have been canonicalized away yet at this point in the pass pipeline. In the particular case I was looking at there was a bit of a circular dependency where flags computed by cvp were needed by instcombine, and incstombine's folding of the negation was needed for cvp. Adding a second instombine pass would have worked of course, but instcombine can be somewhat expensive, so it appeared desirable to not require it to have run before cvp (as is the case in the default pass pipeline). Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D140933	2023-01-05 23:03:46 +00:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Owen Anderson	ec40c8f6fe	[ValueTracking] Improve ComputeNumSignBits to handle Trunc Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D140796	2023-01-03 15:26:21 -07:00
Nikita Popov	3f04553e5c	[ValueTracking] Use SmallVector for non-undef/poison ops The way these APIs are used, there isn't really a benefit to deduplicating the ops as part of the API. The only place that benefits from this is PoisonChecking, and for that particular use the assertion emission was potentially non-deterministic. We should populate a vector for deterministic order and then deduplicate via a separate set.	2023-01-02 14:40:15 +01:00
Nikita Popov	e44b11d9b6	[ValueTracking] Treat branch on undef as UB as well We were already treating branch on poison as UB, but branch on undef is also UB. Move the checks into the correct function. From LangRef for br: > If ‘cond’ is poison or undef, this instruction has undefined behavior. From LangRef for switch: > If ‘value’ is poison or undef, this instruction has undefined behavior. There is a minor regression in dont-distribute-phi.ll, apparently we handle that pattern in logical but not bitwise form.	2023-01-02 12:34:23 +01:00
Nikita Popov	86195b8361	[ValueTracking] Remove branch-on-poison-as-ub flag (NFC) This has been enabled by default without issue for a while now, remove the flag.	2023-01-02 11:05:01 +01:00
Matt Arsenault	7e720b010a	ValueTracking: Fix canCreateUndefOrPoison for saturating shifts These need to consider the shift amount.	2022-12-30 11:28:28 -05:00
Sanjay Patel	6c232db2ae	[InstSimplify] fold selects where true/false arm is the same as condition We managed to fold related patterns in issue #59704, but we were missing these more basic folds: https://alive2.llvm.org/ce/z/y6d7SN	2022-12-30 08:54:09 -05:00
Sanjay Patel	f0faea5714	[InstSimplify] fold exact divide to poison if it is known to not divide evenly This is related to the discussion in D140665. I was looking over the demanded bits implementation in IR and noticed that we just bail out of a potential fold if a udiv is exact: `82be8a1d2b/llvm/lib/Transforms/InstCombine/InstCombineSimplifyDemanded.cpp (L799)` Also, see tests added with 7f0c11509e8f. Then, I saw that we could lose a fold to poison if we zap the exact with that transform, so this patch tries to catch that as a preliminary step. Alive2 proofs: https://alive2.llvm.org/ce/z/zCjKM7 https://alive2.llvm.org/ce/z/-tz_RK (trailing zeros must be "less-than") https://alive2.llvm.org/ce/z/c9CMsJ (general proof and specific example) Differential Revision: https://reviews.llvm.org/D140733	2022-12-29 10:26:50 -05:00
Benjamin Kramer	a3d58bbaff	Detemplate llvm::EmitGEPOffset and move it into a cpp file. NFC.	2022-12-29 16:24:21 +01:00
Sanjay Patel	b16d04d2b9	[InstSimplify] fix formatting and add bool function argument comments; NFC Make existing code conform with proposed additions in D140733.	2022-12-29 09:20:41 -05:00
Florian Hahn	a564048899	[SCEV] Properly clean up duplicated FoldCacheUser ID entries. The current code did not properly handled duplicated FoldCacheUser ID entries when overwriting an existing entry in the FoldCache. This triggered verification failures reported by @uabelho and #59721. The patch fixes that by removing stale IDs when overwriting an existing entry in the cache. Fixes #59721.	2022-12-28 00:09:52 +00:00
Roman Lebedev	f487dfd830	[NFC][Analysis] Implement `getShuffleMaskWithWidestElts()` wrapper (+tests) It will be needed in an upcoming patch to implement some shuffle combining.	2022-12-26 01:04:48 +03:00
Matt Arsenault	de8e0a4397	ValueTracking: Teach canCreateUndefOrPoison about saturating intrinsics	2022-12-23 09:42:33 -05:00
Matt Arsenault	876f3d6c91	ValueTracking: Add test for isKnownNeverInfinity for fptrunc	2022-12-22 09:38:14 -05:00
Max Kazantsev	9a7286b61f	[SCEV] Help getLoopInvariantExitCondDuringFirstIterations deal with complex `umin` exit counts. PR59615 Recent improvements in symbolic exit count computation revealed some problems with SCEV's ability to find invariant predicate during first iterations. Ultimately it is based on its ability to prove some facts for value on the last iteration. This last value, when it includes `umin` as part of exit count, isn't always simplified enough. The motivating example is following: https://github.com/llvm/llvm-project/issues/59615 Could not prove: ``` Pred = 36, LHS = (-1 + (-1 * (2147483645 umin (-1 + %var)<nsw>))<nsw> + %var), RHS = %var FoundPred = 36, FoundLHS = {1,+,1}<nuw><nsw><%bb3>, FoundRHS = %var ``` Can prove: ``` Pred = 36, LHS = (-1 + (-1 * (-1 + %var)<nsw>)<nsw> + %var), RHS = %var FoundPred = 36, FoundLHS = {1,+,1}<nuw><nsw><%bb3>, FoundRHS = %var ``` Here ` (2147483645 umin (-1 + %var)<nsw>)` is exit count composed of two parts from two different exits: `2147483645 ` and `(-1 + %var)<nsw>`. When it was only one (latter) analyzeable exit, for it everything was easily provable. Unfortunately, in general case `umin` in one of `add`'s operands doesn't guarantee that the whole sum reduces, especially in presence of negative steps and lack of `nuw`. I don't think there is a generic legal way to somehow play around this `umin`. So the ad-hoc solution is following: if we failed to find an equivalent predicate that is invariant during first `MaxIter` iterations, and `MaxIter = umin(a, b, c...)`, try to find solution for at least one of `a`, `b`, `c`... Because they all are `uge` than `MaxIter`, whatever is true during `a (b, c)` iterations is also true during `MaxIter` iterations. Differential Revision: https://reviews.llvm.org/D140456 Reviewed By: nikic	2022-12-21 18:12:17 +07:00
Kazu Hirata	c08fad8193	[llvm] Remove redundant initialization of std::optional (NFC)	2022-12-20 15:53:38 -08:00
Craig Topper	9b2fecec40	[BuildLibCalls][RISCV] Sign extend return value of bcmp on riscv64. riscv64 wants callees to sign extend signed and unsigned int returns. The caller can use this to avoid a sign extend if the result is used by a comparison since riscv64 only has 64-bit compares. InstCombine/SimplifyLibCalls aggressively turn memcmps that are only used by an icmp eq 0 into bcmp, but we lose the signext attribute that would have been present on the memcmp. This causes an unneeded sext.w in the generated assembly. This looks even sillier if bcmp is implemented alias to memcmp. In that case, not only did we not get any savings by using bcmp, we added an instruction. This probably applies to other functions, this just happens to be the one I noticed so far. See also the discussion here https://discourse.llvm.org/t/can-we-preserve-signext-return-attribute-when-converting-memcmp-to-bcmp/67126 Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D139901	2022-12-20 11:47:04 -08:00
Steven Wu	9cd6fbee7e	Fix module build after TargetParser Need to include the textual header from the correct module.	2022-12-20 10:31:19 -08:00
Matt Arsenault	2c52c811ee	ValueTracking: Document some difficult isKnownNeverInfinity cases Add a comment and some negative tests. I'd like to have test coverage and explicit handling of all the math operations for clarity.	2022-12-20 13:22:22 -05:00
Matt Arsenault	2bf17cc048	ValueTracking: Teach isKnownNeverInfinity about llvm.sin/llvm.cos	2022-12-20 13:17:03 -05:00
Matt Arsenault	9a21475651	ValueTracking: Teach isKnownNeverInfinity about sqrt	2022-12-20 13:03:07 -05:00
Matt Arsenault	41dd02e857	ValueTracking: Teach isKnownNeverInfinity about min/max functions	2022-12-20 12:52:59 -05:00
Matt Arsenault	4e37d00b9d	ValueTracking: Teach isKnownNeverInfinity about rounding intrinsics	2022-12-20 12:45:07 -05:00
Archibald Elliott	f09cf34d00	[Support] Move TargetParsers to new component This is a fairly large changeset, but it can be broken into a few pieces: - `llvm/Support/TargetParser` are all moved from the LLVM Support component into a new LLVM Component called "TargetParser". This potentially enables using tablegen to maintain this information, as is shown in https://reviews.llvm.org/D137517. This cannot currently be done, as llvm-tblgen relies on LLVM's Support component. - This also moves two files from Support which use and depend on information in the TargetParser: - `llvm/Support/Host.{h,cpp}` which contains functions for inspecting the current Host machine for info about it, primarily to support getting the host triple, but also for `-mcpu=native` support in e.g. Clang. This is fairly tightly intertwined with the information in `X86TargetParser.h`, so keeping them in the same component makes sense. - `llvm/ADT/Triple.h` and `llvm/Support/Triple.cpp`, which contains the target triple parser and representation. This is very intertwined with the Arm target parser, because the arm architecture version appears in canonical triples on arm platforms. - I moved the relevant unittests to their own directory. And so, we end up with a single component that has all the information about the following, which to me seems like a unified component: - Triples that LLVM Knows about - Architecture names and CPUs that LLVM knows about - CPU detection logic for LLVM Given this, I have also moved `RISCVISAInfo.h` into this component, as it seems to me to be part of that same set of functionality. If you get link errors in your components after this patch, you likely need to add TargetParser into LLVM_LINK_COMPONENTS in CMake. Differential Revision: https://reviews.llvm.org/D137838	2022-12-20 11:05:50 +00:00
Nikita Popov	88419a30a0	[LICM] Allow load-only scalar promotion in the presence of aliasing loads During scalar promotion, if there are additional potentially-aliasing loads outside the promoted set, we can still perform a load-only promotion. As the stores are retained, any potentially-aliasing loads will still read the correct value. This increases the number of load promotions in llvm-test-suite by a factor of two: \| Old \| New licm.NumPromotionCandidates \| 4448 \| 6038 licm.NumLoadPromoted \| 479 \| 1069 licm.NumLoadStorePromoted \| 1459 \| 1459 Unfortunately, this does have some impact on compile-time: http://llvm-compile-time-tracker.com/compare.php?from=57f7f0d6cf0706a88e1ecb74f3d3e8891cceabfa&to=72b811738148aab399966a0435f13b695da1c1c8&stat=instructions In part this is because we now have less early bailouts from promotion, but also due to second order effects (e.g. for one case I looked at we spend more time in SLP now). Differential Revision: https://reviews.llvm.org/D133192	2022-12-20 10:02:46 +01:00
Sameer Sahasrabuddhe	475ce4c200	RFC: Uniformity Analysis for Irreducible Control Flow Uniformity analysis is a generalization of divergence analysis to include irreducible control flow: 1. The proposed spec presents a notion of "maximal convergence" that captures the existing convention of converging threads at the headers of natual loops. 2. Maximal convergence is then extended to irreducible cycles. The identity of irreducible cycles is determined by the choices made in a depth-first traversal of the control flow graph. Uniformity analysis uses criteria that depend only on closed paths and not cycles, to determine maximal convergence. This makes it a conservative analysis that is independent of the effect of DFS on CycleInfo. 3. The analysis is implemented as a template that can be instantiated for both LLVM IR and Machine IR. Validation: - passes existing tests for divergence analysis - passes new tests with irreducible control flow - passes equivalent tests in MIR and GMIR Based on concepts originally outlined by Nicolai Haehnle <nicolai.haehnle@amd.com> With contributions from Ruiling Song <ruiling.song@amd.com> and Jay Foad <jay.foad@amd.com>. Support for GMIR and lit tests for GMIR/MIR added by Yashwant Singh <yashwant.singh@amd.com>. Differential Revision: https://reviews.llvm.org/D130746	2022-12-20 07:22:24 +05:30
Florian Hahn	8a3efcd40b	[ValueTracking] Consider single poison operands in propgatesPoison. This patch updates propgatesPoison to take a Use as argument and propagatesPoison now returns true if the passed in operand causes the user to yield poison if the operand is poison This allows propagating poison if the condition of a select is poison. This helps improve results for programUndefinedIfUndefOrPoison. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D111643	2022-12-19 11:47:51 +00:00
ibricchi	07af0e2d3e	Reapply "[InlineAdvisor] Allow loading advisors as plugins" This reverts commit 8d22a63e2c8b4931113ca9d1ee8b17f7ff453e81. Fix was missing dependency.	2022-12-17 10:35:14 -08:00
Mircea Trofin	8d22a63e2c	Revert "[InlineAdvisor] Allow loading advisors as plugins" This reverts commit a00aaf2b1317fbc224dc6606ef7c2a10d617f28f. Example failures: https://lab.llvm.org/buildbot#builders/68/builds/44933 https://lab.llvm.org/buildbot#builders/230/builds/6938	2022-12-16 16:10:22 -08:00
ibricchi	a00aaf2b13	[InlineAdvisor] Allow loading advisors as plugins Adds the ability to load InlineAdvisors as plugins. This allows developing and distributing inlining heuristics outside of tree. The PluginInlineAdvisorAnalysis class serves as the entry point for dynamic advisors. Plugins must register instances of this class to provide their own InliningAdvisor. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D139644	2022-12-16 16:00:37 -08:00
Fangrui Song	2fa744e631	std::optional::value => operator*/operator-> value() has undesired exception checking semantics and calls __throw_bad_optional_access in libc++. Moreover, the API is unavailable without _LIBCPP_NO_EXCEPTIONS on older Mach-O platforms (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS). This commit fixes LLVMAnalysis and its dependencies.	2022-12-16 22:44:08 +00:00
David Goldblatt	61042d2806	[AA][Intrinsics] Add separate_storage assumptions. This operand bundle on an assume informs alias analysis that the arguments point to regions of memory that were allocated separately (i.e. different heap allocations, different allocas, or different globals). As a safety measure, we leave the analysis flag-disabled by default. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D136514	2022-12-16 11:05:00 -08:00
Nikita Popov	29fa062f0a	[SCEV] Add SCEV::operands() method (NFC) Add an operands() method on SCEV, which forwards to the operands() method of individual SCEV expressions.	2022-12-16 15:50:42 +01:00
Nikita Popov	04d652994d	[SCEV] Return ArrayRef for SCEV operands() (NFC) Use a consistent type for the operands() methods of different SCEV types. Also make the API consistent by only providing operands(), rather than also providin op_begin() and op_end() for some of them.	2022-12-16 15:36:19 +01:00
David Goldblatt	02988fce76	[AA] Allow for flow-sensitive analyses. All current analyses ignore the context. We make the argument mandatory for analyses, but optional for the query interface. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D136512	2022-12-15 21:04:38 -08:00
Vasileios Porpodas	cb5ebfa282	[NFC] Cleanup: Remove instances of Function::getBasicBlockList() This is part of a series of patches that aim at making Function::getBasicBlockList() private. Differential Revision: https://reviews.llvm.org/D140121	2022-12-15 13:08:25 -08:00
Kazu Hirata	9112ec6ad0	[mlgo] Use LLVM_HAVE_TFLITE instead of LLVM_HAVE_TF_API This patch replaces uses of LLVM_HAVE_TF_API with LLVM_HAVE_TFLITE in a couple of CMakeLists.txt. Now that 842b0d0fe2dd142305a9461e50cdce9aff7f86bc has landed, we now have: LLVM_HAVE_TF_API is defined if and only if LLVM_HAVE_TFLITE evaluates to true in the CMake variable world (assuming that you do not set LLVM_HAVE_TF_API on the cmake invocation). FWIW, the story is a little different in the C++ macro world, where: LLVM_HAVE_TF_API is defined if and only if LLVM_HAVE_TFLITE is defined This is why edc83a15b45e6b91fce3f35622a6b0a6d34e5211 consisted only of mechanical replacements. Differential Revision: https://reviews.llvm.org/D140061	2022-12-15 11:11:24 -08:00
Kazu Hirata	6eb0b0a045	Don't include Optional.h These files no longer use llvm::Optional.	2022-12-14 21:16:22 -08:00
Florian Hahn	6e86b544dd	[SCEV] Cache folded SExt SCEV expressions. Use FoldID to cache SignExtendExprs that get folded to a different SCEV. Depends on D137505. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D137849	2022-12-14 11:59:19 +00:00
Fangrui Song	d4b6fcb32e	[Analysis] llvm::Optional => std::optional	2022-12-14 07:32:24 +00:00
Sanjay Patel	6e6fe27689	[ValueTracking] peek through extends in haveNoCommonBitsSet (2nd try) The 1st try was not clean because a portion of the code diff made it into the pre-commit patch to add tests. This should be the same end result without the muddied code diff. Original commit message: In cases with matching extends, this allows changing an 'add' into an 'or' and narrowing the 'or' which then simplifies to a constant. In cases with opposite extends, we just convert to an 'or' currently, but that could be reduced too. https://alive2.llvm.org/ce/z/fTHzdb	2022-12-13 16:57:45 -05:00
Sanjay Patel	41513bc7a2	Revert "[InstCombine] add tests for add-of-extends; NFC" This reverts commit c8cba0bc4a8c9f4f3f10e17f601ed924dfb82bef. An unintended code change snuck into this (was supposed to just add tests).	2022-12-13 16:12:09 -05:00

1 2 3 4 5 ...

12075 Commits