llvm-project

Author	SHA1	Message	Date
Nikita Popov	b9bba6ca9f	[BasicAA] Track nuw through decomposed expressions (#106512 ) When we decompose the GEP offset expression, and the arithmetic is not performed using nuw operations, we cannot retain the nuw flag on the decomposed GEP. For example, if we have `gep nuw p, (a-1)`, this is not at all the same as `gep nuw (gep nuw p, a), -1`. Fix this by tracking NUW through linear expression decomposition, similarly to what we already do for the NSW flag. This fixes the miscompilation reported in https://github.com/llvm/llvm-project/pull/105496#issuecomment-2315322220.	2024-09-02 12:11:03 +02:00
Hari Limaye	ba84cfbe0c	[BasicAA] Use nuw attribute of GEPs (#98608 ) Use the nuw attribute of GEPs to prove that pointers do not alias, in cases where RHS lies at LHS + BaseOffset +<nuw> Indices, with an access of V2Size starting at LHS and an access of V1Size starting at RHS. If the difference between pointers is Offset +<nuw> Indices then we know that the addition does not wrap the pointer index type (add nuw) and the constant Offset is a lower bound on the distance between the pointers. We can then prove NoAlias via Offset u>= V2Size.	2024-08-20 11:00:03 +01:00
Matt Arsenault	9c7c3f94ef	BasicAA: Fix assert when indexing address spaces with different sizes (#103713 ) Fixes #103500	2024-08-14 14:53:06 +04:00
Nikita Popov	3c87f66b7e	[BasicAA] Make use of nusw+nuw -> nneg implication (#102141 ) If the GEP is both nuw and inbounds/nusw, the offset is non-negative. Pass this information to CastedValue and make use of it when determining the value range. Proof for nusw+nuw->nneg: https://alive2.llvm.org/ce/z/a_CKAw Proof for the test case: https://alive2.llvm.org/ce/z/yJ3ymP	2024-08-07 12:47:21 +02:00
Nikita Popov	d56d808fdc	[BasicAA] Check nusw instead of inbounds For the offset scaling, this is sufficient to guarantee nsw. The other checks for inbounds in this file do need proper inbounds.	2024-08-06 14:38:49 +02:00
Nikita Popov	ebab105670	[IR] Don't strip through pointer to vector of pointer bitcasts When using stripPointerCasts() and getUnderlyingObject(), don't strip through a bitcast from ptr to <1 x ptr>, which is not a no-op pointer cast. Calling code is generally not prepared to handle that situation, resulting in incorrect alias analysis results for example. Fixes https://github.com/llvm/llvm-project/issues/97600.	2024-07-04 09:47:59 +02:00
Alex MacLean	d881bac6fa	[BasicAA] Consider 'nneg' flag when comparing CastedValues (#94129 ) Any of the `zext` bits in a `zext nneg` can be converted to `sext` but when checking if casts are compatible `BasicAA` fails to take into account `nneg`. This change adds tracking of `nneg` to the `CastedValue` struct and ensures that `sext` and `zext` bits are treated as interchangeable when either `CastedValue` has a `nneg`. When distributing casted values in `GetLinearExpression` we conservatively discard the `nneg` from the `CastedValue`, except in the case of `shl nsw`, where we know the sign has not changed to negative.	2024-06-04 08:32:57 -07:00
David Green	5e6b4be5cb	[BasicAA] Treat different VScale intrinsics as the same value. (#81152 ) The IR may contain multiple llvm.vscale intrinsics that have not been CSEd. This patch ensures that multiple vscales can be treated the same, either in the decomposition of geps and when we subtract one decomposition from another.	2024-02-12 11:27:49 +00:00
David Green	9d8a236164	[BasicAA] Check for Overflow using vscale_range (#81144 ) This extends #80818 when IsNSW is lost (possibly due to looking through multiple GEPs), to check the vscale_range for an access that will not overflow even with the maximum range.	2024-02-12 10:21:20 +00:00
David Green	9981f5a72e	[BasicAA] Add extra onevscale test for multiple dependent geps that lose the NSW flag. NFC	2024-02-10 13:25:53 +00:00
David Green	0079136f7d	[BasicAA] Fix Scale check in vscale aliasing. (#81174 ) This is a fix for #80818, as pointed out in #81144 it should be checking the abs of Scale. The added test changes from NoAlias to MayAlias.	2024-02-09 07:48:43 +00:00
David Green	878234b320	[BasicAA] Scalable offset with scalable typesize. (#80818 ) This patch adds a simple alias analysis check for accesses that are scalable with an offset between them that is also trivially scalable (there are no other constant/variable offsets). We essentially divide each side by vscale and are left needing to check that the offset >= typesize.	2024-02-08 11:07:33 +00:00
David Green	ef05b4b520	[BasicAA] More vscale tests. NFC This time with i8 geps and scale intrinsics, along with multiple vscale intrinsics that can be treated as identical.	2024-02-08 09:31:26 +00:00
David Green	84ea236af9	[BasicAA] Handle scalable type sizes with constant offsets (#80445 ) This is a separate, but related issue to #69152 that was attempting to improve AA with scalable dependency distances. This patch attempts to improve when there are scalable accesses with a constant offset between them. We happen to get a report of such a thing recently, where so long as the vscale_range is known, the maximum size of the access can be assessed and better aliasing results can be returned. The Upper range of the vscale_range, along with known part of the typesize are used to prove that Off >= CR.upper * LSize. It does not try to produce PartialAlias results at the moment from the lower vscale_range. It also enables the added benefit of allowing better alias analysis when the RHS of the two values is scalable, but the LHS is normal and can be treated like any other aliasing query.	2024-02-05 11:20:50 +00:00
Nikita Popov	1aee1e1f4c	[Analysis] Convert tests to opaque pointers (NFC)	2024-02-05 12:04:39 +01:00
David Green	de4360d7d5	[BasicAA] Add extra scalable typesize and offset tests. NFC A collection of tests from #69152 and for constant offsets with scalable typesizes.	2024-02-03 21:02:23 +00:00
Nikita Popov	90ba33099c	[InstCombine] Canonicalize constant GEPs to i8 source element type (#68882 ) This patch canonicalizes getelementptr instructions with constant indices to use the `i8` source element type. This makes it easier for optimizations to recognize that two GEPs are identical, because they don't need to see past many different ways to express the same offset. This is a first step towards https://discourse.llvm.org/t/rfc-replacing-getelementptr-with-ptradd/68699. This is limited to constant GEPs only for now, as they have a clear canonical form, while we're not yet sure how exactly to deal with variable indices. The test llvm/test/Transforms/PhaseOrdering/switch_with_geps.ll gives two representative examples of the kind of optimization improvement we expect from this change. In the first test SimplifyCFG can now realize that all switch branches are actually the same. In the second test it can convert it into simple arithmetic. These are representative of common optimization failures we see in Rust. Fixes https://github.com/llvm/llvm-project/issues/69841.	2024-01-24 15:25:29 +01:00
Bruno De Fraine	509f634f76	[BasicAA] Fix new test Analysis/BasicAA/separate_storage-alias-sets.ll An update of the test was not included in 656bf13004 since it was added after the branch point of that patch.	2024-01-17 17:33:58 +01:00
Nikita Popov	5f57ad85a1	[BasicAA] Remove incorrect rule about constant pointers (#76815 ) BasicAA currently says that any Constant cannot alias an identified local object. This is not correct if the local object escaped, as it's possible to create a pointer to the escaped object using an inttoptr constant expression base. To compensate for this, make sure that inttoptr constant expressions are treated as escape sources, just like inttoptr instructions. This ensures that the optimization can still be applied if the local object is non-escaping. This is sufficient to still optimize the original motivating case from c53e2ecf0296a55d3c33c19fb70a3aa7f81f2732. Fixes https://github.com/llvm/llvm-project/issues/76789.	2024-01-17 09:31:00 +01:00
David Green	d69efa4015	[BasicAA] Handle disjoint or as add in DecomposeGEP. (#78209 ) This removes the MaskedValueIsZero check in decomposing geps in BasicAA, using the isDisjoint flags instead. This relies on the disjoint flags being present when AA is run. The alternative would be to keep the old MaskedValueIsZero check too if this causes issues.	2024-01-16 09:22:20 +00:00
David Goldblatt	852596d804	[BasicAA] Guess reasonable contexts for separate storage hints (#76770 ) The definition of the pointer of the memory location being queried is always one such context. Even this conservative guess can be better than no guess at all in some cases. Fixes #64666 Co-authored-by: David Goldblatt <davidgoldblatt@meta.com>	2024-01-04 11:29:00 -08:00
Nikita Popov	9862491436	[BasicAA] Add tests for #76789 (NFC)	2024-01-03 14:24:31 +01:00
Florian Hahn	2d39cb4983	[BasicAA] Don't use MinAbsVarIndex = 1. (#72993 ) The current code incorrectly assumed that the absolute variable index needs to be at least 1, if the variable is != 0. This is incorrect, in case multiplying with Scale wraps. The code below already checks for wrapping properly, so just remove the incorrect assignment. Fixes https://github.com/llvm/llvm-project/issues/72831.	2023-11-21 14:27:50 +00:00
Florian Hahn	ad86d3e94f	[BasicAA] Add wrapping test for #72831 . Add test with GEP where the index may wrap.	2023-11-21 13:38:57 +00:00
Alex Richardson	e39f6c1844	[opt] Infer DataLayout from triple if not specified There are many tests that specify a target triple/CPU flags but no DataLayout which can lead to IR being generated that has unusual behaviour. This commit attempts to use the default DataLayout based on the relevant flags if there is no explicit override on the command line or in the IR file. One thing that is not currently possible is to differentiate a missing datalayout from `target datalayout = ""` in the IR file, since the current APIs don't allow detecting this case. If it is considered useful to support this case (instead of passing "-data-layout=" on the command line), I can change IR parsers to track whether they have seen such a directive and change the callback type. Differential Revision: https://reviews.llvm.org/D141060	2023-10-26 12:07:37 -07:00
Mikhail Gudim	9abf3df111	[ValueTracking] Analyze `Select` in `isKnownNonEqual`. (#68427 ) Basic way to recursively analyze `select` in `isKnownNonEqual`: `select %c, %t, %f` is non-equal to `%x` if `%t` is non-equal to `%x` and `%f` is non-equal to `%x`.	2023-10-25 01:08:40 -04:00
Yingwei Zheng	ea4cc2007e	[BasicAA] Remove NSW flags when merging scales (#69122 ) When merging scales of `LinearExpression` that have common index variables, we cannot guarantee the NSW flag still applies to the merged expression. Fixes #69096.	2023-10-16 04:40:10 +08:00
Yingwei Zheng	4698b99262	[BasicAA] Add pre-commit tests for PR69096. NFC.	2023-10-16 01:48:39 +08:00
Mikhail Gudim	4a2a6a4111	[ValueTracking] Try to infer range of select from true and false values. (#68256 ) When computing range of `select` instruction, first compute the union of ranges of "True" and "False" operands of the `select` instruction.	2023-10-05 13:23:05 -04:00
Dhruv Chawla	3e992d81af	[InferAlignment] Enable InferAlignment pass by default This gives an improvement of 0.6%: https://llvm-compile-time-tracker.com/compare.php?from=7d35fe6d08e2b9b786e1c8454cd2391463832167&to=0456c8e8a42be06b62ad4c3e3cf34b21f2633d1e&stat=instructions:u Differential Revision: https://reviews.llvm.org/D158600	2023-09-20 12:08:52 +05:30
Nathan Sidwell	ef1722497b	[llvm] Remove unwanted attribute checking This test is checking alias analysis. The attribute tests are brittle but fortunately unnecessary. Delete them. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D156600	2023-08-08 20:47:28 -04:00
Nikita Popov	c31eb827b7	[BasicAA] Fix nsw handling for negated scales (PR63266) We currently preserve the nsw flag when negating scales, which is incorrect for INT_MIN. However, just dropping the NSW flag in this case makes BasicAA behavior unreliable and asymmetric, because we may or may not drop the NSW flag depending on which side gets subtracted. Instead, leave the Scale alone and add an additional IsNegated flag, which indicates that the whole VarIndex should be interpreted as a subtraction. This allows us to retain the NSW flag. When accumulating the offset range, we need to use subtraction instead of adding for IsNegated indices. Everything else works on the absolute value of the scale, so the negation does not matter there. Fixes https://github.com/llvm/llvm-project/issues/63266. Differential Revision: https://reviews.llvm.org/D153270	2023-06-27 09:40:09 +02:00
Nikita Popov	c26fe199c1	[BasicAA] Add test for PR63266 (NFC)	2023-06-19 14:40:54 +02:00
David Goldblatt	61042d2806	[AA][Intrinsics] Add separate_storage assumptions. This operand bundle on an assume informs alias analysis that the arguments point to regions of memory that were allocated separately (i.e. different heap allocations, different allocas, or different globals). As a safety measure, we leave the analysis flag-disabled by default. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D136514	2022-12-16 11:05:00 -08:00
Nikita Popov	303c308e45	[BasicAA] Convert tests to opaque pointers (NFC)	2022-12-16 11:57:17 +01:00
Nikita Popov	3caf301a8b	[BasicAA] Convert some tests to opaque pointers (NFC)	2022-12-16 10:54:23 +01:00
Nikita Popov	243acd5dcb	[BasicAA] Remove support for PhiValues analysis BasicAA currently has an optional dependency on the PhiValues analysis. However, at least with our current pipeline setup, we never actually make use of it. It's possible that this used to work with the legacy pass manager, but I'm not sure of that either. Given that this analysis has not actually been in use for a long time, and nobody noticed or complained, I think we should drop support for it and focus on one code path. It is worth noting that analysis quality for the non-PhiValues case has significantly improved in the meantime. If we really wanted to make use of PhiValues, the right way would probably be to pass it in via AAQI in places we want to use it, rather than using an optional pass manager dependency (which are an unpredictable PITA and should really only ever be used for analyses that are only preserved and not used). Differential Revision: https://reviews.llvm.org/D139719	2022-12-12 09:47:30 +01:00
Nikita Popov	cc1e2bb4d4	[BasicAA] Handle phi with itself as incoming value We can skip such incoming values. This was already done by PhiValues if present, but we can also do this without the additional analysis.	2022-12-09 16:17:45 +01:00
Nikita Popov	fe9e442c57	[BasicAA] Add test for phi that contains itself (NFC) This currently produces a better result with PhiValues.	2022-12-09 16:14:37 +01:00
Nikita Popov	258e551615	[BasicAA] Convert test to opaque pointers (NFC)	2022-12-09 16:05:46 +01:00
Nikita Popov	05ff7606c9	[BasicAA] Convert some tests to opaque pointers (NFC)	2022-12-09 15:49:46 +01:00
Nikita Popov	fa4b518f1d	[BasicAA] Guard against empty successors list (PR59360) Succs can be empty here if a phi predecessor is unreachable. Fixes https://github.com/llvm/llvm-project/issues/59360	2022-12-06 16:59:00 +01:00
Florian Hahn	ae852750b3	[MemoryLocation] Support memcpy_chk in getForArgument. Similar to 9f9e8ba114ce, add support for memcpy_chk to MemoryLocation::getForArgument. The size argument for memcpy_chk is an upper bound for the size of the pointer argument. memcpy_chk may read/write less than the specified length, if it exceeds the specified max size and aborts. Reviewed By: xbolva00, jdoerfert Differential Revision: https://reviews.llvm.org/D138613	2022-11-24 19:17:48 +00:00
Florian Hahn	4b4cbbd7fb	[BasicAA] Add tests with __memcpy_chk.	2022-11-23 22:09:53 +00:00
Nikita Popov	304f1d59ca	[IR] Switch everything to use memory attribute This switches everything to use the memory attribute proposed in https://discourse.llvm.org/t/rfc-unify-memory-effect-attributes/65579. The old argmemonly, inaccessiblememonly and inaccessiblemem_or_argmemonly attributes are dropped. The readnone, readonly and writeonly attributes are restricted to parameters only. The old attributes are auto-upgraded both in bitcode and IR. The bitcode upgrade is a policy requirement that has to be retained indefinitely. The IR upgrade is mainly there so it's not necessary to update all tests using memory attributes in this patch, which is already large enough. We could drop that part after migrating tests, or retain it longer term, to make it easier to import IR from older LLVM versions. High-level Function/CallBase APIs like doesNotAccessMemory() or setDoesNotAccessMemory() are mapped transparently to the memory attribute. Code that directly manipulates attributes (e.g. via AttributeList) on the other hand needs to switch to working with the memory attribute instead. Differential Revision: https://reviews.llvm.org/D135780	2022-11-04 10:21:38 +01:00
Nikita Popov	5fe9273c73	[BasicAA] Re-enable cs-cs-arm.ll test (PR58738) Fixes https://github.com/llvm/llvm-project/issues/58738.	2022-11-02 14:22:44 +01:00
Paul Robinson	9a4aa37dbf	Patch up attributes on a newly enabled test	2022-11-01 14:14:40 -07:00
Paul Robinson	4f0a1201a4	[lit][REQUIRES] Fix some tests with incorrect REQUIRES clauses These weren't running anywhere because of bad specifications. One test has bit-rotted and had to be XFAILed, the rest are okay. Differential Revision: https://reviews.llvm.org/D136612	2022-11-01 13:49:23 -07:00
Nikita Popov	6aa672f141	[IR] Take operand bundles into account for call argument readonly/writeonly We currently only take operand bundle effects into account when querying the function-level memory attributes. However, I believe that we also need to do the same for parameter attributes. For example, a call with deopt bundle to a function with readnone parameter attribute cannot treat that parameter as readnone, because the deopt bundle may read it. Differential Revision: https://reviews.llvm.org/D136834	2022-11-01 09:30:03 +01:00
Patrick Walton	01859da84b	[AliasAnalysis] Introduce getModRefInfoMask() as a generalization of pointsToConstantMemory(). The pointsToConstantMemory() method returns true only if the memory pointed to by the memory location is globally invariant. However, the LLVM memory model also has the semantic notion of locally-invariant: memory that is known to be invariant for the life of the SSA value representing that pointer. The most common example of this is a pointer argument that is marked readonly noalias, which the Rust compiler frequently emits. It'd be desirable for LLVM to treat locally-invariant memory the same way as globally-invariant memory when it's safe to do so. This patch implements that, by introducing the concept of a ModRefInfo mask. A ModRefInfo mask is a bound on the Mod/Ref behavior of an instruction that writes to a memory location, based on the knowledge that the memory is globally-constant memory (in which case the mask is NoModRef) or locally-constant memory (in which case the mask is Ref). ModRefInfo values for an instruction can be combined with the ModRefInfo mask by simply using the & operator. Where appropriate, this patch has modified uses of pointsToConstantMemory() to instead examine the mask. The most notable optimization change I noticed with this patch is that now redundant loads from readonly noalias pointers can be eliminated across calls, even when the pointer is captured. Internally, before this patch, AliasAnalysis was assigning Ref to reads from constant memory; now AA can assign NoModRef, which is a tighter bound. Differential Revision: https://reviews.llvm.org/D136659	2022-10-31 13:03:41 -07:00
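
The ModRefInfo mask described in the last entry (01859da84b) can be illustrated with a small standalone sketch. This is a simplified model and not LLVM's actual headers: the `Invariance` enum and the `maskFor` helper are hypothetical names introduced only for this illustration, and the enumerator values are an assumption chosen so that `&` behaves as the commit message describes (NoModRef for globally constant memory, Ref for locally constant memory).

```cpp
#include <cassert>
#include <cstdint>

// Standalone model of a Mod/Ref lattice (assumed bit encoding, not LLVM's header).
enum class ModRefInfo : std::uint8_t {
  NoModRef = 0,       // neither reads nor writes the location
  Ref = 1,            // may read the location
  Mod = 2,            // may write the location
  ModRef = Ref | Mod, // may read and write the location
};

// Combining a query result with a mask is a plain bitwise AND.
constexpr ModRefInfo operator&(ModRefInfo A, ModRefInfo B) {
  return static_cast<ModRefInfo>(static_cast<std::uint8_t>(A) &
                                 static_cast<std::uint8_t>(B));
}

// Hypothetical classification of the queried memory location.
enum class Invariance { None, Local, Global };

// Hypothetical helper: the mask that bounds any instruction's effect.
constexpr ModRefInfo maskFor(Invariance K) {
  switch (K) {
  case Invariance::Global:
    return ModRefInfo::NoModRef; // globally constant: no observable effect
  case Invariance::Local:
    return ModRefInfo::Ref;      // locally constant: reads only
  case Invariance::None:
    return ModRefInfo::ModRef;   // no restriction
  }
  return ModRefInfo::ModRef;
}

// A call that might read and write is narrowed to a read for locally
// invariant memory, and to no effect for globally invariant memory.
static_assert((ModRefInfo::ModRef & maskFor(Invariance::Local)) ==
                  ModRefInfo::Ref,
              "local invariance keeps only the Ref bit");
static_assert((ModRefInfo::ModRef & maskFor(Invariance::Global)) ==
                  ModRefInfo::NoModRef,
              "global invariance clears both bits");
```

Under these assumptions, a write-only effect on locally invariant memory collapses to NoModRef, since the mask drops the Mod bit.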


488 Commits