llvm-project

Author	SHA1	Message	Date
David Green	0f919444ad	[ValueTracking] Handle recursive phis in knownFPClass (#114008 ) As a follow-on to 113686, this breaks the recursion between phi nodes that have p1 = phi(x, p2) and p2 = phi(y, p1). The knownFPClass can be calculated from the classes of p1 and p2.	2024-11-01 13:38:29 +00:00
David Green	9735c05186	[ValueTracking] Compute KnownFP state from recursive select/phi. (#113686 ) Given a recursive phi with select: %p = phi [ 0, entry ], [ %sel, loop] %sel = select %c, %other, %p The fp state can be calculated using the knowledge that the select/phi pair can only be the initial state (0 here) or from %other. This adds a short-cut into computeKnownFPClass for PHI to detect that the select is recursive back to the phi, and if so use the state from the other operand. This helps to address a regression from #83200.	2024-10-31 07:50:44 +00:00
David Green	f358422268	[Attributor] Add nofpclass test for phi+select recurrences. NFC	2024-10-30 08:10:35 +00:00
Yingwei Zheng	8d8bb4032b	[Verifier] Verify attribute `denormal-fp-math[-f32]` (#112310 ) Some typos are also fixed. Address https://github.com/llvm/llvm-project/pull/112067#pullrequestreview-2363722447.	2024-10-15 17:32:16 +08:00
Johannes Doerfert	335e137267	[Attributor][FIX] Track returned pointer offsets (#110534 ) If the pointer returned by a function is not "the base pointer" but has an offset, we need to track the offset such that users can apply it to their offset chain when they create accesses. This was reported by @ye-luo and reduced test cases are included. The OffsetInfo was moved and the container was replaced with a set to avoid excessive growth. Otherwise, the patch just replaces the "returns pointer" flag with the "returned offsets", and deals with the applying to offsets at the call site. --------- Co-authored-by: Johannes Doerfert <jdoerfert@llnl.gov>	2024-10-01 12:41:15 -05:00
Shilei Tian	0b7a18bd4a	[Attributor] Use more appropriate approach to check flat address space (#108713 )	2024-09-27 18:26:55 -04:00
Yonghong Song	becc02ce93	Revert "[Transforms][IPO] Add func suffix in ArgumentPromotion and DeadArgume… (#105742 )" This reverts commit 959448fbd6bc6f74fb3f9655b1387d0e8a272ab8. Reverting because multiple test failures e.g. https://lab.llvm.org/buildbot/#/builders/187/builds/1290 https://lab.llvm.org/buildbot/#/builders/153/builds/9389 and maybe a few others.	2024-09-19 03:54:13 -07:00
yonghong-song	959448fbd6	[Transforms][IPO] Add func suffix in ArgumentPromotion and DeadArgume… (#105742 ) …ntElimination ArgumentPromotion and DeadArgumentElimination passes could change function signatures but the function name remains the same as before the transformation. This makes it hard for tracing with bpf programs where user tends to use function signature in the source. See discussion [1] for details. This patch added suffix to functions whose signatures are changed. The suffix lets users know that function signature has changed and they need to impact the IR or binary to find modified signature before tracing those functions. The suffix for ArgumentPromotion is ".argprom" and the suffixes for DeadArgumentElimination are ".argelim" and ".retelim". The suffix also gives user hints about what kind of transformation has been done. With this patch, I built a recent linux kernel with full LTO enabled. I got 4 functions with only argpromotion like ``` set_track_update.argelim.argprom pmd_trans_huge_lock.argprom ... ``` I got 1058 functions with only deadargelim like ``` process_bit0.argelim pci_io_ecs_init.argelim ... ``` I got 3 functions with both argpromotion and deadargelim ``` set_track_update.argelim.argprom zero_pud_populate.argelim.argprom zero_pmd_populate.argelim.argprom ``` [1] https://github.com/llvm/llvm-project/issues/104678	2024-09-19 10:21:58 +02:00
Johannes Doerfert	56a033462e	[Attributor] Keep track of reached returns in AAPointerInfo (#107479 ) Instead of visiting call sites in Attribute::checkForAllUses, we now keep track of returns in AAPointerInfo and use the call site return information as required. This way, the user of AAPointerInfo(CallSite)Argument can determine if the call return should be visited. We do not collect them as "may accesses" in the AAPointerInfo(CallSite)Argument itself in case a return user is found.	2024-09-10 08:13:21 -07:00
Johannes Doerfert	84bf0da34d	[Attributor][FIX] Ensure to always translate call site arguments (#107323 ) When we propagate call site arguments we always need to translate them, this is important as we ended up picking the function argument for a recurisve call not the call site argument. `@recBad` and `@recGood` in `returned.ll` show the problem as they used to transform them the same way. The restructuring cleans the code up and helps derive more "returned" arguments and better information in the presence of recursive calls. The "dropped" attributes are simply dropped because we do not query them anymore, not because we cannot derive them.	2024-09-05 13:37:21 -07:00
Johannes Doerfert	e6dece9f69	[Attributor][FIX] Mark "may" accesses through call sites as such (#107439 ) Before, we kept the call site access kind (may/must) when we translated the access. However, the pointer we access it through (by passing it to the callee) might not be the underlying object. We have similar logic when we add store and load accesses.	2024-09-05 13:33:58 -07:00
Johannes Doerfert	3726f9c575	[Attributor][NFC] Pre-commits for #107439 (#107457 )	2024-09-05 13:10:37 -07:00
Alex MacLean	369d8148e0	[ValueTracking] use KnownBits to compute fpclass from bitcast (#97762 ) When we encounter a bitcast from an integer type we can use the information from `KnownBits` to glean some information about the fpclass: - If the sign bit is known, we can transfer this information over. - If the float is IEEE format and enough of the bits are known, we may be able to prove or rule out some fpclasses such as NaN, Zero, or Inf.	2024-08-30 07:34:49 -07:00
Johannes Doerfert	8266d47cd1	[Attributor] Improve AAUnderlyingObjects (#104835 ) - Allocas and GlobalValues cannot be simplified, so we should not try. - If we never used any assumed state, the AAUnderlyingObjects doesn't require an additional update. - If we have seen an object (or it's underlying object) before, we do not need to inspect it anymore. The original logic for "SeenObjects" was flawed and caused us to add intermediate values to the underlying object list if a PHI or select instruction referenced the same underlying object twice. The test changes are all instances of this situation and we now correctly derive `memory(none)` for the functions that only access stack memory. --------- Co-authored-by: Shilei Tian <i@tianshilei.me>	2024-08-20 12:05:20 -07:00
Nikita Popov	472c79ca52	[IR] Check that arguments of naked function are not used (#104757 ) Verify that the arguments of a naked function are not used. They can only be referenced via registers/stack in inline asm, not as IR values. Doing so will result in assertion failures in the backend. There's probably more that we should verify, though I'm not completely sure what the constraints are (would it be correct to require that naked functions are exactly an inline asm call + unreachable, or is more allowed?) Fixes https://github.com/llvm/llvm-project/issues/104718.	2024-08-20 09:29:05 +02:00
Shilei Tian	907c7eb311	[Attributor] Enable `AAAddressSpace` in `OpenMPOpt` (#104363 ) This reverts commit e592c2dcf5b7d2da6c2564f5d9990aa34079bad4. We can finally reland the PR since the issue that caused the PR to be reverted has been resolved in https://github.com/llvm/llvm-project/pull/104051.	2024-08-16 13:33:48 -04:00
Johannes Doerfert	7156bcf286	[Attributor][FIX] Ensure we do not use stale references (#104495 ) When copying map entries, we might run into resizing and invalidate the RHS of the assignment. We dealt with this before and now use the proper helper to avoid the problem in another place. Fixes: https://github.com/llvm/llvm-project/issues/104397	2024-08-15 18:45:36 -04:00
Matt Arsenault	f9060f1b7e	AMDGPU: Fix using wrong alloca address space in test (#102108 )	2024-08-07 00:19:22 +04:00
Shilei Tian	9373a43218	[Attributor] Indicate optimistic fixed point if an instruction already has non-zero address space (#101589 )	2024-08-01 22:55:09 -04:00
Vidush Singhal	c7633ddb28	[Attributor]: Ensure cycle info is not null when handling PHI in AAPointerInfo (#97321 ) Ensure cycle info object is not null for simple PHI case for the test: `llvm/test/Transforms/Attributor/phi_bug_pointer_info.ll` Debug info Before the change: ``` Accesses by bin after update: [8-12] : 1 - 9 - store i32 %0, ptr %field2, align 4 - c: %0 = load i32, ptr %val, align 4 [32-36] : 1 - 9 - store i32 %1, ptr %field8, align 4 - c: %1 = load i32, ptr %val2, align 4 [2147483647-4294967294] : 1 - 6 - %ret = load i32, ptr %x, align 4 - c: <unknown> ``` Debug info After the change: ``` Accesses by bin after update: [8-12] : 2 - 9 - store i32 %0, ptr %field2, align 4 - c: %0 = load i32, ptr %val, align 4 - 6 - %ret = load i32, ptr %x, align 4 - c: <unknown> [32-36] : 2 - 9 - store i32 %1, ptr %field8, align 4 - c: %1 = load i32, ptr %val2, align 4 - 6 - %ret = load i32, ptr %x, align 4 - c: <unknown> ``` Co-authored-by: Vidush Singhal <singhal2@ruby964.llnl.gov>	2024-07-01 17:20:34 -07:00
Fangrui Song	89e8e63f47	[Attributor] Stabilize llvm.assume output Don't rely on the iteration order of DenseSet<StringRef>, which is not guaranteed to be deterministic.	2024-06-19 15:36:46 -07:00
Ethan Luis McDonough	b629d4b912	[Attributor] Prevent infinite loop in AAGlobalValueInfoFloating (#94941 ) Global variables that reference themselves alongside a function that is called indirectly can cause an infinite loop in `AAGlobalValueInfoFloating`. The recursive reference is continually pushed back into the workload, causing the attributor to hang indefinitely.	2024-06-18 09:36:42 -07:00
Stephen Tozer	094572701d	[RemoveDIs] Print IR with debug records by default (#91724 ) This patch makes the final major change of the RemoveDIs project, changing the default IR output from debug intrinsics to debug records. This is expected to break a large number of tests: every single one that tests for uses or declarations of debug intrinsics and does not explicitly disable writing records. If this patch has broken your downstream tests (or upstream tests on a configuration I wasn't able to run): 1. If you need to immediately unblock a build, pass `--write-experimental-debuginfo=false` to LLVM's option processing for all failing tests (remember to use `-mllvm` for clang/flang to forward arguments to LLVM). 2. For most test failures, the changes are trivial and mechanical, enough that they can be done by script; see the migration guide for a guide on how to do this: https://llvm.org/docs/RemoveDIsDebugInfo.html#test-updates 3. If any tests fail for reasons other than FileCheck check lines that need updating, such as assertion failures, that is most likely a real bug with this patch and should be reported as such. For more information, see the recent PSA: https://discourse.llvm.org/t/psa-ir-output-changing-from-debug-intrinsics-to-debug-records/79578	2024-06-14 15:07:27 +01:00
Johannes Doerfert	f33310e450	[Attributor][FIX] Ensure nonnull IR deduction is not optimistic (#93322 ) We cannot use assumed dead information for nonnull IR-implied deduction as we will never go back and re-check. This was reported in https://github.com/llvm/llvm-project/pull/85810	2024-06-13 08:42:36 +03:00
Arthur Eubanks	71497cc7a4	[CGSCC] Fix compile time blowup with large RefSCCs (#94815 ) In some modules, e.g. Kotlin-generated IR, we end up with a huge RefSCC and the call graph updates done as a result of the inliner take a long time. This is due to RefSCC::removeInternalRefEdges() getting called many times, each time removing one function from the RefSCC, but each call to removeInternalRefEdges() is proportional to the size of the RefSCC. There are two places that call removeInternalRefEdges(), in updateCGAndAnalysisManagerForPass() and LazyCallGraph::removeDeadFunction(). 1) Since LazyCallGraph can deal with spurious (edges that exist in the graph but not in the IR) ref edges, we can simply not call removeInternalRefEdges() in updateCGAndAnalysisManagerForPass(). 2) LazyCallGraph::removeDeadFunction() still ends up taking the brunt of compile time with the above change for the original reason. So instead we batch all the dead function removals so we can call removeInternalRefEdges() just once. This requires some changes to callers of removeDeadFunction() to not actually erase the function from the module, but defer it to when we batch delete dead functions at the end of the CGSCC run, leaving the function body as "unreachable" in the meantime. We still need to ensure that call edges are accurate. I had also tried deleting dead functions after visiting a RefSCC, but deleting them all at once at the end was simpler. Many test changes are due to not performing unnecessary revisits of an SCC (the CGSCC infrastructure deems ref edge refinements as unimportant when it comes to revisiting SCCs, although that seems to not be consistently true given these changes) because we don't remove some ref edges. Specifically for devirt-invalidated.ll this seems to expose an inlining order issue with the inliner. Probably unimportant for this type of intentionally weird call graph. Compile time: https://llvm-compile-time-tracker.com/compare.php?from=6f2c61071c274a1b5e212e6ad4114641ec7c7fc3&to=b08c90d05e290dd065755ea776ceaf1420680224&stat=instructions:u	2024-06-11 09:50:13 -07:00
Nikita Popov	deab451e7a	[IR] Remove support for icmp and fcmp constant expressions (#93038 ) Remove support for the icmp and fcmp constant expressions. This is part of: https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179 As usual, many of the updated tests will no longer test what they were originally intended to -- this is hard to preserve when constant expressions get removed, and in many cases just impossible as the existence of a specific kind of constant expression was the cause of the issue in the first place.	2024-06-04 08:31:03 +02:00
Johannes Doerfert	d48d108bc6	[Attributor][FIX] Replace AANoFPClass MBEC propagation (#91030 ) The old use of must-be-executed-context (MBEC) did propagate through calls even if that was not allowed. We now only propagate from call site arguments. If there are calls/intrinsics that allows propagation, we need to add them explicitly. Fixes: https://github.com/llvm/llvm-project/issues/78507 --------- Co-authored-by: Matt Arsenault <arsenm2@gmail.com>	2024-06-03 20:41:52 -07:00
Simon Pilgrim	7952720625	[Attributor] Remove unused metadata checks from liveness tests Noticed while triaging the failures on #93673 - the attributor pass doesn't emit any range metadata in these tests	2024-06-03 14:26:42 +01:00
Nikita Popov	63dc31b68b	Reapply [IR] Avoid creating icmp/fcmp constant expressions (#92885 ) Reapply after https://github.com/llvm/llvm-project/pull/93548, which should address the lldb failure on macos. ----- Do not create icmp/fcmp constant expressions in IRBuilder etc anymore, i.e. treat them as "undesirable". This is in preparation for removing them entirely. Part of: https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179	2024-05-31 08:55:59 +02:00
Matt Arsenault	2d61692d4b	Attributor: Do not assume function context in AANoCapture (#91462 ) If the calling function has the null_pointer_is_valid attribute, somehow a null constant reaches here. I'm not sure why exactly, it doesn't happen for other types of constants. Fixes #87856	2024-05-23 19:31:33 +02:00
Daniel Thornburgh	8baf96f306	Revert "[IR] Avoid creating icmp/fcmp constant expressions" (#93087 ) Reverts llvm/llvm-project#92885 due to LLDB CI breakages.	2024-05-22 11:27:55 -07:00
Nikita Popov	108575f02e	[IR] Avoid creating icmp/fcmp constant expressions (#92885 ) Do not create icmp/fcmp constant expressions in IRBuilder etc anymore, i.e. treat them as "undesirable". This is in preparation for removing them entirely. Part of: https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179	2024-05-22 07:40:08 +02:00
Matt Arsenault	0cd2bf3521	ValueTracking: Correct undef handling for constant FP vectors (#92557 ) Treat undef as unknown, and poison as ignorable.	2024-05-19 20:56:21 +02:00
Johannes Doerfert	cd3a4c31bc	[Attributor][NFC] update tests (#91011 )	2024-05-03 16:38:55 -07:00
Matt Arsenault	f78b3466ca	ValueTracking: Treat poison more aggressively in computeKnownFPClass (#87990 ) Assume no valid values, and the sign bit is 0.	2024-04-15 12:51:29 +02:00
Matt Arsenault	bdf428af98	ValueTracking: Consider demanded elts for vector constants in computeKnownFPClass	2024-04-08 09:32:14 -04:00
Matt Arsenault	2bc637b1ce	ValueTracking: Handle ConstantAggregateZero in computeKnownFPClass	2024-04-08 09:26:12 -04:00
Matt Arsenault	0832b85e0f	ValueTracking: Add baseline tests for vector fpclass handling	2024-04-08 09:26:12 -04:00
Noah Goldstein	e4db938a4e	[ValueTracking] Support non-constant idx for `computeKnownFPClass` of `insertelement` Its same logic as before, we just need to intersect what we know about the new Elt and the entire pre-existing Vec. Closes #87708	2024-04-06 17:51:15 -05:00
Noah Goldstein	d5b48ceb74	[ValueTracking] Add tests for non-constant idx for fpclass of `insertelement`; NFC	2024-04-06 17:51:15 -05:00
Matt Arsenault	733640d29e	Attributor: Handle inferring align from use by atomics (#85762 )	2024-03-21 10:54:03 +05:30
Matt Arsenault	1a6953a75d	ValueTracking: Fix bug with fcmp false to nan constant If we had a comparison to a literal nan with a false predicate, we were incorrectly treating it as an unordered compare. This was correct for fcmp true, but not fcmp false. I noticed this in the review for e44d3b3e503fa12fdaead2936b28844aa36237c1 but misdiagnosed the reason. Also change the test for the fcmp true case to be more useful, but it wasn't wrong previously.	2024-03-19 14:52:45 +05:30
Stephen Tozer	2e39b57837	Reapply "[RemoveDIs] Print non-intrinsic debug info in textual IR output (#79281 )" This reapplication changes debug intrinsic declaration removal to only take place when printing final IR, so that the processing format of the Module does not affect the output. This reverts commit d128448efdd4e2bf3c9bc9a5b43ae642aa78026f.	2024-02-27 14:23:52 +00:00
Stephen Tozer	d128448efd	Revert "Reapply "[RemoveDIs] Print non-intrinsic debug info in textual IR output (#79281 )"" Reverted due to some test failures on some buildbots. https://lab.llvm.org/buildbot/#/builders/67/builds/14669 This reverts commit aa436493ab7ad4cf323b0189c15c59ac9dc293c7.	2024-02-27 10:17:24 +00:00
Stephen Tozer	aa436493ab	Reapply "[RemoveDIs] Print non-intrinsic debug info in textual IR output (#79281 )" Fixes the prior issue in which the symbol for a cl-arg was unavailable to some binaries. This reverts commit dc06d75ab27b4dcae2940fc386fadd06f70faffe.	2024-02-27 09:59:08 +00:00
Stephen Tozer	dc06d75ab2	Revert "[RemoveDIs] Print non-intrinsic debug info in textual IR output (#79281 )" Reverted due to failures on buildbots, where a new cl flag was placed in the wrong file, resulting in link errors. https://lab.llvm.org/buildbot/#/builders/198/builds/8548 This reverts commit 0b398256b3f72204ad1f7c625efe4990204e898a.	2024-02-26 18:49:18 +00:00
Stephen Tozer	0b398256b3	[RemoveDIs] Print non-intrinsic debug info in textual IR output (#79281 ) This patch adds support for printing the proposed non-instruction debug info ("RemoveDIs") out to textual IR. This patch does not add any bitcode support, parsing support, or documentation. Printing of the new format is controlled by a flag added in this patch, `--write-experimental-debuginfo`, which defaults to false. The new format will be printed iff this flag is true, so whether we use the IR format is completely independent of whether we use non-instruction debug info during LLVM passes (which is controlled by the `--try-experimental-debuginfo-iterators` flag). Even with the flag disabled, some existing tests need to be updated, as this patch causes debug intrinsic declarations to be changed in a round trip, such that they always appear at the end of a module and have no attributes (this has no functional change on the module). The design of this new IR format was proposed previously on Discourse, and any further discussion about the design can still be contributed there: https://discourse.llvm.org/t/rfc-debuginfo-proposed-changes-to-the-textual-ir-representation-for-debug-values/73491	2024-02-26 18:22:05 +00:00
Yingwei Zheng	a5865c3c3d	[ValueTracking] Fix computeKnownFPClass for fpext (#81972 ) This patch adds the missing `subnormal -> normal` part for `fpext` in `computeKnownFPClass`. Fixes the miscompilation reported by https://github.com/llvm/llvm-project/pull/80941#issuecomment-1947302100.	2024-02-17 23:30:45 +08:00
Jeremy Morse	89ad31fd93	[RemoveDIs][DebugInfo] Perform some pre-turn-on test maintenence (#80885 ) As we'll hopefully move away from using intrinsics for debug-info shortly, this commit stabilizes a few tests to avoid spurious changes in the process. Briefly, there are differences in output when we don't use intrinsics that we're going to suppress in case we have to revert, these are: * The attributor test gets different attributes for the dbg.value intrinsic because it's not present during optimisation. This has no functional effect and there's no need to test for it. * The Scalarizer test exposes a "debug-info affects codegen" problem, but fixing it is fiddly (updating 20 IRBuilder object calls). Pin this test to not change with RemoveDIs, we can relax it later and get the correct behaviour. * DIDefaultTemplateParam.ll tests for explicit metadata node numbers which is generally bad. Add explicit node-number capturing CHECK lines.	2024-02-07 11:13:24 +00:00
Nikita Popov	2d69827c5c	[Transforms] Convert tests to opaque pointers (NFC)	2024-02-05 11:57:34 +01:00

1 2 3 4 5 ...

745 Commits