llvm-project

Author	SHA1	Message	Date
OCHyams	4ece50737d	[Assignment Tracking][NFC] Replace LLVM command line option with a module flag Remove LLVM flag -experimental-assignment-tracking. Assignment tracking is still enabled from Clang with the command line -Xclang -fexperimental-assignment-tracking which tells Clang to ask LLVM to run the pass declare-to-assign. That pass converts conventional debug intrinsics to assignment tracking metadata. With this patch it now also sets a module flag debug-info-assignment-tracking with the value `i1 true` (using the flag conflict rule `Max` since enabling assignment tracking on IR that contains only conventional debug intrinsics should cause no issues). Update the docs and tests too. Reviewed By: CarlosAlbertoEnciso Differential Revision: https://reviews.llvm.org/D142027	2023-01-20 14:24:15 +00:00
Arthur Eubanks	c5ea42bcf4	Revert "[LoopUnroll] Directly update DT instead of DTU." This reverts commit d0907ce7ed9f159562ca3f4cfd8d87e89e93febe. Causes `opt -passes=loop-unroll-full` to crash on ``` define void @foo() { bb: br label %bb1 bb1: ; preds = %bb1, %bb1, %bb switch i1 true, label %bb1 [ i1 true, label %bb2 i1 false, label %bb1 ] bb2: ; preds = %bb1 ret void } ```	2023-01-19 17:01:15 -08:00
Florian Hahn	d0907ce7ed	[LoopUnroll] Directly update DT instead of DTU. The scope of DT updates are very limited when unrolling loops: the DT should only need updating for * new blocks added * exiting blocks we simplified branches This can be done manually without too much extra work. MergeBlockIntoPredecessor also needs to be updated to support direct DT updates. This fixes excessive time spent in DTU for same cases. In an internal example, time spent in LoopUnroll with this patch goes from ~200s to 2s. It also is slightly positive for CTMark: * NewPM-O3: -0.13% * NewPM-ReleaseThinLTO: -0.11% * NewPM-ReleaseLTO-g: -0.13% Notable improvements are mafft (~ -0.50%) and lencod (~ -0.30%), with no workload regressed. https://llvm-compile-time-tracker.com/compare.php?from=78a9ee7834331fb4360457cc565fa36f5452f7e0&to=687e08d011b0dc6d3edd223612761e44225c7537&stat=instructions:u Reviewed By: kuhar Differential Revision: https://reviews.llvm.org/D141487	2023-01-19 18:11:54 +00:00
Nikita Popov	b3b049a824	[Local] Preserve noundef metadata in copyMetadataForLoad() If we're only changing the type of the load, preserve the noundef metadata.	2023-01-19 16:56:09 +01:00
Christian Ulmann	e741b8c2e5	[llvm][ir] Purge MD_prof custom accessors This commit purges direct accesses to MD_prof metadata and replaces them with the accessors provided from the utility file wherever possible. This commit can be seen as the first step towards switching the branch weights to 64 bits. See post here: https://discourse.llvm.org/t/extend-md-prof-branch-weights-metadata-from-32-to-64-bits/67492 Reviewed By: davidxl, paulkirth Differential Revision: https://reviews.llvm.org/D141393	2023-01-19 14:26:26 +01:00
Matt Arsenault	e7cd42f8e4	Utils: Add utility pass to lower ifuncs Create a global constructor which will initialize a global table of function pointers. For now, this is only used as a reduction technique for llvm-reduce. In the future this may be useful to support ifunc on systems where the program loader doesn't natively support it.	2023-01-17 22:33:56 -05:00
Florian Hahn	a6549718d9	[LoopUnroll] Don't update DT for changeToUnreachable. There is no need to update the DT here, because there must be a unique latch. Hence if the latch is not exiting it must directly branch back to the original loop header and does not dominate any nodes. Skipping a DT update here simplifies D141487. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D141810	2023-01-16 12:25:34 +00:00
Benjamin Kramer	db6961db7a	[FuncitonComparator] Clamp StringRef compare output to [-1,1] The comparison can have different values (but same sign) on big endian platforms, avoid that to make the unit test green there.	2023-01-16 01:44:55 +01:00
Benjamin Kramer	931d04be2f	[ADT] Make StringRef::compare like std::string_view::compare string_view has a slightly weaker contract, which only specifies whether the value is bigger or smaller than 0. Adapt users accordingly and just forward to the standard function (that also compiles down to memcmp)	2023-01-15 20:59:21 +01:00
Roman Lebedev	333cdd4125	[SimplifyCFG] Reapply: when eliminating `unreachable` landing pads, mark `call`s as `nounwind` This time the change is in it's least intrusive form since only the return type in prototype for `removeUnwindEdge()` is changed, since only a single specific caller need that knowledge. We really can't recover that knowledge, and `nounwind` knowledge, (and not just a lack of the unwind edge, aka `call` instead of `invoke`), is e.g. part of the reasoning in e.g. `mayHaveSideEffects()`. Note that this is call-site-specific knowledge, just because some callsite had an `unreachable` unwind edge, does not mean that all will.	2023-01-13 21:04:17 +03:00
Guillaume Chatelet	b55f83d013	[NFC] Remove Function::getParamAlignment Differential Revision: https://reviews.llvm.org/D141696	2023-01-13 16:20:58 +00:00
Guillaume Chatelet	26bd6476c6	Deprecate DataLayout::getPrefTypeAlignment	2023-01-13 15:05:24 +00:00
Roman Lebedev	fbcefff9d0	Revert "[SimplifyCFG] When eliminating `unreachable` landing pads, mark `call`s as `nounwind`" The bool is in the wrong place and might get implicitly converted from the previous second argument - a pointer. Thinking about it more, it's not really the best place for that functionality anyways, only a single caller needs that. This reverts commit 3c5b1f2d94d021005ce3769a4402d4a4ae843989.	2023-01-13 01:18:56 +03:00
Roman Lebedev	3c5b1f2d94	[SimplifyCFG] When eliminating `unreachable` landing pads, mark `call`s as `nounwind` We really can't recover that knowledge, and `nounwind` knowledge, (and not just a lack of the unwind edge, aka `call` instead of `invoke`), is e.g. part of the reasoning in e.g. `mayHaveSideEffects()`. Note that this is call-site-specific knowledge, just because some callsite had an `unreachable` unwind edge, does not mean that all will.	2023-01-13 00:41:58 +03:00
Nikita Popov	0d674cf0ea	[Mem2Reg] Extract code for converting !nonull to assume (NFC)	2023-01-12 16:41:31 +01:00
OCHyams	83f7f86e7d	[NFC][Assignment Tracking] Add is/setKillAddress Unlike D140903 this patch folds in treating an empty metadata address component of a dbg.assign the same as undef because it was already being treated that way in the AssignmentTrackingAnalysis pass. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D141125	2023-01-12 09:46:01 +00:00
Sami Tolvanen	fd5e262706	[ModuleUtils][KCFI] Set patchable-function-prefix for synthesized functions When -fpatchable-function-entry is used to emit prefix nops before functions, KCFI assumes all indirectly called functions have the same number of prefix nops, because the nops are emitted between the KCFI type hash and the function entry. However, as patchable-function-prefix is a function attribute set by Clang, functions later synthesized by LLVM don't inherit this attribute and end up not having prefix nops. One of these functions is asan.module_ctor, which the Linux kernel ends up calling indirectly when KASAN is enabled. In order to avoid tripping KCFI, save the expected prefix offset to a module flag, and use it when we're setting KCFI type for the relevant synthesized functions. Link: https://github.com/ClangBuiltLinux/linux/issues/1742 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D141172	2023-01-11 23:45:49 +00:00
Sami Tolvanen	a78cfd5909	[GCOV] Set !kcfi_type metadata for indirectly called functions With CONFIG_GCOV_KERNEL, the Linux kernel indirectly calls the __llvm_gcov_* functions generated by LLVM. With -fsanitize=kcfi, these calls are made from instrumented code and fail indirect call checks as they don't have !kcfi_type metadata. Similarly to D138945, set type metadata for these functions to allow GCOV and KCFI to be both enabled. Link: https://github.com/ClangBuiltLinux/linux/issues/1778 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D141444	2023-01-11 22:53:22 +00:00
spupyrev	45b155924e	[BOLT] using jump weights in profi We want to use profile inference (profi) in BOLT for stale profile matching. This is the second change for existing usages of profi (e.g., CSSPGO): (i) Added the ability to provide (estimated) jump weights for the algorithm. The goal of the algorithm is to create a valid control flow for a given function (that is, one in which incoming counts equal outgoing counts for every basic block while minimally modifying the original input block and jump weights). The input jump weights will be provided based on collected LBR profiles in BOLT. (ii) Added the corresponding options to ProfiParams. (iii) Slightly modified / simplified the construction of the flow network in profi so as it utilizes fewer auxiliary nodes. This is done by introducing parallel edges to the network (which is supported by MMF) and reduces the size of the network from 3\|V\| to 2\|V\|, where \|V\| is the number of basic blocks in the function. Inference (profile quality) impact: The diff is supposed to be a no-op for the inferred counts. However, our implementation of MCF is not fully deterministic and might return different results depending on the input network model. Since we changed the model construction, there are a few differences in comparison to the original implementation. I checked manually on an internal benchmark and see a minor difference (+/- 1 count for certain basic blocks) in just a dozen of instances (out of 10000+ input functions). Hence, the diff is highly unlikely to have an impact for existing prod workloads. Runtime impact: I measure up to 10% speedup for block-only (ie CSSPGO/AutoFDO) inference and up to 50% speedup for block+jump inference (ie BOLT) in comparison to the original unoptimized version. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D139870	2023-01-11 14:34:43 -08:00
Guillaume Chatelet	8fd5558b29	[NFC] Use TypeSize::geFixedValue() instead of TypeSize::getFixedSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:49:38 +00:00
Guillaume Chatelet	48f5d77eee	[NFC] Use TypeSize::getKnownMinValue() instead of TypeSize::getKnownMinSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:36:39 +00:00
Owen Anderson	90c1846629	Do not short circuit hoistIVInc when recomputation of poison flags is needed. Fixes https://github.com/llvm/llvm-project/issues/59777 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D140836	2023-01-10 23:03:07 -07:00
Nikita Popov	094ccee2c8	Reapply [Dominators] Add findNearestCommonDominator() for Instructions (NFC) Reapply with checks for instructions in unreachable blocks. A test case for this was added in 1ee4a93b15bb. ----- This is a recurring pattern: We want to find the nearest common dominator (instruction) for two instructions, but currently only provide an API for the nearest common dominator of two basic blocks. Add an overload that accepts and return instructions.	2023-01-10 12:16:31 +01:00
spupyrev	61eb12e1f4	[BOLT] introducing profi params We want to use profile inference (profi) in BOLT for stale profile matching. To this end, I am making a few changes modifying the interface of the algorithm. This is the first change for existing usages of profi (e.g., CSSPGO): - introducing an object holding the algorithmic parameters; - some renaming of existing options; - dropped unused option, SampleProfileInferEntryCount, as we don't plan to change its default value; - no changes in the output / tests. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D134756	2023-01-09 12:03:28 -08:00
Thomas Symalla	6c1cf201be	[NFC] Missing whitespace in SSAUpdaterBulk debug output. Adds a whitespace in a debug message before printing out a value in the SSAUpdaterBulk. Without this, debugging can end up a bit cumbersome. Differential Revision: https://reviews.llvm.org/D141262	2023-01-09 10:15:25 +01:00
Stephen Tozer	c383f4d655	[DebugInfo] Allow non-stack_value variadic expressions and use in DBG_INSTR_REF Prior to this patch, variadic DIExpressions (i.e. ones that contain DW_OP_LLVM_arg) could only be created by salvaging debug values to create stack value expressions, resulting in a DBG_VALUE_LIST being created. As of the previous patch in this patch stack, DBG_INSTR_REF's syntax has been changed to match DBG_VALUE_LIST in preparation for supporting variadic expressions. This patch adds some minor changes needed to allow variadic expressions that aren't stack values to exist, and allows variadic expressions that are trivially reduceable to non-variadic expressions to be handled similarly to non-variadic expressions. Reviewed by: jmorse Differential Revision: https://reviews.llvm.org/D133926	2023-01-06 19:31:10 +00:00
James Y Knight	1ae36b1387	Remove special cases for invoke of non-throwing inline-asm. Non-throwing inline asm infers the nounwind attribute in instcombine. Thus, it can be handled in the same manner as non-throwing target functions are generally. Further special casing is unnecessary complexity.	2023-01-06 13:53:10 -05:00
Nikita Popov	c60149b49e	Revert "[Dominator] Add findNearestCommonDominator() for Instructions (NFC)" This reverts commit 7f0de9573f758f5f9108795850337a5acbd17eef. This is missing handling for !isReachableFromEntry() blocks, which may be relevant for some callers. Revert for now.	2023-01-06 17:36:01 +01:00
Nikita Popov	7f0de9573f	[Dominator] Add findNearestCommonDominator() for Instructions (NFC) This is a recurring pattern: We want to find the nearest common dominator (instruction) for two instructions, but currently only provide an API for the nearest common dominator of two basic blocks. Add an overload that accepts and return instructions.	2023-01-06 17:06:25 +01:00
Nikita Popov	a6a526ec54	[IR] Add AllocaInst::getAllocationSize() (NFC) When fetching allocation sizes, we almost always want to have the size in bytes, but we were only providing an InBits API. Also add the corresponding byte-based conjugate to save some *8 and /8 juggling everywhere.	2023-01-06 15:36:16 +01:00
OCHyams	775af51209	[DebugInfo] Prefer setKillLocation rather than replacing operands with undef NFC-ish. There is a functional change but the outputs are semantically identical. Where we might've before replaced one operand with undef (which means "this is a kill location marker") the use of `setKillLocation` will replace all location operands with `undef` (which also means "this is a kill location marker"). Related to https://discourse.llvm.org/t/auto-undef-debug-uses-of-a-deleted-value Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D140904	2023-01-06 10:11:14 +00:00
OCHyams	042107494d	[DebugInfo][NFC] Rename is/setUndef to is/setKilllocation These names better reflect the semantics and also the implementation, since it's not just "undef" operands that are sentinels used to signal that the debug intrinsic terminates dominating locations definitions. Related to https://discourse.llvm.org/t/auto-undef-debug-uses-of-a-deleted-value Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D140903	2023-01-06 09:15:02 +00:00
Peter Rong	1db51d8eb2	[Transform] Rewrite LowerSwitch using APInt This rewrite fixes https://github.com/llvm/llvm-project/issues/59316. Previously LowerSwitch uses int64_t, which will crash on case branches using integers with more than 64 bits. Using APInt fixes this problem. This patch also includes a test Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D140747	2023-01-05 14:30:42 -08:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Joshua Cao	629d880dc5	[LoopUnrollAndJam] Visit phi operand dependencies in post-order Fixes https://github.com/llvm/llvm-project/issues/58565 The previous implementation visits operands in pre-order, but this does not guarantee an instruction is visited before its uses. This can cause instructions to be copied in the incorrect order. For example: ``` a = ... b = add a, 1 c = add a, b d = add b, a ``` Pre-order visits does not guarantee the order in which `a` and `b` are visited. LoopUnrollAndJam may incorrectly insert `b` before `a`. This patch implements post-order visits. By visiting dependencies first, we guarantee that an instruction's dependencies are visited first. Differential Revision: https://reviews.llvm.org/D140255	2023-01-05 00:05:49 -08:00
Joshua Cao	50be285944	[LoopUnrollAndJam] Forget scalar evolution dispositions. Do no explicitly forget subloop. Fixes https://github.com/llvm/llvm-project/issues/58454 Scalar evolution dispositions need to be forgotten to pass verification. We do not need to forget the subloop since it is automatically forgotten when forgetting the parent loop. Differential Revision: https://reviews.llvm.org/D140953	2023-01-04 19:35:50 -08:00
Owen Anderson	733740b189	Fix a phase-ordering problem in SimplifyCFG. Switch simplification could sometimes fail to notice when an intermediate case removal caused the switch condition to become constant. This would cause the switch to be simplified into a conditional branch rather than a direct branch. Most of the time this didn't matter, except that occasionally downstream parts of SimplifyCFG expect tautological branches to already have been eliminated. The missed handling in switch simplification would cause an assertion failure in the downstream code. Triggering the assertion failure is fairly sensitive to the exact order of various simplifications. Fixes https://github.com/llvm/llvm-project/issues/59768 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D140831	2023-01-04 16:47:13 -07:00
Matt Arsenault	e60b0d2a22	llvm-reduce: Reduce prologue data Identical to the prefix data handling, and also had broken cloning.	2023-01-03 11:07:02 -05:00
Matt Arsenault	4afb10d572	llvm-reduce: Reduce prefix data Also fixes broken cloning.	2023-01-03 11:03:43 -05:00
Peter Rong	2838507bd4	precommit formatting of LowerSwitch.cpp for D140747 Signed-off-by: Peter Rong <PeterRong96@gmail.com>	2023-01-02 12:57:35 -08:00
Teresa Johnson	35c7e457e8	[MemProf] Fix inline propagation of memprof metadata It isn't correct to always remove memprof metadata MIBs from the original allocation call after inlining. Let's say we have the following partial call graph: C D \ / v v B E \| / v v A where A contains an allocation call. If both contexts including B have the same allocation behavior, the context in the memprof metadata on the allocation will be pruned, and we will have 2 MIBs with contexts: A,B and A,E. Previously, if we inlined A into B we propagate the matching MIBs onto the inlined allocation call in B' (A,B in this case), and remove it from the original out of line allocation in A. This is correct if we have a single round of bottom up inlining. However, in the compiler we can have multiple invocations of the inliner pass (e.g. LTO). We may also inline non-bottom up with an alternative inliner such as the ModuleInliner. In that case, we could end up first inlining B into C, without having inlined A into B. The call graph then looks like: D \| v C' B E \ \| / v v v A If we subsequently (perhaps on a later invocation of bottom up inlining) inline A into B, the previous handling would propagate the memprof MIB context A,B up into the inlined allocation in B', and remove it from the original allocation in A. The propagation into B' is fine, however, by removing it from A's allocation, we no longer reflect the context coming from C'. To fix this, simply prevent the removal of MIB from the original allocation callsites. Note that the memprof_inline.ll test has some changes to existing checking to replace "noncold" with "notcold" in the metadata. The corresponding CHECK was accidentally commented out in the old version and thus this mistake was not previously detected. Differential Revision: https://reviews.llvm.org/D140764	2022-12-30 07:31:47 -08:00
Mircea Trofin	946831ea2d	[NFC] Rename Function::isDebugInfoForProfiling to shouldEmit[...] The function name was misleading - the expectation set both by the name and by other members of Function (like isDeclaration or isIntrinsic) would be that the function somehow would "be" "debug info for profiling". But that's not the case - the property indicates (as the comment over the declaration also explains) whether debug info should be emitted (for profiling).	2022-12-21 18:36:59 -08:00
Matt Arsenault	95570af6fa	SCCPSolver: Remove unnecessary set empty check	2022-12-21 09:33:17 -05:00
Craig Topper	9b2fecec40	[BuildLibCalls][RISCV] Sign extend return value of bcmp on riscv64. riscv64 wants callees to sign extend signed and unsigned int returns. The caller can use this to avoid a sign extend if the result is used by a comparison since riscv64 only has 64-bit compares. InstCombine/SimplifyLibCalls aggressively turn memcmps that are only used by an icmp eq 0 into bcmp, but we lose the signext attribute that would have been present on the memcmp. This causes an unneeded sext.w in the generated assembly. This looks even sillier if bcmp is implemented alias to memcmp. In that case, not only did we not get any savings by using bcmp, we added an instruction. This probably applies to other functions, this just happens to be the one I noticed so far. See also the discussion here https://discourse.llvm.org/t/can-we-preserve-signext-return-attribute-when-converting-memcmp-to-bcmp/67126 Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D139901	2022-12-20 11:47:04 -08:00
Joshua Cranmer	e6b02214c6	[IR] Add a target extension type to LLVM. Target-extension types represent types that need to be preserved through optimization, but otherwise are not introspectable by target-independent optimizations. This patch doesn't add any uses of these types by an existing backend, it only provides basic infrastructure such that these types would work correctly. Reviewed By: nikic, barannikov88 Differential Revision: https://reviews.llvm.org/D135202	2022-12-20 11:02:11 -05:00
Roman Lebedev	3a8e009f97	Revert "Reland "[SimplifyCFG] `FoldBranchToCommonDest()`: deal with mismatched IV's in PHI's in common successor block"" One of these two changes is exposing (or causing) some more miscompiles. A reproducer is in progress, so reverting until resolved. This reverts commit 428f36401b1b695fd501ebfdc8773bed8ced8d4e.	2022-12-20 18:36:42 +03:00
Sebastian Neubauer	bb7940e25f	[llvm] Make llvm::Any similar to std::any This facilitates replacing llvm::Any with std::any. - Deprecate any_isa in favor of using any_cast(Any) and checking for nullptr because C++17 has no any_isa. - Remove the assert from any_cast(Any), so it returns nullptr if the type is not correct. This aligns it with std::any_cast(any). Use any_cast(Any) throughout LLVM instead of checks with any_isa. This is the first part outlined in https://discourse.llvm.org/t/rfc-switching-from-llvm-any-to-std-any/67176 Differential Revision: https://reviews.llvm.org/D139973	2022-12-20 13:28:30 +01:00
Archibald Elliott	f09cf34d00	[Support] Move TargetParsers to new component This is a fairly large changeset, but it can be broken into a few pieces: - `llvm/Support/TargetParser` are all moved from the LLVM Support component into a new LLVM Component called "TargetParser". This potentially enables using tablegen to maintain this information, as is shown in https://reviews.llvm.org/D137517. This cannot currently be done, as llvm-tblgen relies on LLVM's Support component. - This also moves two files from Support which use and depend on information in the TargetParser: - `llvm/Support/Host.{h,cpp}` which contains functions for inspecting the current Host machine for info about it, primarily to support getting the host triple, but also for `-mcpu=native` support in e.g. Clang. This is fairly tightly intertwined with the information in `X86TargetParser.h`, so keeping them in the same component makes sense. - `llvm/ADT/Triple.h` and `llvm/Support/Triple.cpp`, which contains the target triple parser and representation. This is very intertwined with the Arm target parser, because the arm architecture version appears in canonical triples on arm platforms. - I moved the relevant unittests to their own directory. And so, we end up with a single component that has all the information about the following, which to me seems like a unified component: - Triples that LLVM Knows about - Architecture names and CPUs that LLVM knows about - CPU detection logic for LLVM Given this, I have also moved `RISCVISAInfo.h` into this component, as it seems to me to be part of that same set of functionality. If you get link errors in your components after this patch, you likely need to add TargetParser into LLVM_LINK_COMPONENTS in CMake. Differential Revision: https://reviews.llvm.org/D137838	2022-12-20 11:05:50 +00:00
Anna Thomas	05b060b0b0	[LoopPeel] Expose ValueMap of last peeled iteration. NFC The value map of last peeled iteration is computed within peelLoop API. This patch exposes it for callers of peelLoop. While this is not currently used by upstream passes, we have a usecase downstream which benefits from this API update. Future users of peelLoop can also use the ValueMap if needed. Similar value maps are exposed by other loop utilities such as loop cloning. Differential Revision: https://reviews.llvm.org/D138228	2022-12-19 09:55:29 -05:00
Roman Lebedev	428f36401b	Reland "[SimplifyCFG] `FoldBranchToCommonDest()`: deal with mismatched IV's in PHI's in common successor block" This reverts commit 37b8f09a4b61bf9bf9d0b9017d790c8b82be2e17, and returns commit 1bd0b82e508d049efdb07f4f8a342f35818df341. The miscompile was in InstCombine, and it has been addressed. This tries to approach the problem noted by @arsenm: terrible codegen for `__builtin_fpclassify()`: https://godbolt.org/z/388zqdE37 Just because the PHI in the common successor happens to have different incoming values for these two blocks, doesn't mean we have to give up. It's quite easy to deal with this, we just need to produce a select: https://alive2.llvm.org/ce/z/000srb Now, the cost model for this transform is rather overly strict, so this will basically never fire. We tally all (over all preds) the selects needed to the NumBonusInsts Differential Revision: https://reviews.llvm.org/D139275	2022-12-17 05:18:54 +03:00

1 2 3 4 5 ...

6685 Commits