llvm-project

Author	SHA1	Message	Date
Gergely Bálint	29fef3a51e	[BOLT] Improve DWARF CFI generation for pac-ret binaries (#163381 ) During InsertNegateRAState pass we check the annotations on instructions, to decide where to generate the OpNegateRAState CFIs in the output binary. As only instructions in the input binary were annotated, we have to make a judgement on instructions generated by other BOLT passes. Incorrect placement may cause issues when an (async) unwind request is received during the new "unknown" instructions. This patch adds more logic to make a more informed decision on by taking into account: - unknown instructions in a BasicBlock with other instruction have the same RAState. Previously, if the BasicBlock started with an unknown instruction, the RAState was copied from the preceding block. Now, the RAState is copied from the succeeding instructions in the same block. - Some BasicBlocks may only contain instructions with unknown RAState, As explained in issue #160989, these blocks already have incorrect unwind info. Because of this, the last known RAState based on the layout order is copied. Updated bolt/docs/PacRetDesign.md to reflect changes.	2025-12-01 12:00:31 +01:00
Ming Yan	8ceeba8381	[MLIR][SCF] Canonicalize redundant scf.if from scf.while before region into after region (#169892 ) When a `scf.if` directly precedes a `scf.condition` in the before region of a `scf.while` and both share the same condition, move the if into the after region of the loop. This helps simplify the control flow to enable uplifting `scf.while` to `scf.for`.	2025-12-01 18:54:21 +08:00
Jim Lin	b7721c55fc	[RISCV] Remove the duplicate for RV32/RV64 in zicond-fp-select-zfinx.ll. NFC.	2025-12-01 18:36:07 +08:00
Luke Lau	d1500d12be	[SelectionDAG] Add SelectionDAG::getTypeSize. NFC (#169764 ) Similar to how getElementCount avoids the need to reason about fixed and scalable ElementCounts separately, this patch adds getTypeSize to do the same for TypeSize. It also goes through and replaces some of the manual uses of getVScale with getTypeSize/getElementCount where possible.	2025-12-01 10:33:50 +00:00
Timm Baeder	b1620996f4	[clang][bytecode] Fix discarding ImplitiValueInitExprs (#170089 ) They don't have side-effects, so this should be fine. Fixes https://github.com/llvm/llvm-project/issues/170064	2025-12-01 11:33:33 +01:00
Luke Lau	2c9e9ffa77	[SCCP] Handle llvm.experimental.get.vector.length calls (#169527 ) As noted in the reproducer provided in https://github.com/llvm/llvm-project/issues/164762#issuecomment-3554719231, on RISC-V after LTO we sometimes have trip counts exposed to vectorized loops. The loop vectorizer will have generated calls to @llvm.experimental.get.vector.length, but there are [some properties](https://llvm.org/docs/LangRef.html#id2399) about the intrinsic we can use to simplify it: - The result is always less than both Count and MaxLanes - If Count <= MaxLanes, then the result is Count This teaches SCCP to handle these cases with the intrinsic, which allows some single-iteration-after-LTO loops to be unfolded. #169293 is related and also simplifies the intrinsic in InstCombine via computeKnownBits, but it can't fully remove the loop since computeKnownBits only does limited reasoning on recurrences.	2025-12-01 10:29:21 +00:00
Tom Eccles	8ec2112ec8	[OMPIRBuilder] re-land cancel barriers patch #164586 (#169931 ) A barrier will pause execution until all threads reach it. If some go to a different barrier then we deadlock. This manifests in that the finalization callback must only be run once. Fix by ensuring we always go through the same finalization block whether the thread in cancelled or not and no matter which cancellation point causes the cancellation. The old callback only affected PARALLEL, so it has been moved into the code generating PARALLEL. For this reason, we don't need similar changes for other cancellable constructs. We need to create the barrier on the shared exit from the outlined function instead of only on the cancelled branch to make sure that threads exiting normally (without cancellation) meet the same barriers as those which were cancelled. For example, previously we might have generated code like ``` ... %ret = call i32 @__kmpc_cancel(...) %cond = icmp eq i32 %ret, 0 br i1 %cond, label %continue, label %cancel continue: // do the rest of the callback, eventually branching to %fini br label %fini cancel: // Populated by the callback: // unsafe: if any thread makes it to the end without being cancelled // it won't reach this barrier and then the program will deadlock %unused = call i32 @__kmpc_cancel_barrier(...) br label %fini fini: // run destructors etc ret ``` In the new version the barrier is moved into fini. I generate it after the destructors because the standard describes the barrier as occurring after the end of the parallel region. ``` ... %ret = call i32 @__kmpc_cancel(...) %cond = icmp eq i32 %ret, 0 br i1 %cond, label %continue, label %cancel continue: // do the rest of the callback, eventually branching to %fini br label %fini cancel: br label %fini fini: // run destructors etc // safe so long as every exit from the function happens via this block: %unused = call i32 @__kmpc_cancel_barrier(...) ret ``` To achieve this, the barrier is now generated alongside the finalization code instead of in the callback. This is the reason for the changes to the unit test. I'm unsure if I should keep the incorrect barrier generation callback only on the cancellation branch in clang with the OMPIRBuilder backend because that would match clang's ordinary codegen. Right now I have opted to remove it entirely because it is a deadlock waiting to happen. --- This re-lands #164586 with a small fix for a failing buildbot running address sanitizer on clang lit tests. In the previous version of the patch I added an insertion point guard "just to be safe" and never removed it. There isn't insertion point guarding on the other route out of this function and we do not preserve the insertion point around getFiniBB either so it is not needed here. The problem flagged by the sanitizers was because the saved insertion point pointed to an instruction which was then removed inside the FiniCB for some clang codegen functions. The instruction was freed when it was removed. Then accessing it to restore the insertion point was a use after free bug.	2025-12-01 10:07:19 +00:00
Tom Eccles	34c44f21ae	[flang][TBAA] refine TARGET/POINTER encoding (#169544 ) Previously we were less specific for POINTER/TARGET: encoding that they could alias with (almost) anything. In the new system, the "target data" tree is now a sibling of the other trees (e.g. "global data"). POITNTER variables go at the root of the "target data" tree, whereas TARGET variables get their own nodes under that tree. For example, ``` integer, pointer :: ip real, pointer :: rp integer, target :: it integer, target :: it2(:) real, target :: rt integer :: i real :: r ``` - `ip` and `rp` may alias with any variable except `i` and `r`. - `it`, `it2`, and `rt` may alias only with `ip` or `rp`. - `i` and `r` cannot alias with any other variable. Fortran 2023 15.5.2.14 gives restrictions on entities associated with dummy arguments. These do not allow non-target globals to be modified through dummy arguments and therefore I don't think we need to make all globals alias with dummy arguments. I haven't implemented it in this patch, but I wonder whether it is ever possible for `ip` to alias with `rt` or even `it2`. While I was updating the tests I fixed up some tests that still assumed that local alloc tbaa wasn't the default. I found no functional regressions in the gfortran test suite, fujitsu test suite, spec2017, or a selection of HPC apps we test internally.	2025-12-01 10:05:56 +00:00
Benjamin Maxwell	1317083530	[AArch64][SME] Support saving/restoring ZT0 in the MachineSMEABIPass (#166362 ) This patch extends the MachineSMEABIPass to support ZT0. This is done with the addition of two new states: - `ACTIVE_ZT0_SAVED` * This is used when calling a function that shares ZA, but does not share ZT0 (i.e., no ZT0 attributes) * This state indicates ZT0 must be saved to the save slot, but ZA must remain on, with no lazy save setup - `LOCAL_COMMITTED` * This is used for saving ZT0 in functions without ZA state * This state indicates ZA is off and ZT0 has been saved * This state is general enough to support ZA, but the required transitions have not been implemented† To aid with readability, the state transitions have been reworked to a switch of `transitionFrom(<FromState>).to(<ToState>)`, rather than nested ifs, which helps manage more transitions. † This could be implemented to handle some cases of undefined behavior better.	2025-12-01 09:55:49 +00:00
Igor Wodiany	dda15ad0aa	[mlir][spirv] Use MapVector for BlockMergeInfoMap (#169636 ) This should ensure that the structurizer while loop is deterministic across runs. Use of `MapVector` addresses the source of the nondeterminism which is use of a `Block*` as a map key. fixes #128547	2025-12-01 09:43:25 +00:00
Gergely Bálint	8e6fb0ee84	Reapply "[BOLT][BTI] Skip inlining BasicBlocks containing indirect tailcalls" (#169881 ) (#169929 ) This reapplies commit 5d6d74359d69d3aada6a46c7cf51d84eb0848b70. Fix: added assertions to the requirements of the test -------- Original commit message: In the Inliner pass, tailcalls are converted to calls in the inlined BasicBlock. If the tailcall is indirect, the `BR` is converted to `BLR`. These instructions require different BTI landing pads at their targets. As the targets of indirect tailcalls are unknown, inlining such blocks is unsound for BTI: they should be skipped instead.	2025-12-01 10:20:23 +01:00
Steven Wu	8079d033c9	[CAS] Temporarily skip tests on old windows version (#170063 )	2025-12-01 17:10:39 +08:00
Carlos Galvez	eb711d8e14	[clang-tidy][doc] Fix incorrect link syntax in cppcoreguidelines-pro-… (#170088 ) …bounds-avoid-unchecked-container-access Missing a trailing underscore to render it as a link. Co-authored-by: Carlos Gálvez <carlos.galvez@zenseact.com>	2025-12-01 09:50:19 +01:00
Matthias Springer	147c466bcd	[mlir][arith] Add support for min/max to `ArithToAPFloat` (#169760 ) Add support for `arith.minnumf`, `arith.maxnumf`, `arith.minimumf`, `arith.maximumf`.	2025-12-01 08:50:02 +00:00
ShashwathiNavada	9afb651613	Adding support for iterator in motion clauses. (#159112 ) As described in section 2.14.6 of openmp spec, the patch implements support for iterator in motion clauses. --------- Co-authored-by: Shashwathi N <nshashwa@pe31.hpc.amslabs.hpecorp.net>	2025-12-01 14:03:32 +05:30
Matthias Springer	05b1989551	[mlir][arith] Add support for `negf` to `ArithToAPFloat` (#169759 ) Add support for `arith.negf`.	2025-12-01 08:28:23 +00:00
Matthias Springer	f67b018470	[mlir][SPIRV] Improve ub.unreachable lowering test case (#170083 ) Addresses a comment on the PR that introduces the ub.reachable -> spriv.Unreachable lowering (https://github.com/llvm/llvm-project/pull/169872#discussion_r2573670611).	2025-12-01 08:15:15 +00:00
Abhishek Varma	7ce71414ec	[NFC][Linalg] Follow-up on ConvMatchBuilder (#170080 ) -- This commit addresses [follow-up review comments on 169704](https://github.com/llvm/llvm-project/pull/169704#pullrequestreview-3521785548). -- Contains NFC nit/minor changes. Signed-off-by: Abhishek Varma <abhvarma@amd.com>	2025-12-01 13:44:15 +05:30
David Sherwood	17677ad7eb	[LV] Don't create WidePtrAdd recipes for scalar VFs (#169344 ) While attempting to remove the use of undef from more loop vectoriser tests I discovered a bug where this assert was firing: ``` llvm::Constant* llvm::Constant::getSplatValue(bool) const: Assertion `this->getType()->isVectorTy() && "Only valid for vectors!"' failed. ... #8 0x0000aaaab9e2fba4 llvm::Constant::getSplatValue #9 0x0000aaaab9dfb844 llvm::ConstantFoldBinaryInstruction ``` This seems to be happening because we are incorrectly generating WidePtrAdd recipes for scalar VFs. The PR fixes this by checking whether a plan has a scalar VF only in legalizeAndOptimizeInductions. This PR also removes the use of undef from the test `both` in Transforms/LoopVectorize/iv_outside_user.ll, which is what started triggering the assert. Fixes #169334	2025-12-01 08:12:41 +00:00
Matthias Springer	4d7abe5355	[mlir][arith] Add support for `cmpf` to `ArithToAPFloat` (#169753 ) Add support for `arith.cmpf`.	2025-12-01 09:12:11 +01:00
Vasily Leonenko	a751ed97ac	[BOLT] Support runtime library hook via DT_INIT_ARRAY (#167467 ) Major part of this PR is commit implementing support for DT_INIT_ARRAY for BOLT runtime libraries initialization. Also, it adds related hook-init test & fixes couple of X86 instrumentation tests. This commit follows implementation of instrumentation hook via DT_FINI_ARRAY (https://github.com/llvm/llvm-project/pull/67348) and extends it for BOLT runtime libraries (including instrumentation library) initialization hooking. Initialization has has differences compared to finalization: - Executables always use ELF entry point address. Update code checks it and updates init_array entry if ELF is shared library (have no interp entry) and have no DT_INIT entry. Also this commit introduces "runtime-lib-init-hook" option to select primary initialization hook (entry_point, init, init_array) with fall back to next available hook in input binary. e.g. in case of libc we can explicitly set it to init_array. - Shared library init_array entries relocations usually has R_AARCH64_ABS64 type on AArch64 binaries. We check relocation type and adjust methods for reading init_array relocations in discovery and update methods. --------- Co-authored-by: Vasily Leonenko <vasily.leonenko@huawei.com>	2025-12-01 10:55:00 +03:00
Timm Baeder	bbb0dbadfa	[clang][AST] Add `RecordDecl::getNumFields()` (#170022 ) Not sure why that didn't exist yet, but we have quite a few places using the same `std::distance` pattern.	2025-12-01 08:33:54 +01:00
Luke Lau	dc5ce79cc1	[LV] Regenerate some check lines. NFC The scalar loop doesn't exist anymore after 8907b6d39371d439461cdd3475d5590f87821377	2025-12-01 15:25:08 +08:00
Yingwei Zheng	9416b19e4f	[InstCombine] Add missing constant check (#170068 ) `cast<Constant>` is not guarded by a type check during canonicalization of predicates. This patch adds a type check in the outer if to avoid the crash. `dyn_cast` may introduce another nested if, so I just use `isa<Constant>` instead. Address the crash reported in https://github.com/llvm/llvm-project/pull/153053#issuecomment-3593914124.	2025-12-01 15:20:45 +08:00
Jason Molenda	036279addf	[lldb][debugserver] Return shared cache filepath in jGetSharedCacheInfo (#168474 ) Add a "shared_cache_path" key-value to the jGetSharedCacheInfo response, if we can fetch the shared cache path. If debugserver and the inferior process are running with the same shared cache UUID, there is a simple SPI to get debugserver's own shared cache filepath and we will return that. On newer OSes, there are SPI we can use to get the inferior process' shared cache filepath, use that if necessary and the SPI are available. The response for the jGetSharedCacheInfo packet will now look like {"shared_cache_base_address":6609256448,"shared_cache_uuid":"B69FF43C-DBFD-3FB1-B4FE-A8FE32EA1062","no_shared_cache":false,"shared_cache_private_cache":false,"shared_cache_path":"/System/Volumes/Preboot/Cryptexes/OS/System/Library/dyld/dyld_shared_cache_arm64e"} when we have the full information about the shared cache in the inferior. There are three possible types of responses: 1. inferior has not yet mapped in a shared cache (read: when stopped at dyld_start and dyld hasn't started executing yet). In this case, no "shared_cache_path" is listed. ("shared_cache_base_address" will be 0, "shared_cache_uuid" will be all-zeroes uuid) 2. inferior has a shared cache, but it is different than debugserver's and we do not have the new SPI to query the shared cache filepath. No "shared_cache_path" is listed. 3. We were able to find the shared cache filepath, and it is included in the response, as above. I'm not using this information in lldb yet, but changes that build on this will be forthcoming. rdar://148939795	2025-11-30 21:40:13 -08:00
Men-cotton	81c5d468cf	[MLIR][NVVM] Propagate verification failure for unsupported SM targets (#170001 ) Fixes: https://github.com/llvm/llvm-project/issues/169113 Correctly propagate verification failure when `NVVM::RequiresSMInterface` check fails during `gpu.module` verification. Previously, the walk was interrupted but the function returned `success()`, causing a mismatch between the emitted diagnostic and the return status. This led to assertion failures in Python bindings which expect `failure()` when diagnostics are emitted. CC: @grypp	2025-12-01 09:50:13 +05:30
Brandon Wu	e2181400d7	[RISCV][llvm] Correct shamt in P extension EXTRACT_VECTOR_ELT lowering (#169823 ) During operation legalization, element type should have been turn into XLenVT which makes the SHL a no-op. We need to use exact vector element type instead.	2025-12-01 11:03:50 +08:00
Matt Arsenault	6369279a0c	Revert "Revert "LangRef: Clarify llvm.minnum and llvm.maxnum about sNaN and signed zero (#112852 )"" (#170067 ) Reverts llvm/llvm-project#168838 Justification is confused and this did not receive adequate discussion, particularly during a holiday week	2025-12-01 02:56:47 +00:00
Aadesh Premkumar	7494f3df14	[SPIRV] Added support for extension SPV_ALTERA_arbitrary_precision_fixed_point and name change of SPV_INTEL_arbitrary_precision_integers to SPV_ALTERA_arbitrary_precision_integers (#136085 ) --Added support for extension SPV_ALTERA_arbitrary_precision_fixed_point --Added test files for extension SPV_ALTERA_arbitrary_precision_fixed_point	2025-11-30 18:44:51 -08:00
fennecJ	2e21bb815d	[RISCV][ISelLowering] Use Zicond for FP selects on Zfinx/Zdinx (#169299 ) ### Summary This patch let RISCVTargetLowering::lowerSELECT to lower some floating-point select operations through an integer zicond select when: * Zicond is available, and * FP values live in GPRs (Zfinx/Zdinx), and * Select condition is an integer type. In that scenario there is no extra cost for GPR <-> "FP GPR" moves, so we can implement FP selects with a CZERO-based sequence instead of a branch. For example, for ```c float foo(int cond, float x) { return (cond != 0) ? x : 0.0f; } ``` the current lowering produces: ```asm foo: mv a2, a0 li a0, 0 beqz a2, .LBB0_2 .LBB0_1: mv a0, a1 .LBB0_2: ret ``` With this patch, when targeting rv64ima_zicond_zfinx we instead get: ```asm foo: czero.nez a2, zero, a0 czero.eqz a0, a1, a0 or a0, a2, a0 ret ``` The existing branch-based lowering is preserved for: * targets without Zicond * targets where FP registers are separate (+f, +d without zfinx/zdinx) ### Testing Adds llvm/test/CodeGen/RISCV/zicond-fp-select-zfinx.ll to cover: * RV64 Zfinx/Zicond vs Zfinx without Zicond * RV64 Zdinx/Zicond vs Zdinx without Zicond * RV32 Zfinx/Zicond vs Zfinx without Zicond Also adds baseline RV32F/RV64F/RV64D cases to ensure we still use branches when FP registers are separate. The tests check that: * With Zicond + Zfinx/Zdinx, FP select lowers to a CZERO+OR sequence with no conditional branches. * Without Zicond (or without Zfinx/Zdinx), we still get branch-based code and no czero.* instructions.	2025-12-01 10:19:56 +08:00
lonely eagle	e110abc3c6	[mlir][affine] Use iter argument replace init when delete loop in the coalesceLoops function (#169514 ) Fix https://github.com/llvm/llvm-project/issues/169483 by using iter argument replace init when delete loop in the coalesceLoops function.	2025-12-01 10:00:54 +08:00
Phoebe Wang	75aa01b895	Revert "LangRef: Clarify llvm.minnum and llvm.maxnum about sNaN and signed zero (#112852 )" (#168838 ) This reverts commit 363b05944f9212511ee6811d0eb1af841c177226. This is a follow up of #166912. Sorry for not noticing the change at the beginning, but I disagree with both sNaN and signed zero semantics change. I have 3 justifications: - llvm.minnum and llvm.maxnum are common intrinsics, we cannot change the definition just because "some architectures" support the changed semantic. For example, X86 min/max instructions neither distinguish sNaN nor signed zero. We have to add couples of extra instructions to match with the new definition, which makes the intrinsics less efficient. But efficient is not the reason for the objection. I object because such cost is unnecessary; - As the example ``minnum(fadd(sNaN, -0.0), 1.0)`` shows, minnum/maxnum themself cannot guarantee consistent result if multiple FP arithmetic operations involved. It makes the sacrifice of performance totally unnecessary. `Behavior of Floating-Point NaN values` notes all NaNs can be treated as quiet NaNs unless using Constrained Floating-Point Intrinsics. So the cost is only worth for constrained minnum/maxnum ones if we want to define them; - Signed zero handling is unnecessary either, because even the C functions don't require it. If any other front ends require, they can use the existing fminnum_ieee/fmaxnum_ieee or define new intrinsics; Fixes: https://github.com/llvm/llvm-project/issues/138303 and https://github.com/llvm/llvm-project/issues/169122	2025-12-01 09:35:00 +08:00
Fangrui Song	ef3785887c	ELF: Move .eh_frame_hdr code closer to .eh_frame . NFC ... as they are closely related. Also improve the comments.	2025-11-30 14:37:35 -08:00
Florian Hahn	c465a56e9d	[VPlan] Handle canonical IVs in ::isSingleScalar. (NFCI) The canonical IV is always a single scalar. They are already treated as uniform-across-UF-and-VF. This should currently be NFC.	2025-11-30 21:51:03 +00:00
Florian Hahn	113e0c95a8	[LV] Add additional tests for argmin with find-first wrapping IV ranges. Add test cases for upcoming argmin vectorization changes that have wrapping IV ranges.	2025-11-30 21:07:28 +00:00
Nico Weber	f42e58f616	[gn] port a6643f27ecda (libc++ picolib/newlib)	2025-11-30 13:59:11 -05:00
Simon Pilgrim	38678a91f3	[DAG] getCarry - always succeed if we encounter a i1 type during trunc/ext peeling (#169777 ) If we are force reconstructing a carry from a raw MVT::i1 type, make sure we don't miss any cases while peeling through trunc/ext chains - check for i1 types at the start of the while loop Fixes #169691	2025-11-30 18:26:24 +00:00
Shih-Po Hung	76d5dd5f9e	[TTI][RISCV] Add cost modelling for intrinsic vp.load.ff (#169890 ) This patch is a rework of #160470 (which was reverted). With getMemIntrinsicCost() now available, we can re‑land the change and reduce vp_load_ff boilerplate.	2025-11-30 17:46:46 +00:00
Jakub Kuderski	0bd2f12753	[mlir][linalg] Restrict fill initial value type to output element type (#169567 ) Disallow implicit casting, which is surprising, and, IME, usually indicative of copy-paste errors. Because the initial value must be a scalar, I don't expect this to affect any data movement.	2025-11-30 09:51:37 -05:00
David Green	b228256312	[ARM] Introduce intrinsics for MVE fma under strict-fp. (#169771 ) Similar to #169156, this adds an @arm.mve.fma intrinsic for strict-fp. A Builder class is added to act as the common subclass of IRBuilder and IRInt.	2025-11-30 11:12:53 +00:00
Jonas Hahnfeld	ce2c0813f0	[clang] Move and update comment in getASTRecordLayout, NFC. isDefinition was already renamed to isCompleteDefinition by commit f937c023bf in 2011, later the day the comment was originally written.	2025-11-30 11:58:41 +01:00
Aiden Grossman	22257e8d6e	[bazel] Port #169873 (#170027 ) A new dependency was added.	2025-11-30 00:45:18 -08:00
Tomer Shafir	dda1fcf7b1	[llc][NFC] Remove unreachable return statement (#169915 ) `reportError()` is a `[[noreturn]]` that calls `exit(1)`.	2025-11-30 10:42:29 +02:00
sathvikreddy853	70970d0a5b	[flang] Implement lowering for the PAUSE statement (Fixes #166821 ) (#167115 ) Implements lowering for the Fortran `PAUSE` statement. - Handles PAUSE with no operand. - Handles PAUSE with integer argument. - Handles PAUSE with character literal argument. - Adds a new lowering test: flang/test/Lower/pause-statement.f90. Unlike STOP, PAUSE does not unconditionally terminate control flow. The lowering preserves labels and GOTOs, consistent with legacy Fortran behavior. Fixes: #166821 --------- Co-authored-by: aditya nath <adityanath5002@gmail.com> Co-authored-by: Eugene Epshteyn <eepshteyn@nvidia.com>	2025-11-29 22:07:14 -05:00
Matthias Springer	3de11e9251	[mlir][CF] Add `ub.unreachable` canonicalization (#169873 ) Basic blocks with only a `ub.unreachable` terminator are unreachable. This commit adds a canonicalization pattern that folds to `cf.cond_br` to `cf.br` if one of the destination branches is unreachable.	2025-11-30 02:36:57 +00:00
Fangrui Song	a8cffb8299	Remove unused MCObjectFileInfo::SupportsWeakOmittedEHFrame The code is related to pre-AsmPrinter legacy code (see 9cb0e94dc79657144d639c722619e1e4fc19040e in 2008). The only caller has been removed by bb237c72a69e6294258874a40aaaf14ad2747710 in 2011.	2025-11-29 17:43:26 -08:00
Florian Hahn	24b87b8d48	[VPlan] Skip cost verification for loops with EVL gather/scatter. The VPlan-based cost model use vp_gather/vp_scatter for gather/scatter costs, which is different to the legacy cost model and cannot be matched there. Don't verify the costs match for plans containing gather/scatters with EVL. Fixes https://github.com/llvm/llvm-project/issues/169948.	2025-11-29 22:00:30 +00:00
Lucie Choi	9ffd2e40c1	[SimplifyCFG] Fix `SimplifyCFG` pass to skip folding when both blocks contain convergence loop/entry intrinsics. (#166452 ) Fixes a bug https://github.com/llvm/llvm-project/issues/165642. [Similar fix](https://github.com/llvm/llvm-project/pull/165643) is being made in `IndVarSimplify` pass to account for convergence tokens. [LLVM Spec](https://llvm.org/docs/ConvergentOperations.html#llvm-experimental-convergence-loop) states that only a single loop / entry convergence token can be included in a basic block. This PR fixes the issue in `SimplifyCFG` pass so that when a basic block and its predecessor both contain such convergence intrinsics, it skips merging the two blocks.	2025-11-30 06:57:06 +09:00
Florian Hahn	cd3192a2c9	[VPlan] Turn IVOp assertion into early exit. Turn assertion added in 99addbf73 [0] into an early exit. There are cases where the operand may not be a VPWidenIntOrFpInductionRecipe, e.g. if the IV increment is selected, as in the test cases. [0] https://github.com/llvm/llvm-project/pull/141431	2025-11-29 20:49:22 +00:00
Fangrui Song	f571293124	ARC: Override pseudos with pointers This ports #159881 fix for other targets and fixes ``` error: missing target override for pseudoinstruction using PointerLikeRegClass ```	2025-11-29 11:23:59 -08:00

1 2 3 4 5 ...

561022 Commits