This removes some erroneous debug info from the tests, which should address
the test failures that showed up when this was previously committed.
This reverts commit 6716ce8b641f0e42e2343e1694ee578b027be0c4.
This started out as an attempt to combine bf16 fpround into BFCVT2
instructions, but ended up removing the aarch64.neon.bfcvt intrinsics in
favour of generating fpround instructions directly. This simplifies the
patterns and can lead to other optimizations. The BFCVT2 instruction is
adjusted to make sure the types are valid, and a bfcvt2 is now
generated in more places. The old intrinsics are auto-upgraded to fptrunc
instructions too.
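For illustration, a call to the removed intrinsic now auto-upgrades to a
plain truncating conversion. This is only a sketch and the intrinsic
signature shown is approximate:
```
; Before (old intrinsic, signature approximate):
%r = call bfloat @llvm.aarch64.neon.bfcvt(float %x)
; After auto-upgrade:
%r = fptrunc float %x to bfloat
```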
Summary:
This is spelled `ompx_aligned_barrier` when used directly, but wasn't
included in the list of known assumptions. Fix that so that the test now
works.
This reverts commit c3a935e3f967f8f22f5db240d145459ee621c1e0.
The only difference from the previously reverted commit is that this also
updates the OCaml bindings according to the C debug-info API changes.
The build failure originally introduced was:
```
FAILED: bindings/ocaml/debuginfo/debuginfo_ocaml.o /b/1/llvm-clang-x86_64-expensive-checks-debian/build/bindings/ocaml/debuginfo/debuginfo_ocaml.o
cd /b/1/llvm-clang-x86_64-expensive-checks-debian/build/bindings/ocaml/debuginfo && /usr/bin/ocamlfind ocamlc -c /b/1/llvm-clang-x86_64-expensive-checks-debian/build/bindings/ocaml/debuginfo/debuginfo_ocaml.c -ccopt "-I/b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/bindings/ocaml/debuginfo/../llvm -D_GNU_SOURCE -D_DEBUG -D_GLIBCXX_ASSERTIONS -DEXPENSIVE_CHECKS -D_GLIBCXX_DEBUG -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -I/b/1/llvm-clang-x86_64-expensive-checks-debian/build/include -I/b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/include -DNDEBUG "
/b/1/llvm-clang-x86_64-expensive-checks-debian/build/bindings/ocaml/debuginfo/debuginfo_ocaml.c: In function ‘llvm_dibuild_create_object_pointer_type’:
/b/1/llvm-clang-x86_64-expensive-checks-debian/build/bindings/ocaml/debuginfo/debuginfo_ocaml.c:620:30: error: too few arguments to function ‘LLVMDIBuilderCreateObjectPointerType’
620 | LLVMMetadataRef Metadata = LLVMDIBuilderCreateObjectPointerType(
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /b/1/llvm-clang-x86_64-expensive-checks-debian/build/bindings/ocaml/debuginfo/debuginfo_ocaml.c:23:
/b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/include/llvm-c/DebugInfo.h:880:17: note: declared here
880 | LLVMMetadataRef LLVMDIBuilderCreateObjectPointerType(LLVMDIBuilderRef Builder,
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
```
print-after-all is useful for diffing IR between two passes. When one of
the two is a function pass, and the other is a loop pass, the diff
becomes useless. Add an option which prints the entire function for loop
passes.
This came up recently with a nodebug case on CodeView, which caused a null
entry in the elements list and crashed LLVM.
The original clang fix, which avoids generating IR like this: 504dd577675e8c85cdc8525990a7c8b517a38a89
If a pointer gets freed, it may not be dereferenceable any longer, even
though there is a dominating dereferenceable assumption. As a first step,
when UseDerefAtPointSemantics is used, only consider assumptions if the
pointer value cannot be freed.
PR: https://github.com/llvm/llvm-project/pull/123196
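As an illustrative IR sketch of the issue (function and value names are made
up):
```
declare void @llvm.assume(i1)
declare void @free(ptr)

define i64 @example(ptr %p) {
  call void @llvm.assume(i1 true) [ "dereferenceable"(ptr %p, i64 8) ]
  call void @free(ptr %p)
  ; Even though the assumption above dominates this load, %p may have been
  ; freed, so it must not be treated as dereferenceable at this point.
  %v = load i64, ptr %p
  ret i64 %v
}
```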
This patch adds the possibility to specify the alignment for the
llvm.masked.expandload/llvm.masked.compressstore intrinsics in IRBuilder
(this is mostly NFC for now, since it is only used in MemorySanitizer, but
there is an intention to generate these intrinsics in compiler
passes, e.g. in LoopVectorizer).
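For example (a sketch: the overload name and the use of an `align` parameter
attribute on the pointer operand reflect my reading of the intrinsic, not
code from the patch itself):
```
; Expand-load four floats from %ptr under %mask, with the pointer
; known to be 4-byte aligned.
%res = call <4 x float> @llvm.masked.expandload.v4f32(ptr align 4 %ptr, <4 x i1> %mask, <4 x float> %passthru)
```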
66badf2 (VT: teach a special-case optz about samesign) introduced a
compile-time regression due to the use of CmpPredicate::getMatching,
which is unnecessarily inefficient. Introduce
CmpPredicate::getPreferredSignedPredicate, which alleviates the
inefficiency problem and squashes the compile-time regression.
CmpPredicate::getMatching implicitly assumes that both predicates are
integer-predicates, and this has led to a crash being reported in
VectorCombine after e409204 (VectorCombine: teach foldExtractedCmps
about samesign). FP predicates are simple enough to handle as there is
never any samesign information associated with them: hence handle them
in CmpPredicate::getMatching, fixing the VectorCombine crash and
guarding against future incorrect usages.
Create an abstraction over isImplied{True,False}ByMatchingCmp to
faithfully communicate the result of both functions, cleaning up code in
callsites. While at it, fix a bug in the implied-false version of the
function, which was inadvertently dropping samesign information.
This introduces the `captures` attribute as described in:
https://discourse.llvm.org/t/rfc-improvements-to-capture-tracking/81420
This initial patch only introduces the IR/bitcode support for the
attribute and its in-memory representation as `CaptureInfo`. This will
be followed by a patch to upgrade and remove the `nocapture` attribute,
and then by actual inference/analysis support.
Based on the RFC feedback, I've used a syntax similar to the `memory`
attribute, though the only "location" that can be specified is `ret`.
I've added some pretty extensive documentation to LangRef on the
semantics. One non-obvious bit here is that using ptrtoint will not
result in a "return-only" capture, even if the ptrtoint result is only
used in the return value. Without this requirement we wouldn't be able
to continue ordinary capture analysis on the return value.
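A couple of illustrative declarations using the new attribute (the component
spellings follow the RFC and may differ slightly from the final syntax):
```
; The callee does not capture %p at all.
declare void @no_capture(ptr captures(none) %p)

; %p may be captured, but only through the return value.
declare ptr @passthrough(ptr captures(ret: address, provenance) %p)
```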
Move isImplied{True,False}ByMatchingCmp from CmpInst to ICmpInst, so
that it can operate on CmpPredicate instead of CmpInst::Predicate, and
teach it about samesign. There are two callers of this function, and we
choose to migrate the one in ValueTracking, namely
isImpliedCondMatchingOperands to CmpPredicate, hence teaching it about
samesign, with visible test impact.
This is a split-off from #109833 and only adds code relating to checking
if a struct-returning call can be vectorized.
This initial patch only allows the case where all users of the struct
return are `extractvalue` operations that can be widened.
```
%call = tail call { float, float } @foo(float %in_val)
%extract_a = extractvalue { float, float } %call, 0
%extract_b = extractvalue { float, float } %call, 1
```
Note: The tests require the VFABI changes from #119000 to pass.
This PR is motivated by a mismatch we discovered between compilation
results with vs. without `-g3`. We noticed this when compiling SPEC2017
testcases. The specific instance we saw is fixed in this PR by modifying
a guard (see below), but it is likely similar instances exist elsewhere
in the codebase.
The specific case fixed in this PR manifests itself in the `SimplifyCFG`
pass doing different things depending on whether DebugInfo is generated
or not. At the end of this comment, there is reduced example code that
shows the behavior in question.
The differing behavior has two root causes:
1. Commit https://github.com/llvm/llvm-project/commit/c07e19b adds loop
metadata including debug locations to loops that otherwise would not
have loop metadata
2. Commit https://github.com/llvm/llvm-project/commit/ac28efa6c100 adds
a guard to a simplification action in `SimplifyCFG` that prevents it
from simplifying away loop metadata
So, the change in 2. does not account for the fact that, when compiling with
debug symbols, loops that otherwise would not have metadata that needs
preserving now carry debug locations in their loop metadata. Thus, with
`-g3`, `SimplifyCFG` behaves differently than without it.
The larger issue is that while debug info is not supposed to influence
the final compilation result, commits like 1. blur the line between what
is and is not debug info, and not all optimization passes account for
this.
This PR does not address that and rather just modifies this particular
guard in order to restore equivalent behavior between debug and
non-debug builds in this one instance.
---
Here is a reduced version of a file from `526.blender_r` that showcases
the behavior in question:
```C
struct LinkNode;
typedef struct LinkNode {
  struct LinkNode *next;
  void *link;
} LinkNode;
void do_projectpaint_thread_ph_v_state() {
  int *ps = do_projectpaint_thread_ph_v_state;
  LinkNode *node;
  while (do_projectpaint_thread_ph_v_state)
    for (node = ps; node; node = node->next)
      ;
}
```
Compiling this with and without DebugInfo, and then disassembling the
results, leads to different outcomes (tested on SystemZ and X86). The
reason for this is that the `SimplifyCFG` pass does different things in
either case.
This patch updates the `VFABIDemangler` to support vector functions that
return struct types. For example, a vector variant of `sincos` that
returns a vector of sine values and a vector of cosine values within a
struct.
This patch also adds some helpers for vectorizing types (including
struct types). Some of these are used in the `VFABIDemangler`, and
others will be used in subsequent patches, so this patch simply adds
tests for them.
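As an example of the kind of vector variant this enables (the mangled name
below is only meant to be indicative of the VFABI naming scheme, not taken
from an actual library):
```
; Hypothetical 4-lane variant returning sines and cosines in a struct
; of two vectors.
declare { <4 x float>, <4 x float> } @_ZGVnN4v_sincosf(<4 x float>)
```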
This makes sure no optimizations are applied that assume the
bigger alignment or size, which could be incorrect if we link
together with non-instrumented code.
Improve the non-fatal cases to use DiagnosticInfo, which will now
provide a location. The allocators attempt to report a different error
if they happen to see that inline assembly is involved (this detection is
quite unreliable), using srcloc instead of dbgloc. For now, leave this
behavior unchanged. I think reporting the full location and context
function would be more useful.
Strip hash_value() for CmpPredicate, as different callers have different
hashing use-cases. In this case, there is just one caller, namely
EarlyCSE, which calls hash_combine() on a CmpPredicate, which used to
call hash_combine() on a CmpInst::Predicate prior to 4a0d53a
(PatternMatch: migrate to CmpPredicate). This has uncovered a bug where
two icmp instructions differing in just the fact that one of them has
the samesign flag on it are hashed differently, leading to divergent
hashing, and a crash. Fix this crash by dropping samesign information on
icmp instructions before hashing them, preserving the former behavior.
Fixes #119893.
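For illustration, the pair of instructions in question differs only in the
samesign flag on one of them; dropping samesign before hashing makes such a
pair hash the same again:
```
%c1 = icmp samesign ult i32 %a, %b
%c2 = icmp ult i32 %a, %b
```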
With the introduction of CmpPredicate in 51a895a (IR: introduce struct
with CmpInst::Predicate and samesign), PatternMatch is one of the first
key pieces of infrastructure that must be updated to match a CmpInst
respecting samesign information. Implement this change to Cmp-matchers.
This is a preparatory step in migrating the codebase over to
CmpPredicate. Since no functional changes are desired at this stage,
we have chosen not to migrate CmpPredicate::operator==(CmpPredicate)
calls to use CmpPredicate::getMatching(), as that would have visible
impact on tests that are not yet written: instead, we call
CmpPredicate::operator==(Predicate), preserving the old behavior, while
also inserting a few FIXME comments for follow-ups.
To avoid changing the behavior in the general case, only do this
for anonymous functions. Otherwise, we'll end up with a leading
'@' on the name, which may not be meaningful to end users.
Currently LLVMContext::emitError emits any error as an "inline asm"
error which does not make any sense. InlineAsm appears to be special,
in that it uses a "LocCookie" from srcloc metadata, which looks like
a parallel mechanism to ordinary source line locations. This meant
that other types of failures had degraded source information reported
when available.
Introduce some new generic error types, and only use inline asm
in the appropriate contexts. The DiagnosticInfo types are still
a bit of a mess, and I'm not sure why DiagnosticInfoWithLocationBase
exists instead of just having an optional DiagnosticLocation in the
base class.
DK_Generic is for any error that derives from an IR level instruction,
and thus can pull debug locations directly from it. DK_GenericWithLoc
is functionally the generic codegen error, since it does not depend
on the IR and instead can construct a DiagnosticLocation from the
MI debug location.
This PR changes how target extension types are printed when they are
emitted as IR.
This prevents phrases like "struct = type {...}" from being repeated
over and over in the emitted IR.
Additionally, it should allow opt to not crash when parsing the DXIL
output.
Fixes [#114131](https://github.com/llvm/llvm-project/issues/114131)
Clang [defaults to aligning `__int128_t` to 16 bytes], while LLVM
`datalayout` strings [default to aligning `i128` to 8 bytes]. Wasm is
currently using the defaults for both, so it's inconsistent. Fix this by
adding `-i128:128` to Wasm's `datalayout` string so that it aligns
`i128` to 16 bytes too.
This is similar to
[llvm/llvm-project@dbad963](dbad963a69)
for SPARC.
This fixes rust-lang/rust#133991; see that issue for further discussion.
[defaults to aligning `__int128_t` to 16 bytes]:
f8b4182f07/clang/lib/Basic/TargetInfo.cpp (L77)
[default to aligning `i128` to 8 bytes]:
https://llvm.org/docs/LangRef.html#langref-datalayout
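As a sketch, the relevant part of the wasm32 `datalayout` string with the
new component looks like this (only a fragment; the real string contains
additional components):
```
target datalayout = "e-m:e-p:32:32-i64:64-i128:128-n32:64-S128"
```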
This avoids the need to dynamically relocate each pointer in the table.
To make this work, this PR also moves the binary search of intrinsic
names into an internal function with an adjusted signature, and switches
the unit tests to test against actual intrinsics.