llvm-project

Author	SHA1	Message	Date
Ruiling, Song	e33e087a17	[MachineSink] Update register dependency correctly (#109763 ) The accumulateUsedDefed() was missing if block prologue interference check does not pass. This would cause incorrect register dependency, which cause incorrect sinking.	2024-09-25 08:42:40 +08:00
Fangrui Song	4a9da96dc6	[clang] Add cc1 --output-asm-variant= to set output syntax 2fcaa549a824efeb56e807fcf750a56bf985296b (2010) added cc1as option `-output-asm-variant` (untested) to set the output syntax. `clang -cc1as -filetype asm -output-asm-variant 1` allows AT&T input and Intel output (`AssemblerDialect` is also used by non-x86 targets). This patch renames the cc1as option (to avoid collision with -o) and makes it available for cc1 to set output syntax. This allows different input & output syntax: ``` echo 'asm("mov $1, %eax");' \| clang -xc - -S -o - -Xclang --output-asm-variant=1 ``` Note: `AsmWriterFlavor` (with a misleading name), used to initialize MCAsmInfo::AssemblerDialect, is primarily used for assembly input, not for output. Therefore, `echo 'asm("mov $1, %eax");' \| clang -x c - -mllvm --x86-asm-syntax=intel -S -o -`, which achieves a similar goal before Clang 19, was unintended. Close #109157 Pull Request: https://github.com/llvm/llvm-project/pull/109360	2024-09-24 15:59:33 -07:00
Matt Arsenault	71ca9fcb8d	llvm-reduce: Don't print verifier failed machine functions (#109673 ) This produces far too much terminal output, particularly for the instruction reduction. Since it doesn't consider the liveness of of the instructions it's deleting, it produces quite a lot of verifier errors.	2024-09-24 22:32:53 +04:00
spupyrev	36dce5091e	[CodeLayout][NFC] Format and minor refactoring of MBP (#109729 ) This PR has two (NFC) commits: - clang-format MBP - move a part of tail duplication and block aligning into helper functions for better readability.	2024-09-24 08:36:57 -07:00
Bevin Hansson	12033e550b	[ISelDAG] Salvage debug info at isel by referring to frame indices. (#109126 ) We can refer to frame index locations when salvaging debug info for certain nodes, which prevents the compiler from optimizing out the location.	2024-09-24 15:02:04 +02:00
Benjamin Maxwell	3073c3c229	[SDAG] Avoid creating redundant stack slots when lowering FSINCOS (#108401 ) When lowering `FSINCOS` to a library call (that takes output pointers) we can avoid creating new stack allocations if the results of the `FSINCOS` are being stored. Instead, we can take the destination pointers from the stores and pass those to the library call. --- Note: As a NFC this also adds (and uses) `RTLIB::getFSINCOS()`.	2024-09-24 13:36:21 +01:00
Dominik Montada	8ba334bc4a	[MIR] Allow overriding isSSA, noPhis, noVRegs in MIR input (#108546 ) Allow setting the computed properties IsSSA, NoPHIs, NoVRegs for MIR functions in MIR input. The default value is still the computed value. If the property is set to false, the computed result is ignored. Conflicting values (e.g. setting IsSSA where the input MIR is clearly not SSA) lead to an error. Closes #37787	2024-09-24 14:21:45 +02:00
Vyacheslav Levytskyy	4f8e76684f	[AsmPrinter] Do not emit label instructions after the function body if the target is SPIR-V (#107013 ) AsmPrinter always creates a symbol for the end of function if valid debug info is present. However, this breaks SPIR-V target's output, because SPIR-V specification allows label instructions only inside a block, not after the function body (see https://registry.khronos.org/SPIR-V/specs/unified1/SPIRV.html#OpLabel). This PR proposes to disable emission of label instructions after the function body if the target is SPIR-V. This PR is a fix of the https://github.com/llvm/llvm-project/issues/102732 issue.	2024-09-24 13:33:31 +02:00
Matt Arsenault	b30b9eb7a8	LiveInterval: Make verify functions return bool (#109672 ) This will allow the MachineVerifier to check these properties instead of just asserting	2024-09-24 08:32:47 +04:00
Craig Topper	93baa018e0	[LegalizeVectorTypes] Preserve original MemoryOperand and MemVT when widening fixed vector load to vp_load. (#109473 ) Previously we were building a new memoperand with the size of the widened VT. This was causing a failure in our downstream with non-power of 2 vectorization. Alias analysis allowed rescheduling a 3 element vector load past 2 out of 3 scalar stores that overwrite what it was supposed to read. Alias analysis considers it undefined behavior to read more than the size of the underlying object. There is an exception if the underying objects is sufficiently aligned, but that doesn't apply in my failing case.	2024-09-23 10:20:52 -07:00
Nikita Popov	ecb98f9fed	[IRBuilder] Remove uses of CreateGlobalStringPtr() (NFC) Since the migration to opaque pointers, CreateGlobalStringPtr() is the same as CreateGlobalString(). Normalize to the latter.	2024-09-23 16:30:50 +02:00
chuongg3	b0dc7b5b86	[AArch64][GlobalISel] Prefer to use Vector Truncate (#105692 ) Tries to combine scalarised truncates into vector truncate operations EXAMPLE: `%a(i32), %b(i32) = G_UNMERGE %src(<2 x i32>)` `%T_a(i16) = G_TRUNC %a(i32)` `%T_b(i16) = G_TRUNC %b(i32)` `%Imp(i16) = G_IMPLICIT_DEF(i16)` `%dst(v8i16) = G_MERGE_VALUES %T_a(i16), %T_b(i16), %Imp(i16), %Imp(i16)` ===> `%Imp(<2 x i32>) = G_IMPLICIT_DEF(<2 x i32>)` `%Mid(<4 x s16>) = G_CONCAT_VECTORS %src(<2 x i32>), %Imp(<2 x i32>)` `%dst(<4 x s16>) = G_TRUNC %Mid(<4 x s16>)`	2024-09-23 13:52:37 +01:00
Pawan Nirpal	26f272ebbd	[X86][SelectionDAG] - Add support for llvm.canonicalize intrinsic (#106370 ) Enable support for fcanonicalize intrinsic lowering.	2024-09-23 12:15:38 +01:00
futog	3e0a76b1fd	[Codegen][LegalizeIntegerTypes] Improve shift through stack (#96151 ) Minor improvement on cc39c3b17fb2598e20ca0854f9fe6d69169d85c7. Use an aligned stack slot to store the shifted value. Use the native register width as shifting unit, so the load of the shift result is aligned. If the shift amount is a multiple of the native register width, there is no need to do a follow-up shift after the load. I added new tests for these cases. Co-authored-by: Gergely Futo <gergely.futo@hightec-rt.com>	2024-09-23 11:45:43 +02:00
abhishek-kaushik22	f28a035536	Fix memory leak in LLVMTargetMachine.cpp (#109610 ) Because `MAB` is a raw pointer, it could potentially leak memory because of the `\|\|` in the null check.	2024-09-23 17:11:56 +08:00
Nikita Popov	5a4c6f9799	[Loads] Check context instruction for context-sensitive derefability (#109277 ) If a dereferenceability fact is provided through `!dereferenceable` (or similar), it may only hold on the given control flow path. When we use `isSafeToSpeculativelyExecute()` to check multiple instructions, we might make use of `!dereferenceable` information that does not hold at the speculation target. This doesn't happen when speculating instructions one by one, because `!dereferenceable` will be dropped while speculating. Fix this by checking whether the instruction with `!dereferenceable` dominates the context instruction. If this is not the case, it means we are speculating, and cannot guarantee that it holds at the speculation target. Fixes https://github.com/llvm/llvm-project/issues/108854.	2024-09-23 09:13:09 +02:00
Kazu Hirata	46df454c9a	[CodeGen] Construct SmallVector with ArrayRef (NFC) (#109566 )	2024-09-22 00:17:10 -07:00
Kazu Hirata	22486e031d	[LiveDebugValues] Avoid repeated hash lookups (NFC) (#109509 )	2024-09-21 09:11:19 -07:00
Youngsuk Kim	d31e314131	[llvm] Don't call raw_string_ostream::flush() (NFC) Don't call raw_string_ostream::flush(), which is essentially a no-op. As specified in the docs, raw_string_ostream is always unbuffered. ( 65b13610a5226b84889b923bae884ba395ad084d for further reference )	2024-09-20 12:19:59 -05:00
Juan Manuel Martinez Caamaño	3c83102f06	[NFC][EarlyIfConverter] Remove unused member variables	2024-09-20 10:31:28 +02:00
Juan Manuel Martinez Caamaño	9e73159126	[NFC][EarlyIfConverter] Replace boolean Predicate for a class (#108519 ) Currently SSAIfConv is used in 2 scenarios. Generalize them to support more scenarios.	2024-09-20 09:54:05 +02:00
Akshat Oke	d2d78e584b	[NewPM][CodeGen] Port MachineLICM to NPM (#107376 )	2024-09-20 11:34:18 +05:30
Matt Arsenault	5326614e2f	AtomicExpand: Really allow incremental legalization (#108613 ) Fix up 100d9b89947bb1d42af20010bb594fa4c02542fc. The iterator fixes ended up defeating the point, since newly inserted blocks were not visited. This never erases the current block, so we can simply not preincrement the block iterator. The AArch64 FP atomic tests now expand the cmpxchg in the second round of legalization.	2024-09-20 08:18:33 +04:00
Thorsten Schütt	ccfe7d4b20	[GlobalIsel] Cleanup G_EXTRACT_VECTOR_ELT combines (#109047 ) Reduce duplicated build vector patterns by exploiting variadic args. Make index parameter const to improve hit rate. Use `getIConstantFromReg` to retrieve immediate because they are not fallible anymore. Improve extraction from build vector and shuffle vector.	2024-09-19 20:36:33 +02:00
Kazu Hirata	2d7d74d990	[LiveDebugValues] Avoid repeated hash lookups (NFC) (#109242 )	2024-09-19 09:15:06 -07:00
Jay Foad	e03f427196	[LLVM] Use {} instead of std::nullopt to initialize empty ArrayRef (#109133 ) It is almost always simpler to use {} instead of std::nullopt to initialize an empty ArrayRef. This patch changes all occurrences I could find in LLVM itself. In future the ArrayRef(std::nullopt_t) constructor could be deprecated or removed.	2024-09-19 16:16:38 +01:00
Jonas Paulsson	14120227a3	Target ABI: improve call parameters extensions handling (#100757 ) For the purpose of verifying proper arguments extensions per the target's ABI, introduce the NoExt attribute that may be used by a target when neither sign- or zeroextension is required (e.g. with a struct in register). The purpose of doing so is to be able to verify that there is always one of these attributes present and by this detecting cases where sign/zero extension is actually missing. As a first step, this patch has the verification step done for the SystemZ backend only, but left off by default until all known issues have been addressed. Other targets/front-ends can now also add NoExt attribute where needed and do this check in the backend.	2024-09-19 16:59:31 +02:00
Nikita Popov	7183771834	[InitUndef] Also handle inline asm (#108951 ) InitUndef should also handle early-clobber / undef conflicts in inline asm operands. Do this by iterating over all_defs() instead of defs(). The newly added ARM test was generating an "unpredictable STXP instruction, status is also a source" error prior to this change. Fixes https://github.com/llvm/llvm-project/issues/106380.	2024-09-19 09:59:36 +02:00
Phoebe Wang	c18be32185	Reland "[X86][BF16] Add libcall for F80 -> BF16 (#109116 )" (#109143 ) This reverts commit ababfee78714313a0cad87591b819f0944b90d09. Add X86 FP80 check.	2024-09-19 15:39:07 +08:00
Pierre van Houtryve	758444ca3e	[AMDGPU] Promote uniform ops to I32 in DAGISel (#106383 ) Promote uniform binops, selects and setcc between 2 and 16 bits to 32 bits in DAGISel Solves #64591	2024-09-19 09:00:21 +02:00
Craig Topper	80f6b42a26	[MachinePipeliner] Fix incorrect use of getPressureSets. (#109179 ) The code was passing a physical register directly to getPressureSets which expects a register unit. Fix this by looping over the register units and calling getPressureSets for each of them. Found while trying to add a RegisterUnit class to stop storing register units in `Register`. 0 is a valid register unit but not a valid Register.	2024-09-18 21:34:05 -07:00
Craig Topper	009398b3b3	[MachineVerifier] Improve checks for G_INSERT_SUBVECTOR. (#109209 ) -Improve messages. -Remove redundant checks that are handled in generic code. -Add check that the subvector is smaller than the vector. -Add checks that subvector is smaller than the vector.	2024-09-18 18:31:16 -07:00
Craig Topper	e494e2a294	[MachineVerifier] Improve G_EXTRACT_SUBVECTOR checking (#109202 ) Check that the destination of G_EXTRACT_SUBVECTOR is smaller than the source. Improve wording of error messages.	2024-09-18 18:29:32 -07:00
Craig Topper	d21a43579e	[LegalizeVectorOps][RISCV] Don't scalarize FNEG in ExpandFNEG if FSUB is marked Promote. We have a special check that tries to determine if vector FP operations are supported for the type to determine whether to scalarize or not. If FP arithmetic would be promoted, don't unroll. This improves Zvfhmin codegen on RISC-V.	2024-09-18 18:19:21 -07:00
Craig Topper	d5d1417659	[RISCV][GISel] Use libcalls for rint, nearbyint, trunc, round, and roundeven intrinsics. (#108779 )	2024-09-18 12:07:44 -07:00
Craig Topper	292ee93a87	[CodeGen] Use Register in SwitchLoweringUtils. NFC (#109092 ) Use an empty Register() instead of -1U.	2024-09-18 09:43:21 -07:00
Phoebe Wang	a10c9f994b	Revert "[X86][BF16] Add libcall for F80 -> BF16" (#109140 ) Reverts llvm/llvm-project#109116	2024-09-18 21:35:38 +08:00
Phoebe Wang	76eda76f9f	[X86][BF16] Add libcall for F80 -> BF16 (#109116 ) This fixes #108936, but the calling convention doesn't match with GCC. I doubt we have such a lib function for now, so leave the calling convention as is.	2024-09-18 21:23:10 +08:00
Craig Topper	9d3ab1c36e	[SelectionDAGBuilder] Use Register in more places. NFC"	2024-09-17 23:49:58 -07:00
Craig Topper	fe012bd52d	[SelectionDAG] Use Register around RegisterSDNode related functions. NFC RegisterSDNode itself already stored a Register.	2024-09-17 23:26:56 -07:00
Craig Topper	ca0613e0fc	[LegalizeFloatTypes] Handle replacement for strict ops inside SoftPromoteHalfOp_FP_TO_XINT. NFC Return SDValue() so we can notify the caller we did all replacements. Restore the getNumValues() == 1 check in the assert in the caller now that all handles only return nodes with a single result.	2024-09-17 16:25:10 -07:00
Michael Maitland	e08c2178ef	[MachineVerifier] Fix bug in MachineVerifier for G_INSERT_SUBVECTOR (#109048 )	2024-09-17 16:57:41 -04:00
Stephen Tozer	51a29b5f16	Revert2 "[DebugInfo][DWARF] Set is_stmt on first non-line-0 instruction in BB (#105524 )" Reverted due to large .debug_line size regressions for some configurations; work currently in place to improve the output of this behaviour in PR #108251. This patch also modifies two tests that were created or modified after the original commit landed and are affected by the revert: llvm/test/CodeGen/X86/pseudo_cmov_lower2.ll llvm/test/DebugInfo/X86/empty-line-info.ll This reverts commit 5fef40c2c477e92187bd4e5c18091eca6b8465cc.	2024-09-17 18:29:20 +01:00
Craig Topper	da46244e49	Revert "[LegalizeVectorOps] Make the AArch64 hack in ExpandFNEG more specific." This reverts commit 884ff9e3f9741ac282b6cf8087b8d3f62b8e138a. Regression was reported in Halide for arm32.	2024-09-17 09:04:43 -07:00
Craig Topper	f36580fcb5	[LegalizeVectorOps] Remove calls to DAG.UnrollVectorsOps from some expansion handlers. NFC (#108930 ) Instead, return SDValue() to tell the caller to do the unrolling. This is consistent with how some other handler work. Especially the handlers that live in TLI. ExpandBITREVERSE was rewritten to not take the Results vector an argument.	2024-09-17 08:35:22 -07:00
David Green	2242cd2b6a	[DAG] Fold vecreduce.or(sext(x)) to sext(vecreduce.or(x)) (#108959 ) The same is true for and / xor reductions, where the sext / zext can be sank down through the bitwise operation. https://alive2.llvm.org/ce/z/TvzCd5	2024-09-17 15:24:00 +01:00
Mikhail R. Gadelha	d2125e1db6	[RISCV] Support STRICT_UINT_TO_FP and STRICT_SINT_TO_FP (#102503 ) This patch adds support for the missing STRICT_UINT_TO_FP and STRICT_SINT_TO_FP for riscv and adds a test case for rv32 which was previously crashing. The code is in line with how other strict_* nodes are handled (e.g., getting op(1) instead of op(0) when it's a strict node, as op(0) in a strict node is the entry token).	2024-09-17 11:21:52 -03:00
Michael Maitland	ee2add0683	[GISEL] Fix bugs and clarify spec of G_EXTRACT_SUBVECTOR (#108848 ) The implementation was missing the fact that `G_EXTRACT_SUBVECTOR` destination and source vector can be different types. Also fix a bug in the MIR builder for `G_EXTRACT_SUBVECTOR` to generate the correct opcode. Clarify the G_EXTRACT_SUBVECTOR specification.	2024-09-17 10:08:39 -04:00
Thorsten Schütt	acfa294b5e	[GlobalIsel] Canonicalize G_FCMP (#108891 ) As a side-effect, we start constant folding fcmps.	2024-09-17 09:42:04 +02:00
Craig Topper	884ff9e3f9	[LegalizeVectorOps] Make the AArch64 hack in ExpandFNEG more specific. Only scalarize single element vectors when vector FSUB is not supported and scalar FNEG is supported.	2024-09-16 21:48:42 -07:00

1 2 3 4 5 ...

36493 Commits