llvm-project

Author	SHA1	Message	Date
Craig Topper	2d58925362	[LegalizeVectorOps][RISCV] Support condition code legalization for ISD::STRICT_FSETCC/FSETCCS during LegalizeVectorOps. Switch RISC-V to legalize during LegalizeVectorOps instead of LegalizeDAG. LegalizeDAG uses the OpVT for legalize action while LegalizeVectorOps uses the result VT. We really should fix that.	2023-04-29 22:55:41 -07:00
Craig Topper	344368fb98	[TargetLowering] Stop passing an ISD::CondCode to isOperationLegalOrCustom. ISD::CondCode is a separate num space from opcodes. isOperationLegalOrCustom should take an opcode. Reviewed By: barannikov88 Differential Revision: https://reviews.llvm.org/D149528	2023-04-29 15:23:09 -07:00
Sergei Barannikov	e744e51b12	[SelectionDAG] Rename ADDCARRY/SUBCARRY to UADDO_CARRY/USUBO_CARRY (NFC) This will make them consistent with other overflow-aware nodes. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D148196	2023-04-29 21:59:58 +03:00
Craig Topper	df017ba9d3	[TargetLowering] Don't use ISD::SELECT_CC in expandFP_TO_INT_SAT. This function gets called for vectors and ISD::SELECT_CC was never intended to support vectors. Some updates were made to support it when this function started getting used for vectors. Overall, using separate ISD::SETCC and ISD::SELECT looks like an improvement even for scalar. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D149481	2023-04-29 10:23:08 -07:00
Matt Arsenault	bc37be1855	LangRef: Add "dynamic" option to "denormal-fp-math" This is stricter than the default "ieee", and should probably be the default. This patch leaves the default alone. I can change this in a future patch. There are non-reversible transforms I would like to perform which are legal under IEEE denormal handling, but illegal with flushing zero behavior. Namely, conversions between llvm.is.fpclass and fcmp with zeroes. Under "ieee" handling, it is legal to translate between llvm.is.fpclass(x, fcZero) and fcmp x, 0. Under "preserve-sign" handling, it is legal to translate between llvm.is.fpclass(x, fcSubnormal\|fcZero) and fcmp x, 0. I would like to compile and distribute some math library functions in a mode where it's callable from code with and without denormals enabled, which requires not changing the compares with denormals or zeroes. If an IEEE function transforms an llvm.is.fpclass call into an fcmp 0, it is no longer possible to call the function from code with denormals enabled, or write an optimization to move the function into a denormal flushing mode. For the original function, if x was a denormal, the class would evaluate to false. If the function compiled with denormal handling was converted to or called from a preserve-sign function, the fcmp now evaluates to true. This could also be of use for strictfp handling, where code may be changing the denormal mode. Alternative name could be "unknown". Replaces the old AMDGPU custom inlining logic with more conservative logic which tries to permit inlining for callees with dynamic handling and avoids inlining other mismatched modes.	2023-04-29 08:44:59 -04:00
Luo, Yuanke	40222ddcf8	[X86] Fix the vnni machine combine issue. The previous patch (D148980) didn't set the InstrIdxForVirtReg correctly in genAlternativeDpCodeSequence(). It causes vnni lit test failure when LLVM_ENABLE_EXPENSIVE_CHECKS is on.	2023-04-29 13:51:08 +08:00
Wang, Xin10	9c1e4ee690	[NFC]Fix 2 logic dead code First, in CodeGenPrepare.cpp, line 6891, the VectorCond will always be false because if not function will return at 6888. Second, in SelectionDAGBuilder.cpp, line 5443, getSExtValue() will return value as int type, but now we use unsigned Val to maintain it, which make the if condition at 5452 meaningless. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D149033	2023-04-28 03:02:59 -04:00
Jordan Rupprecht	fbf42f1fe2	Revert "[CodeGenPrepare] Estimate liveness of loop invariants when checking for address folding profitability" This reverts commit 5344d8e10bb7d8672d4bfae8adb010465470d51b. It causes non-determinism when building clang. See the review thread on D143897.	2023-04-27 19:16:32 -07:00
Nick Desaulniers	012ea747ed	[CodeGen][MachineLastInstrsCleanup] fix INLINEASM_BR hazard If the removable definition resides in an INLINEASM_BR target, the reuseable candidate might not dominate the INLINEASM_BR. bb0: INLINEASM_BR &"" %bb.1 renamable $x8 = MOVi64imm 29273397577910035 B %bb.2 ... bb1: renamable $x8 = MOVi64imm 29273397577910035 renamable $x8 = ADDXri killed renamable $x8, 2048, 0 bb2: Removing the second mov is a hazard when the inline asm branches to bb1. Skip such replacements when the to be removed instruction is in the target of such an INLINEASM_BR instruction. We could get more aggressive about this in the future, but for now simply abort. This is causing a boot failure on linux-4.19.y branches of the LTS Linux kernel for ARCH=arm64 with CONFIG_RANDOMIZE_BASE=y (KASLR) and CONFIG_UNMAP_KERNEL_AT_EL0=y (KPTI). Link: https://reviews.llvm.org/D123394 Link: https://github.com/ClangBuiltLinux/linux/issues/1837 Thanks to @nathanchance for the report, and @ardb for debugging. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D149191	2023-04-27 13:40:00 -07:00
ManuelJBrito	d22edb9794	[IR][NFC] Change UndefMaskElem to PoisonMaskElem Following the change in shufflevector semantics, poison will be used to represent undefined elements in shufflevector masks. Differential Revision: https://reviews.llvm.org/D149256	2023-04-27 18:01:54 +01:00
Alexis Engelke	1e743732e7	[RegAllocFast] Use uint16_t SparseT for LiveRegMap For functions with very large numbers of live variables, lookups into LiveRegMap previously detoriated to linear searches. This slightly increases memory usage, but that is barely measurable. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D149330	2023-04-27 18:58:49 +02:00
Craig Topper	0b5396b163	[LegalizeVectorOps] Use all ones mask when expanding i1 VP_SELECT. We were previously using the condition as the mask. By the semantics of VP operations, that means that anywhere the condition is false returns poison and not the false operand. Use an all ones mask instead. No tests are affected because RISC-V drops the mask when lowering. Reviewed By: fakepaper56 Differential Revision: https://reviews.llvm.org/D149310	2023-04-27 08:26:16 -07:00
Jay Foad	fdc0d5f399	[DAG] Do not call computeKnownBits from isKnownToBeAPowerOfTwo The only way known bits could help identify a known power of two is if it knows exactly which power of two it is, i.e. if it is a known constant. But in that case the value should have been simplified to a constant already. So save some compile time by not calling computeKnownBits. Differential Revision: https://reviews.llvm.org/D149325	2023-04-27 11:05:56 +01:00
Luo, Yuanke	8f7f9d86a7	[X86] Machine combine vnni instruction. "vpmaddwd + vpaddd" can be combined to vpdpwssd and the latency is reduced after combination. However when vpdpwssd is in a critical path the combination get less ILP. It happens when vpdpwssd is in a loop, the vpmaddwd can be executed in parallel in multi-iterations while vpdpwssd has data dependency for each iterations. If vpaddd is in a critical path while vpmaddwd is not, it is profitable to split vpdpwssd into "vpmaddwd + vpaddd". This patch is based on the machine combiner framework to acheive decision on "vpmaddwd + vpaddd" combination. The typical example code is as below. ``` __m256i foo(int cnt, __m256i c, __m256i b, __m256i *p) { for (int i = 0; i < cnt; ++i) { __m256i a = p[i]; __m256i m = _mm256_madd_epi16 (b, a); c = _mm256_add_epi32(m, c); } return c; } ``` Differential Revision: https://reviews.llvm.org/D148980	2023-04-27 16:42:04 +08:00
Jay Foad	47d3cbcf84	[BranchFolder] Skip redundant IMPLICIT_DEFs of subregs Differential Revision: https://reviews.llvm.org/D148509	2023-04-27 09:40:06 +01:00
Mingming Liu	9879e5865a	[InlineAsm][AArch64]Add backend support for flag output parameters - The set of flag is from https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html#Flag-Output-Operands Before: - ARM64 GCC supports flag output constraints, while Clang doesn't parse condition code, as shown in https://gcc.godbolt.org/z/7jzMEK796 - LLVM ISel won't lower them either (as shown in https://gcc.godbolt.org/z/Pv4PPf56c) After: - Given flag output constraints in LLVM IR, condition code is parsed and flag output is lowered to 'cset'. - Clang parse is not added in this patch. Differential Revision: https://reviews.llvm.org/D149032	2023-04-26 09:18:41 -07:00
Felipe de Azevedo Piovezan	815eab2d3c	[DebugLocEntry][nfc] Remove redundant cast A cast from DIExpression->DIExpression is not needed. Differential Revision: https://reviews.llvm.org/D149178	2023-04-26 07:56:15 -04:00
OCHyams	ac6e177ce6	[Assignment Tracking] Remove overly defensive AllocaInst assertion Remove assert from AssignmentTrackingAnalysis that fires if a local variable has non-alloca storage. The analysis can emit these locations but the assignment tracking code in SelectionDAG isn't ready to handle non-alloca storage for locals yet. The AssignmentTrackingPass (pass that adds assignment tracking metadata) ignores non-alloca dbg.declares, so the only variables affected are those who's backing storage is changed from an alloca during optimisation, and the result is the variables are dropped. Fixes: https://ci.chromium.org/ui/p/pigweed/builders/toolchain/ toolchain-ci-pigweed-linux/b8783274592206481489/overview Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D149135	2023-04-26 11:24:31 +01:00
OCHyams	b59d672ed4	[Assignment Tracking] Fix faulty assertion inside std::sort predicate The vectors being sorted here shouldn't contain duplicate entries. Prior to this patch this was checked with an assert within the `std::sort` predicate. However, `std::sort` may compare an element against itself which causes the assert to fire (false positive). Move the assert outside of the sort predicate to avoid such issues. Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D149045	2023-04-26 11:14:51 +01:00
OCHyams	2b3c13b716	[DebugInfo] Treat empty metadata operands the same as undef operands in SelectionDAG Without this patch SelectionDAG silently drops dbg.values using `!{}` operands. Related to https://discourse.llvm.org/t/auto-undef-debug-uses-of-a-deleted-value Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D140990	2023-04-26 09:03:07 +01:00
Kazu Hirata	1ca0cb717a	[llvm] Replace None with std::nullopt in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-04-25 23:53:32 -07:00
Mikael Holmen	f7bee65728	[TailDuplicator] Don't constrain register classes due to debug instructions If cloning a DBG_VALUE instruction, register uses in that instruction could lead to constraining of a virtual register that would not happen if the DBG_VALUE was not present at all. This lead to different code with/without debug info. Now we only do that register class constraining if we dealing with a non debug instruction. Differential Revision: https://reviews.llvm.org/D149146	2023-04-26 08:17:42 +02:00
Julian Lettner	c3f0153ec2	[MachO] Disable atexit()-based lowering when LTO'ing kernel/kext code The kernel and kext environments do not provide the `__cxa_atexit()` function, so we can't use it for lowering global module destructors. Unfortunately, just querying for "compiling for kernel/kext?" in the LTO pipeline isn't possible (kernel/kext identifier isn't part of the triple yet) so we need to pass down a CodeGen flag. rdar://93536111 Differential Revision: https://reviews.llvm.org/D148967	2023-04-25 12:13:40 -07:00
Jeremy Morse	7561f9cd5c	[DebugInfo][CSInfo] Avoid crash when defining super-regs In rare situations involving AVX intrinsics, it seems LLVM can be coaxed into generating copies to arguments that look like this: $xmm0 = VMOVAPSrr $xmm1, implicit-def $ymm0 CALL64 @something ymm0 This particular form of copy implicitly zeros the upper lanes of ymm0, hence there's an implicit-def for the register in the copy. The X86 implementation of describeLoadedValue doesn't attempt to describe this sort of copy which causes the generic implementation in TargetInstrInfo::describeLoadedValue to fire an assertion saying it expected the target hook to handle it. Play it safe in the generic implementation and return the "no location / value" return value, rather than asserting. Differential Revision: https://reviews.llvm.org/D148626	2023-04-25 14:15:33 +01:00
Piyou Chen	8a3950510f	[RISCV] Support scalar/fix-length vector NTLH intrinsic with different domain This commit implements the two NTLH intrinsic functions. ``` type __riscv_ntl_load (type ptr, int domain); void __riscv_ntl_store (type ptr, type val, int domain); ``` ``` enum { __RISCV_NTLH_INNERMOST_PRIVATE = 2, __RISCV_NTLH_ALL_PRIVATE, __RISCV_NTLH_INNERMOST_SHARED, __RISCV_NTLH_ALL }; ``` We encode the non-temporal domain into MachineMemOperand flags. 1. Create the RISC-V built-in function with custom semantic checking. 2. Assume the domain argument is a compile time constant, and make it as LLVM IR metadata (nontemp_node). 3. Encode domain value as two bits MachineMemOperand TargetMMOflag. 4. According to MachineMemOperand TargetMMOflag, select corrsponding ntlh instruction. Currently, it supports scalar type and fixed-length vector type. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D143364	2023-04-24 20:15:14 -07:00
NAKAMURA Takumi	d45fae6010	Move CodeGen/LowLevelType => CodeGen/LowLevelTypeUtils Before restoring `CodeGen/LowLevelType`, rename this to `LowLevelTypeUtils`. Differential Revision: https://reviews.llvm.org/D148768	2023-04-25 08:53:17 +09:00
Simon Pilgrim	be93256655	[VP] Add IR expansion for fneg Followup to D149052, it wasn't worthwhile to add general support for unary opcodes	2023-04-24 16:14:06 +01:00
Simon Pilgrim	0b7f53efec	[VP] IR expansion for fabs/fsqrt/fma/fmadd Add basic handling for VP ops that can expand to FP intrinsics Fixes #60464 Differential Revision: https://reviews.llvm.org/D149052	2023-04-24 15:20:07 +01:00
Simon Pilgrim	b0832fca3f	[DAG] Add ISD::isExtVecInRegOpcode helper. Match ISD::ANY_EXTEND_VECTOR_INREG\ZERO_EXTEND_VECTOR_INREG\SIGN_EXTEND_VECTOR_INREG opcodes	2023-04-24 14:47:23 +01:00
Tom Weaver	b63c08c773	Revert "[Coverity] Fix explicit null dereferences" This reverts commit 22b23a5213b57ce1834f5b50fbbf8a50297efc8a. This commit caused the following two build bots to start failing: https://lab.llvm.org/buildbot/#/builders/216/builds/20322 https://lab.llvm.org/buildbot/#/builders/123/builds/18511	2023-04-24 11:14:10 +01:00
Momchil Velikov	5344d8e10b	[CodeGenPrepare] Estimate liveness of loop invariants when checking for address folding profitability When checking the profitability of folding an address computation into a memory instruction, the compiler tries to determine the liveness of the values, comprising the address, at the point of the memory instruction. This patch improves on the live variable estimates by including the loop invariants which are references in the loop body. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D143897	2023-04-24 10:21:36 +01:00
Christian Ulmann	f5425c128a	[LoopInfo] Move generic LoopInfo into own files This commit splits the generic part of `LoopInfo` into separate files. These new `GenericLoopInfo` files are located in `llvm/Support` to be inline with `GenericDomTree`. Furthermore, this change ensures that MLIR's Bazel build does not have to link against `LLVMAnalysis` just to use these template headers. Depends on D148219 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D148235	2023-04-24 06:07:05 +00:00
Wang, Xin10	76cc949212	Clean come dead code These codes deleted are dead code, we never go into it. 1. In AggressiveAntiDepBreaker.cpp, have assert AntiDepReg != 0. 2. IfConversion.cpp, Kind can only be one unique value, so isFalse && isRev can never be true. 3. DAGCombiner.cpp, at line 3675, we have considered the condition like ``` // fold (sub x, c) -> (add x, -c) if (N1C) { return DAG.getNode(ISD::ADD, DL, VT, N0, DAG.getConstant(-N1C->getAPIntValue(), DL, VT)); } ``` 4. ScheduleDAGSDNodes.cpp, we have Latency > 1 at line 663 5. MasmParser.cpp, code exists in a switch-case block which decided by the value FirstTokenKind, at line 1621, FirstTokenKind could only be one of AsmToken::Dollar, AsmToken::At and AsmToken::Identifier. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D148610	2023-04-23 20:46:34 -04:00
Fangrui Song	0d333bf0e3	Remove ExplicitEmulatedTLS and simplify -femulated-tls handling Currently clangDriver passes -femulated-tls and -fno-emulated-tls to cc1. cc1 forwards the option to LLVMCodeGen and ExplicitEmulatedTLS is used to decide the value. Simplify this by moving the Clang decision to clangDriver and moving the LLVM decision to InitTargetOptionsFromCodeGenFlags.	2023-04-23 11:55:12 -07:00
Akshay Khadse	22b23a5213	[Coverity] Fix explicit null dereferences This change fixes static code analysis errors Reviewed By: skan Differential Revision: https://reviews.llvm.org/D148912	2023-04-23 12:07:11 +08:00
David Green	33fe899cef	[DAG][AArch64] Limit preferIncOfAddToSubOfNot until after legalization if the node has wrap flags If the add node has wrap flags then they will be destroyed by converting to sub/not. The flags can be useful in converting to rhadd, for example, but that may be required late if the node types need to be legalized. This limits the preferIncOfAddToSubOfNot fold until after legalize DAG if the node have flags to allow more folding. Differential Revision: https://reviews.llvm.org/D148809	2023-04-21 18:35:58 +01:00
Momchil Velikov	4f02a0f606	[NFC][CodeGenPrepare] Match against the correct instruction when checking profitability of folding an address The "nested" `AddressingModeMatcher`s in `AddressingModeMatcher::isProfitableToFoldIntoAddressingMode` are constructed using the original memory instruction, even though they check whether the address operand of a differrent memory instructon is foldable. The memory instruction is used only for a dominance check (when not checking for profitability), and using the wrong memory instruction does not change the outcome of the test - if an address is foldable, the dominance test afects which of the two possible ways to fold is chosen, but this result is discarded. As an example, in target triple = "x86_64-linux" declare i1 @check(i64, i64) define i32 @f(i1 %cc, ptr %p, ptr %q, i64 %n) { entry: br label %loop loop: %iv = phi i64 [ %i, %C ], [ 0, %entry ] %offs = mul i64 %iv, 4 %c.0 = icmp ult i64 %iv, %n br i1 %c.0, label %A, label %fail A: br i1 %cc, label %B, label %C C: %u = phi i32 [0, %A], [%w, %B] %i = add i64 %iv, 1 %a.0 = getelementptr i8, ptr %p, i64 %offs %a.1 = getelementptr i8, ptr %a.0, i64 4 %v = load i32, ptr %a.1 %c.1 = icmp eq i32 %v, %u br i1 %c.1, label %exit, label %loop B: %a.2 = getelementptr i8, ptr %p, i64 %offs %a.3 = getelementptr i8, ptr %a.2, i64 4 %w = load i32, ptr %a.3 br label %C exit: ret i32 -1 fail: ret i32 0 } the dominance test is perfomed between `%i = ...` and `%v = ...` at the moment we're checking whether `%a3 = ...` is foldable Using the memory instruction, which uses the interesting address is "more correct" and this change is needed by a future patch. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D143896	2023-04-21 18:09:51 +01:00
Momchil Velikov	6c9066fe2e	Recommit "[AArch64] Fix incorrect `isLegalAddressingMode`" This patch recommits 0827e2fa3fd15b49fd2d0fc676753f11abb60cab after reverting it in ed7ada259f665a742561b88e9e6c078e9ea85224. Added workround for `Targetlowering::AddrMode` no longer being an aggregate in C++20. `AArch64TargetLowering::isLegalAddressingMode` has a number of defects, including accepting an addressing mode, which consists of only an immediate operand, or not checking the offset range for an addressing mode in the form `1*ScaledReg + Offs`. This patch fixes the above issues. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D143895 Change-Id: I41a520c13ce21da503ca45019979bfceb8b648fa	2023-04-21 16:21:01 +01:00
Igor Kirillov	6850bc35c6	[CodeGen] Enable AArch64 SVE FCMLA/FCADD instruction generation in ComplexDeinterleaving This commit adds support for scalable vector types in theComplexDeinterleaving pass, allowing it to recognize and handle `llvm.vector.interleave2` and `llvm.vector.deinterleave2` intrinsics for both fixed and scalable vectors Differential Revision: https://reviews.llvm.org/D147451	2023-04-21 09:58:35 +00:00
Serguei Katkov	aa5cc39b6d	[BreakFalseDeps] Respect dead blocks. The pass uses ReachingDefAnalysis which has no information about instructions in dead blocks. So do not process them. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D148329	2023-04-21 12:17:04 +07:00
Akshay Khadse	aab0ca3e79	Fix uninitialized scalar members in CodeGen This change fixes some static code analysis warnings. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D148811	2023-04-21 12:22:34 +08:00
Sp00ph	3c9083f675	Fix i1 vector reduction miscompilation Previously, `vecreduce_{and,or} vNi1` could lead to miscompilations because the legalizer first decides to `any_ext` the operand (which is correct for `vecreduce_{and,or}`) and then decides to use `vecreduce_u{min,max}` instead (for which `any_ext` is incorrect). This patch changes it so the `vecreduce_u{min,max}` operations use `sign_ext` instead of `any_ext`. Issue: https://github.com/llvm/llvm-project/issues/62211 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D148672	2023-04-20 15:26:49 -07:00
David Green	bbc983d33a	[DAG] Retain nuw flags when reassociating adds Given two adds that are both nuw, they will still be nuw after being reassociated. (They only increase in value and at no point wrap). https://alive2.llvm.org/ce/z/JrYM6H Differential Revision: https://reviews.llvm.org/D148804	2023-04-20 19:05:45 +01:00
Momchil Velikov	ed7ada259f	Revert "[AArch64] Fix incorrect `isLegalAddressingMode`" This reverts commit 0827e2fa3fd15b49fd2d0fc676753f11abb60cab. Failing buildbot, perhaps due to `-std=c++20`.	2023-04-20 16:10:45 +01:00
Momchil Velikov	0827e2fa3f	[AArch64] Fix incorrect `isLegalAddressingMode` `AArch64TargetLowering::isLegalAddressingMode` has a number of defects, including accepting an addressing mode which consists of only an immediate operand, or not checking the offset range for an addressing mode in the form `1*ScaledReg + Offs`. This patch fixes the above issues. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D143895 Change-Id: I756fa21941844ded44f082ac7eea4391219f9851	2023-04-20 15:43:11 +01:00
Simon Pilgrim	41053053e3	[DAG] SimplifyVCastOp - ensure we select the correct value type from an SDValue operand As reported on Issue #62234 - we weren't correctly using the SDValue operand to get its value type, resulting in a failure when it came from a SDNode with multiple results We haven't been able to create a suitable upstream regression test, but its been confirmed by inspection by both myself and @topperc Fixes #62234	2023-04-20 10:38:10 +01:00
Akshay Khadse	43b38696aa	Fix uninitialized class members Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D148692	2023-04-20 11:18:34 +08:00
Dinar Temirbulatov	e6096871fd	[DAGCombine][AArch64] Allow transformable to legal vectors to be used for MULH lowering. It looks like it is still profitable to accept a transformable to a legal vector type, not just a legal vector, as long as vector elements are the same between two of those types. Differential Revision: https://reviews.llvm.org/D148229	2023-04-19 13:24:58 +00:00
Fangrui Song	d14460d00e	[AsmPrinter] Fix placement of function entry comments The placement is currently wrong in the presence of function entry related instrumentations (prefixdata, -fpatchable-function-entry=, -fsanitize=kcfi, etc).	2023-04-18 15:01:36 -07:00
Rahman Lavaee	ef8dedc79c	Refactor BasicBlockSectionsProfileReader::getBBClusterInfoForFunction.	2023-04-18 21:21:54 +00:00

1 2 3 4 5 ...

33936 Commits