llvm-project

Author	SHA1	Message	Date
luxufan	cf79773a90	[SCCP] Replace new value's value state with removed value's In replaceSignedInst, if a signed instruction can be repalced with unsigned instruction, we created a new instruction and removed the old instruction's value state. If the following instructions has this new instruction as a use operand, transformations like replaceSignedInst and refineInstruction would be blocked. The reason is there is no value state for the new instrution. This patch set the new instruction's value state with the removed instruction's value state. I believe it is correct bacause when we repalce a signed instruction with unsigned instruction, the value state is not changed. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D152337	2023-06-12 11:40:47 +08:00
Kazu Hirata	2509c93edd	[Transforms] Remove AddDiscriminatorsLegacyPass The last use was removed by: commit ae0987d242e266847f21f5fa1bffa97ce3eff586 Author: Kazu Hirata <kazu@google.com> Date: Sat Jun 10 13:51:35 2023 -0700 Differential Revision: https://reviews.llvm.org/D152636	2023-06-10 15:32:47 -07:00
Kazu Hirata	ae0987d242	[Transforms] Remove unused function createAddDiscriminatorsPass commit f7ca01333214f934c580c162afdee933e7430b6c Author: Nikita Popov <npopov@redhat.com> Date: Tue Feb 28 16:38:45 2023 +0100	2023-06-10 13:51:35 -07:00
Kazu Hirata	98183da637	[Transforms] Fix an unused variable warning This patch fixes: llvm/lib/Transforms/Utils/AMDGPUEmitPrintf.cpp:415:18: error: unused variable 'StBuff' [-Werror,-Wunused-variable]	2023-06-10 11:57:48 -07:00
Matt Arsenault	6d2e5c3445	LowerMemIntrinsics: Skip memmove with different address spaces This is a quick fix for an assert when the source and dest have different address spaces. The pointer compare needs to have matching types, but we can't generically introduce addrspacecast and we don't know if the address spaces alias.	2023-06-10 12:28:05 -04:00
Vikram	631c965483	[AMDGPU] Non hostcall printf support for HIP This is an alternative to currently existing hostcall implementation and uses printf buffer similar to OpenCL, The data stored in the buffer (i.e the data frame) for each printf call are as follows, 1. Control DWord - contains info regarding stream, format string constness and size of data frame 2. Hash of the format string (if constant) else the format string itself 3. Printf arguments (each aligned to 8 byte boundary) The format string Hash is generated using LLVM's MD5 Message-Digest Algorithm implementation and only low 64 bits are used. The implementation still uses amdhsa metadata and hash is stored as part of format string itself to ensure minimal changes in runtime. Differential Revision: https://reviews.llvm.org/D150427	2023-06-10 09:55:00 -04:00
Nuno Lopes	7ffeb8efe8	PromoteMem2Reg: use poison instead of undef as placeholder in phi entries from unreachable predecessors [NFC]	2023-06-10 11:19:03 +01:00
luxufan	25d9fde22e	[SCCP] Skip computing intrinsics if one of its args is unknownOrUndef For constant range supported intrinsics, we got consantrange from args no matter if they are unknown or undef. And the constant range computed from unknown or undef value state is full range. I think compute with full constant range is harmful since although we can do mergeIn after these args value state are changed, the merge operation of two constant ranges is union. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D152499	2023-06-10 15:48:46 +08:00
Teresa Johnson	e5479f27f2	[MemProf] Remove stale comment (NFC) We already do the simplification described in the FIXME comment.	2023-06-08 12:30:23 -07:00
Alexandros Lamprineas	475ddca56e	Reland "[FuncSpec] Replace LoopInfo with BlockFrequencyInfo" Using AvgLoopIters on any loop is too imprecise making the cost model favor users inside loop nests regardless of the actual tripcount. Differential Revision: https://reviews.llvm.org/D150375	2023-06-08 17:44:47 +01:00
Zain Jaffal	d65c0527ab	change checking for auto-init metadata to use `equalsStr` instead of casing MDOperand nodes. Since `MD_annotation` metadata now supports having mutliple strings in the annotation node. casing Operand to string directly will cause a crash. When checking if `MDOperand` equals str you can use `equalsStr` method. Reviewed By: serge-sans-paille Differential Revision: https://reviews.llvm.org/D152372	2023-06-08 15:58:01 +01:00
Nikita Popov	3c9cf023db	Revert "[LCSSA] Don't invalidate SCEV" This reverts commit 5cbb9f7a58d98ba432c6ddeefa581f6fc521315c. Causes verifier error reported at https://reviews.llvm.org/D149331#4387931.	2023-06-05 17:28:32 +02:00
Nikita Popov	143ed21b26	Revert "[LCSSA] Remove unused ScalarEvolution argument (NFC)" This reverts commit 5362a0d859d8e96b3f7c0437b7866e17a818a4f7. In preparation for reverting a dependent revision.	2023-06-05 16:45:38 +02:00
Mikhail Gudim	d37d4072f2	Reapply [SCCP] Constant propagation through freeze instruction Reapply with extra check for struct types, which caused buildbot failures last time. ----- The freeze instruction has not been handled by SCCPInstVisitor. This patch adds SCCPInstVisitor::visitFreezeInst(FreezeInst &I) method to handle freeze instructions. Differential Revision: https://reviews.llvm.org/D151659	2023-06-05 11:47:36 +02:00
zhongyunde	34d380e1f6	[IndVars] Add check of loop invariant for indirect use We usually only check direct use instruction of IV, while the bitcast of 'ptrtoint ptr to i64' doesn't affect the result, so go a step further. Fix https://github.com/llvm/llvm-project/issues/59633. Reviewed By: markoshorro Differential Revision: https://reviews.llvm.org/D151877	2023-06-03 22:29:09 +08:00
Matt Arsenault	1536e299e6	InstSimplify: Require instruction be parented Unlike every other analysis and transform, simplifyInstruction permitted operating on instructions which are not inserted into a function. This created an edge case no other code needs to really worry about, and limited transforms in cases that can make use of the context function. Only the inliner and a handful of other utilities were making use of this, so just fix up these edge cases. Results in some IR ordering differences since cloned blocks are inserted eagerly now. Plus some additional simplifications trigger (e.g. some add 0s now folded out that previously didn't).	2023-06-02 18:14:28 -04:00
Nikita Popov	0930ee8e86	[SimplifyLibCalls] Fix isKnownNeverInfinity() call after ORE removal Missed this in 97b5cc214aee48e30391bfcd2cde4252163d7406.	2023-06-02 09:20:57 +02:00
Alexandros Lamprineas	b1f41685a6	[IPSCCP] Decouple queries for function analysis results. The SCCPSolver is using a structure (AnalysisResultsForFn) where it keeps pointers to various analyses needed by the IPSCCP pass. These analyses are requested all at the same time, which can become problematic in some cases. For example one could be retrieved via getCachedAnalysis() prior to the actual execution of the analysis. In more detail: The IPSCCP pass uses a DomTreeUpdater to preserve the PostDominatorTree in case the PostDominatorTreeAnalysis had run before IPSCCP. Starting with commit 1b1232047e83b the IPSCCP pass may use BlockFrequencyAnalysis for some functions in the module. As a result, the PostDominatorTreeAnalysis may not run until the BlockFrequencyAnalysis has run, since the latter analysis depends on the former. Currently, we setup the DomTreeUpdater using getCachedAnalysis to retrieve a PostDominatorTree. This happens before BlockFrequencyAnalysis has run, therefore the cached analysis can become invalid by the time we use it. Differential Revision: https://reviews.llvm.org/D151666	2023-06-01 16:38:04 +01:00
Nikita Popov	223f9b096e	Revert "[SCCP] Constant propagation through freeze instruction" This reverts commit 559d47a1790e1a9f9b1f8838a443eb7624ef1ac7. Caused failure on sanitizer-aarch64-linux-bootstrap-ubsan: clang++: /b/sanitizer-aarch64-linux-bootstrap-ubsan/build/llvm-project/llvm/lib/Transforms/Utils/SCCPSolver.cpp:442: llvm::ValueLatticeElement &llvm::SCCPInstVisitor::getValueState(llvm::Value *): Assertion `!V->getType()->isStructTy() && "Should use getStructValueState"' failed.	2023-06-01 15:30:46 +02:00
Mikhail Gudim	559d47a179	[SCCP] Constant propagation through freeze instruction The freeze instruction has not been handled by SCCPInstVisitor. This patch adds SCCPInstVisitor::visitFreezeInst(FreezeInst &I) method to handle freeze instructions. Differential Revision: https://reviews.llvm.org/D151659	2023-06-01 15:06:59 +02:00
Nikita Popov	dc81e69eb1	[IndVars] Check expansion safety in makeIVComparisonInvariant() (PR62992) Make sure the invariant expressions are safe to expand. In particular, we should not speculative a trapping division into the preheader. Fixes https://github.com/llvm/llvm-project/issues/62992.	2023-05-31 11:21:35 +02:00
Nikita Popov	96a14f388b	Revert "[FuncSpec] Replace LoopInfo with BlockFrequencyInfo" As reported on https://reviews.llvm.org/D150375#4367861 and following, this change causes PDT invalidation issues. Revert it and dependent commits. This reverts commit 0524534d5220da5ecb2cd424a46520184d2be366. This reverts commit ced90d1ff64a89a13479a37a3b17a411a3259f9f. This reverts commit 9f992cc9350a7f7072a6dbf018ea07142ea7a7ed. This reverts commit 1b1232047e83b69561fd64b9547cb0a0d374473a.	2023-05-30 14:49:03 +02:00
Hongtao Yu	23da210624	[PseudoProbe] Do not force the calliste debug loc to inlined probes from __nodebug__ functions. For pseudo probes we would like to keep their original dwarf discriminator (either a zero or null) until the first FS-discriminator pass. The inliner is a violation of that, given that it assigns inlinee instructions with no debug info with the that of the callsite. This is being disabled in this patch. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D151568	2023-05-26 13:00:16 -07:00
Nikita Popov	d5c56c5162	[SCEVExpander] Remember phi nodes inserted by LCSSA construction SCEVExpander keeps track of all instructions it inserted. However, it currently misses some phi nodes created during LCSSA construction. Fix this by collecting these into another argument. This also removes the IRBuilder argument, which was added for essentially the same purpose, but only handles the root LCSSA nodes, not those inserted by SSAUpdater. This was reported as a regression on D149344, but the reduced test case also reproduces without it. Differential Revision: https://reviews.llvm.org/D150681	2023-05-25 09:34:19 +02:00
Joshua Cao	849d01bf3d	[LoopUnroll] Peel iterations based on select conditions This also allows us to peel loops with a `select`: ``` for (int i = 0; i <= N; ++i); f3(i == 0 ? a : b); // select instruction ``` into: ``` f3(a); // peel one iteration for (int i = 1; i <= N; ++i) f3(b); ``` Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D151052	2023-05-24 00:57:57 -07:00
Joshua Cao	0c316f0067	[BBUtils][NFC] Delete SplitBlockAndInsertIfThen with DT. The method is marked for deprecation. Delete the method and move all of its consumers to use the DomTreeUpdater version. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D149428	2023-05-23 21:02:37 -07:00
Shubham Sandeep Rastogi	775258d758	Add support for salvaging debug info from icmp instrcuctions. salvageDebugInfo is a function that allows us to reatin debug info for instructions that have been optimized out. Currently, it doesn't support salvaging the debug information from icmp instrcutions, but DWARF expressions can emulate an icmp by using the DWARF conditional expressions. This patch adds support for salvaging debug information from icmp instructions. Differential Revision: https://reviews.llvm.org/D150216	2023-05-23 15:31:31 -07:00
Simon Pilgrim	f2a6a97069	Fix MSVC "ignoring return value of function declared with 'nodiscard' attribute" warning. NFC.	2023-05-23 11:40:33 +01:00
khei4	1362dfe165	[SimplifyCFG] add nsw on SwitchToLookupTable index calculation on MinCaseVal subtraction Differential Revision: https://reviews.llvm.org/D146903 Reviewed By: nikic	2023-05-23 18:02:31 +09:00
Matt Arsenault	591ba11b93	Reapply "SimplifyLibCalls: Pass AssumptionCache to isKnownNeverInfinity" This reverts commit b357f379c81811409348dd0e0273a248b055bb7a.	2023-05-23 08:48:25 +01:00
Alexandros Lamprineas	1b1232047e	[FuncSpec] Replace LoopInfo with BlockFrequencyInfo. Using AvgLoopIters on any loop is too imprecise making the cost model favor users inside loop nests regardless of the actual tripcount. Differential Revision: https://reviews.llvm.org/D150375	2023-05-22 17:49:52 +01:00
eopXD	c8eb535aed	[1/11][IR] Permit load/store/alloca for struct of the same scalable vector type This patch-set aims to simplify the existing RVV segment load/store intrinsics to use a type that represents a tuple of vectors instead. To achieve this, first we need to relax the current limitation for an aggregate type to be a target of load/store/alloca when the aggregate type contains homogeneous scalable vector types. Then to adjust the prolog of an LLVM function during lowering to clang. Finally we re-define the RVV segment load/store intrinsics to use the tuple types. The pull request under the RVV intrinsic specification is riscv-non-isa/rvv-intrinsic-doc#198 --- This is the 1st patch of the patch-set. This patch is originated from D98169. This patch allows aggregate type (StructType) that contains homogeneous scalable vector types to be a target of load/store/alloca. The RFC of this patch was posted in LLVM Discourse. https://discourse.llvm.org/t/rfc-ir-permit-load-store-alloca-for-struct-of-the-same-scalable-vector-type/69527 The main changes in this patch are: Extend `StructLayout::StructSize` from `uint64_t` to `TypeSize` to accommodate an expression of scalable size. Allow `StructType:isSized` to also return true for homogeneous scalable vector types. Let `Type::isScalableTy` return true when `Type` is `StructType` and contains scalable vectors Extra description is added in the LLVM Language Reference Manual on the relaxation of this patch. Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Co-Authored-by: eop Chen <eop.chen@sifive.com> Reviewed By: craig.topper, nikic Differential Revision: https://reviews.llvm.org/D146872	2023-05-19 09:39:36 -07:00
Alina Sbirlea	b357f379c8	Revert "SimplifyLibCalls: Pass AssumptionCache to isKnownNeverInfinity" This reverts commit faa32734bf9a55fa3f91d91f6fdf0f8a951a9c0e. Revert due to test failures introduced by 73925ef8b0eacc6792f0e3ea21a3e6d51f5ee8b0	2023-05-18 23:31:51 -07:00
khei4	e21a90f091	[SimplifyCFG] add nuw/nsw on BuildLookuptable BitMap shiftwidth calculation Differential Revision: https://reviews.llvm.org/D150838	2023-05-19 14:10:05 +09:00
Matt Arsenault	faa32734bf	SimplifyLibCalls: Pass AssumptionCache to isKnownNeverInfinity Let's assumes work for determining no infinities.	2023-05-18 19:44:56 +01:00
Matt Arsenault	f42136d4d6	ValueTracking: Check instruction is in a parent in computeKnownFPClass For some reason the inliner calls simplifyInstruction with disembodied instructions. I consider this to be an API defect. Either the instruction should always be inserted prior to simplification, or we at least should pass in the new function for the context.	2023-05-18 12:21:47 +01:00
Kazu Hirata	b56fb22787	[Utils] Use LLVMContext::MD_loop (NFC)	2023-05-17 21:28:38 -07:00
Matt Arsenault	86d0b524f3	ValueTracking: Expand signature of isKnownNeverInfinity/NaN This is in preparation for replacing the implementation with a wrapper around computeKnownFPClass.	2023-05-16 20:42:58 +01:00
Bjorn Pettersson	3ce72cd5f6	Remove some includes that shouldn't be needed any longer This remove a bunch of #include statements in Scalar.cpp. I do not think those should be needed any longer (assuming that they once upon a time possibly were needed for legacy PM C bindings, but that is not supported any longer). Also removing some other #include statements not needed any longer due to deprecation of legacy PM. Differential Revision: https://reviews.llvm.org/D149438	2023-05-16 16:10:51 +02:00
Christian Ulmann	794b58b467	[IR] Drop const in DILocation::getMergedLocation This commit removes constness from DILocation::getMergedLocation and fixes all its users accordingly. Having constness on the parameters forced the return type to be const as well, which does force usage of `const_cast` when the location needs to be used in metadata nodes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D149942	2023-05-15 07:21:43 +00:00
Teresa Johnson	19c5740a5a	[MemProf] Set hot/cold new values with option Adds support to set the hot/cold new hint values with an option. Change the defaults slightly to make it easier to distinguish between compiler synthesized vs manually inserted calls to the interface. Differential Revision: https://reviews.llvm.org/D150488	2023-05-12 15:47:10 -07:00
spupyrev	2ee4ddae04	profilie inference changes for stale profile matching This diff facilitates a new stale profile matching in BOLT: D144500 This is a no-op for existing usages of profi (CSSPGO). Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D150466	2023-05-12 11:58:11 -07:00
Hongtao Yu	9272d0f079	[PseudoProbe] Clean up dwarf discriminator and avoid duplicating factor. A pseudo probe is created with dwarf line information shared with its nearest instruction. If the instruction comes with a dwarf discriminator, it will be shared with the probe as well. This can confuse the later FS-AFDO discriminator assignment pass. To fix this, I'm cleaning up the discriminator fields for probes when they are inserted. I also notice another possibility to change the discriminator field of pseudo probes in the pipeline before the FS discriminator assignment pass. That is the loop unroller, which assigns duplication factor to instruction being vectorized. I'm disabling that for pseudo probe intrinsics specifically, also for callsites with probes. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D148569	2023-05-10 11:26:23 -07:00
Nathan Sidwell	03fde97d49	[NFC] Refactor loop metadata movement * Use 'if (T v = expr)' idiom * llvm.loop is a fixed metadata ID Differential Revision: https://reviews.llvm.org/D150109 Reviewed By: kazu	2023-05-09 18:43:59 -04:00
Zain Jaffal	5d3a884229	[IRGen] Change annotation metadata to support inserting tuple of strings into annotation metadata array. Annotation metadata supports adding singular annotation strings to annotation block. This patch adds the ability to insert a tuple of strings into the metadata array. The idea here is that each tuple of strings represents a piece of information that can be all related. It makes it easier to parse through related metadata information given it will be contained in one tuple. For example in remarks any pass that implements annotation remarks can have different type of remarks and pass additional information for each. The original behaviour of annotation remarks is preserved here and we can mix tuple annotations and single annotations for the same instruction. Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D148328	2023-05-09 17:51:28 +03:00
Dávid Bolvanský	9892094a8b	[SLC] Use unsigned char to fix test failures on some platforms	2023-05-07 15:50:49 +02:00
Dávid Bolvanský	6321e4ddf7	[SimplifyLibCalls] Transform memchr(STR, C, N) to chain of ORs Motivation: ``` #include <string_view> size_t findFirst_ABCDEF(std::string_view sv) { return sv.find_first_of("ABCDEF"); } ``` memchr("ABCDEF", C, 6) != NULL -> (C == 'A' \|\| C == 'B' \|\| C == 'C' \|\| C == 'D' \|\| C == 'E' \|\| C == 'F') != 0 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D128011	2023-05-07 15:12:03 +02:00
Yeting Kuo	42601e116b	[ASAN] Support memory checks on vp.load/store. The patch adds new member MaybeEVL into InterestingMemoryOperand to represent the effective vector length for vp intrinsics. It may be extended for some target intrinsics in the future. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D146208	2023-05-07 19:30:16 +08:00
Nikita Popov	00ff746f6e	[LoopSimplify] Reduce amount of redundant SCEV invalidation (NFCI) We are simplifying the loop and all its children. Each time, we invalidate the top-most loop. The top-most loop is going to be the same every time. The cost of SCEV invalidation is largely independent from how data about the loop is actually cached, so we should avoid redundant invalidations.	2023-05-05 10:03:42 +02:00
Fangrui Song	3460f727ea	EntryExitInstrumenter: skip naked functions The asm in a naked function may reasonably expect the argument registers and the return address register (if present) to be live. When using -pg and -finstrument-functions, functions are instrumented by adding a function call to `_mcount/__cyg_profile_func_enter/__cyg_profile_func_enter_bare`/etc, which will clobber these registers. If the return address register is clobbered, the function will be unable to return to the caller, possibly causing an infinite loop. ``` __attribute__((naked)) void g() { #if defined(__arm__) __asm__("bx lr"); #else __asm__("ret"); #endif } int main() { g(); } ``` It seems that the only one reasonable way to handle the combination is to disable instrumenting for naked functions. GCC PR: https://gcc.gnu.org/PR109707 Close https://github.com/llvm/llvm-project/issues/62504 Reviewed By: hans Differential Revision: https://reviews.llvm.org/D149721	2023-05-04 09:21:18 -07:00

1 2 3 4 5 ...

6902 Commits