llvm-project

Author	SHA1	Message	Date
Wenju He	0bda12b5bc	[NewPM] Add OptimizerEarly module extension point VectorizerStart extension is module callback in old PM, but is function callback in new PM. We lack a module extension point between end of buildModuleSimplificationPipeline and the function optimization (including vectorizer) pipeline. So this patch adds a new module extension point before the function optimization pipeline. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D122296	2022-03-31 08:22:27 -07:00
Johannes Doerfert	a81fff8afd	Reapply "[Intrinsics] Add `nocallback` to the default intrinsic attributes" This reverts commit c5f789050daab25aad6770790987e2b7c0395936 and reapplies 7aea3ea8c3b33c9bb338d5d6c0e4832be1d09ac3 with additional test changes.	2022-03-25 09:36:50 -05:00
Arthur Eubanks	df1a216a62	[test] Set -verify-cfg-preserved=0 in new-pm-defaults.ll Matches other similar tests.	2022-03-23 11:09:41 -07:00
Arthur Eubanks	9bd66b312c	[PassManager][Coroutine] Run passes under -O0 conditionally and run GlobalDCE CoroSplit lowers various coroutine intrinsics. It's a CGSCC pass and CGSCC passes don't run on unreachable functions. Normally GlobalDCE will come along and delete unreachable functions, but we don't run GlobalDCE under -O0, so an unreachable function with coroutine intrinsics may never have CoroSplit run on it. This patch adds GlobalDCE when coroutines intrinsics are present. It also now runs all coroutine passes conditional when coroutine intrinsics are present. This should also solve the -O0 regression reported in D105877 due to LazyCallGraph construction. Fixes https://github.com/llvm/llvm-project/issues/54117 Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D122275	2022-03-23 11:03:26 -07:00
Arthur Eubanks	2362c4ecdc	Revert "Revert "[OpaquePtr][LLParser] Automatically detect opaque pointers in .ll files"" This reverts commit 9c96a6bbfdde665b5c2389100a15acdeea0f4145. Issues were already fixed at head.	2022-03-21 17:24:56 -07:00
Arthur Eubanks	10ffe80a24	Revert "Revert "[OpaquePointers][BitcodeReader] Enable -opaque-pointers if we see an opaque pointer type"" This reverts commit 80ec0ebfdc5692a58e0832125f2c6a991df9d63f. Issues were already fixed at head.	2022-03-21 17:24:56 -07:00
Mitch Phillips	80ec0ebfdc	Revert "[OpaquePointers][BitcodeReader] Enable -opaque-pointers if we see an opaque pointer type" This reverts commit 46216aa97729aa994dce79e75cd25707fb4b7874. Dependency https://reviews.llvm.org/D119482 broke the ASan buildbot.	2022-03-21 16:17:11 -07:00
Mitch Phillips	9c96a6bbfd	Revert "[OpaquePtr][LLParser] Automatically detect opaque pointers in .ll files" This reverts commit 295172ef51c6b9a73bc0fdcfd25f8c41ead9034a. Reason: Broke the ASan buildbot. More details are available on the original Phab review at https://reviews.llvm.org/D119482.	2022-03-21 16:04:36 -07:00
Arthur Eubanks	46216aa977	[OpaquePointers][BitcodeReader] Enable -opaque-pointers if we see an opaque pointer type This allows us to more easily test opaque pointers e.g. in the case of ThinLTO where we only have to pass -opaque-pointers to the frontend. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D122048	2022-03-21 09:19:26 -07:00
Arthur Eubanks	ddc702376a	[NewPM] Don't skip SCCs not in current RefSCC With D107249 I saw huge compile time regressions on a module (150s -> 5700s). This turned out to be due to a huge RefSCC in the module. As we ran the function simplification pipeline on functions in the SCCs in the RefSCC, some of those SCCs would be split out to their RefSCC, a child of the current RefSCC. We'd skip the remaining SCCs in the huge RefSCC because the current RefSCC is now the RefSCC just split out, then revisit the original huge RefSCC from the beginning. This happened many times because many functions in the RefSCC were optimizable to the point of becoming their own RefSCC. This patch makes it so we don't skip SCCs not in the current RefSCC so that we split out all the child RefSCCs on the first iteration of RefSCC. When we split out a RefSCC, we invalidate the original RefSCC and add the remainder of the SCCs into a new RefSCC in RCWorklist. This happens repeatedly until we finish visiting all SCCs, at which point there is only one valid RefSCC in RCWorklist from the original RefSCC containing all the SCCs that were not split out, and we visit that. For example, in the newly added test cgscc-refscc-mutation-order.ll, we'd previously run instcombine in this order: f1, f2, f1, f3, f1, f4, f1 Now it's: f1, f2, f3, f4, f1 This can cause more passes to be run in some specific cases, e.g. if f1<->f2 gets optimized to f1<-f2, we'd previously run f1, f2; now we run f1, f2, f2. This improves kimwitu++ compile times by a lot (12-15% for various -O3 configs): https://llvm-compile-time-tracker.com/compare.php?from=2371c5a0e06d22b48da0427cebaf53a5e5c54635&to=00908f1d67400cab1ad7bcd7cacc7558d1672e97&stat=instructions Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D121953	2022-03-18 14:16:29 -07:00
Florian Hahn	5ab421fb4e	[LICM] Add allowspeculation pass options. This adds a new option to control AllowSpeculation added in D119965 when using `-passes=...`. This allows reproducing #54023 using opt. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D121944	2022-03-18 16:51:57 +00:00
Arthur Eubanks	295172ef51	[OpaquePtr][LLParser] Automatically detect opaque pointers in .ll files This allows us to not have to specify -opaque-pointers when updating IR tests from typed pointers to opaque pointers. We detect opaque pointers in .ll files by looking for relevant tokens, either "ptr" or "*". Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D119482	2022-03-17 08:37:18 -07:00
Nikita Popov	07d5339819	[Tests] Update pipeline tests After 20531b3a6b091b8dd2e3992aec11ac10581e91e2, we now use a cached TTI instance, so the TargetIRAnalysis analysis run goes away.	2022-03-16 10:56:44 +01:00
Arthur Eubanks	4fc7c55fff	[NewPM] Actually recompute GlobalsAA before module optimization pipeline RequireAnalysis<GlobalsAA> doesn't actually recompute GlobalsAA. GlobalsAA isn't invalidated (unless specifically invalidated) because it's self-updating via ValueHandles, but can be imprecise during the self-updates. Rather than invalidating GlobalsAA, which would invalidate AAManager and any analyses that use AAManager, create a new pass that recomputes GlobalsAA. Fixes #53131. Differential Revision: https://reviews.llvm.org/D121167	2022-03-14 09:42:34 -07:00
Florian Hahn	f98125abb2	Revert "[PassManager] Add pretty stack entries before P->run() call." This reverts commit 128745cc2681c284bc6d0150a319673a6d6e8424. This increased compile-time unnecessarily. Revert this change and follow ups 2c7afadb4789 & add0c5856d5f. http://llvm-compile-time-tracker.com/compare.php?from=338dfcd60f843082bb589b287d890dbd9394eb82&to=128745cc2681c284bc6d0150a319673a6d6e8424&stat=instructions	2022-03-09 18:46:32 +00:00
Florian Hahn	add0c5856d	[PassManager] Another adjustment to account for different printing. On some system, there's extra `class` specifiers when printing pass manager types.	2022-03-09 14:09:23 +00:00
Florian Hahn	2c7afadb47	[PassManager] Adjust check lines to account for different printing. Relax some checks to account for types being printed differently on different platforms, surfaced by different buildbots.	2022-03-09 13:20:59 +00:00
Florian Hahn	128745cc26	[PassManager] Add pretty stack entries before P->run() call. This patch adds PrettyStackEntries before running passes. The entries include the pass name and the IR unit the pass runs on. The information is used the print additional information when a pass crashes, including the name and a reference to the IR unit on which it crashed. This is similar to the behavior of the legacy pass manager. The improved stack trace now includes: Stack dump: 0. Program arguments: bin/opt -loop-vectorize -force-vector-width=4 crash.ll 1. Running pass 'ModuleToFunctionPassAdaptor' on module 'crash.ll' 2. Running pass 'LoopVectorizePass' on function '@a' Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D120993	2022-03-09 13:01:09 +00:00
Martin Storsjö	9e2236fb7b	Reapply [lit] Read command stdout/stderr as text on Windows This takes care of normalizing newlines back to single LF instead of CRLF. Fix up a couple tests that accidentally pass binary data to stdout. Differential Revision: https://reviews.llvm.org/D120623	2022-03-03 13:31:31 +02:00
Martin Storsjö	f570da28e6	Revert "[lit] Read command stdout/stderr as text on Windows" This reverts commit 640e45b9b2205ba18a4a562a9404418d8b90abda. That commit seemed to quite conclusively break a number of buildbots.	2022-03-01 14:43:55 +02:00
Martin Storsjö	640e45b9b2	[lit] Read command stdout/stderr as text on Windows This takes care of normalizing newlines back to single LF instead of CRLF. This on itself breaks on a couple tests that accidentally seem to be writing binary data to stdout; make sure those cases are piped to /dev/null instead of actually written to a terminal. Differential Revision: https://reviews.llvm.org/D120623	2022-03-01 14:24:02 +02:00
Kirill Stoimenov	b7fd30eac3	[ASan] Removed unused AddressSanitizerPass functional pass. This is a clean-up patch. The functional pass was rolled into the module pass in D112732. Reviewed By: vitalybuka, aeubanks Differential Revision: https://reviews.llvm.org/D120674	2022-03-01 00:41:29 +00:00
Elia Geretto	5e4b68cc3b	[NewPM] Fix tests for OptimizerLast extension point This PR fixes the tests for the `OptimizerLast` extension point in the new pass manager. The extension point uses module passes, but it was being tested with a function pass. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D120581	2022-02-25 14:49:19 -08:00
Elia Geretto	942efa5927	[NewPM] Add extension points to LTO pipeline in PassBuilder This PR adds two extension points to the default LTO pipeline in PassBuilder, one at the beginning and one at the end. These two extension points already existed in the old pass manager, the aim is to replicate the same functionality in the new one. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D120491	2022-02-25 14:48:54 -08:00
Roman Lebedev	371fcb720e	[SimplifyCFG][PhaseOrdering] Defer lowering switch into an integer range comparison and branch until after at least the IPSCCP That transformation is lossy, as discussed in https://github.com/llvm/llvm-project/issues/53853 and https://github.com/rust-lang/rust/issues/85133#issuecomment-904185574 This is an alternative to D119839, which would add a limited IPSCCP into SimplifyCFG. Unlike lowering switch to lookup, we still want this transformation to happen relatively early, but after giving a chance for the things like CVP to do their thing. It seems like deferring it just until the IPSCCP is enough for the tests at hand, but perhaps we need to be more aggressive and disable it until CVP. Fixes https://github.com/llvm/llvm-project/issues/53853 Refs. https://github.com/rust-lang/rust/issues/85133 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D119854	2022-02-17 12:13:55 +03:00
Joseph Huber	9d3a47576c	[PassBuilder] Add OpenMPOpt to default LTO pipeline The LTO support for OpenMP offloading allows us to run the OpenMPOpt pass during the LTO pipeline. This patch introduces an early run of the Module pass and a late run of the CGSCC pass. These are quick no-ops if there is no OpenMP in the module. Depends on D118198 Differential Revision: https://reviews.llvm.org/D118611	2022-01-31 23:11:43 -05:00
Bjorn Pettersson	464be7af28	[NewPM] Add debugify and check-debugify to the PassRegistry Adding -debugify and -check-debugify in the PassRegistry will make sure the passes are listed properly by -print-pipeline-passes as well as -print-passes. It also allows removal of the custom pipeline parsing callback that has been used in the NewPMDriver. Differential Revision: https://reviews.llvm.org/D118369	2022-01-27 19:23:56 +01:00
Nikita Popov	97916673d4	[IR] Support ifuncs in opaque pointer mode Relax the type assertion for opaque pointers, and enumerate the value type in TypeFinder and ValueEnumerator.	2022-01-27 13:01:33 +01:00
Kevin Athey	7ea175d1c6	Add 'eager-checks' as a module parameter to MSAN. This creates a way to configure MSAN to for eager checks that will be leveraged by the introduction of a clang flag (-fsanitize-memory-param-retval). This is redundant with the existing flag: -mllvm -msan-eager-checks. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D116855	2022-01-11 14:30:49 -08:00
Philip Reames	0b09313cd5	[funcattrs] Infer writeonly argument attribute [part 2] This builds on the code from D114963, and extends it to handle calls both direct and indirect. With the revised code structure (from series of previously landed NFCs), this is pretty straight forward. One thing to note is that we can not infer writeonly for arguments which might be captured. If the pointer can be read back by the caller, and then read through, we have no way to track that. This is the same restriction we have for readonly, except that we get no mileage out of the "callee can be readonly" exception since a writeonly param on a readonly function is either a) readnone or b) UB. This means we can't actually infer much unless nocapture has already been inferred. Differential Revision: https://reviews.llvm.org/D115003	2022-01-04 09:07:54 -08:00
Jamie Schmeiser	864ca42483	Revert "A new hidden option exec-on-ir-change=exe that calls exe each time IR changes" This reverts commit dff0e8b4ff13af311512c369d059f1e095e83a60. Failing build: https://lab.llvm.org/buildbot/#/builders/42/builds/2800	2021-12-16 10:54:11 -05:00
Jamie Schmeiser	dff0e8b4ff	A new hidden option exec-on-ir-change=exe that calls exe each time IR changes Summary: A new option exec-on-ir-changed is defined that allows one to specify an exe that is called after each pass in the opt pipeline that changes the IR. The exec-on-ir-change=exe option saves the IR in a temporary file and calls exe with the name of the file and the name of the pass that just changed it after each pass alters the IR. exe is also called with the initial IR. This can be used, for example, to determine which pass corrupts the IR by having exe as a script that calls llc and runs a test to see after which pass the results change. The print-changed filtering options are respected. Note that this is only supported with the new pass manager. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) Differential Revision: https://reviews.llvm.org/D110776	2021-12-16 09:01:49 -05:00
Florian Hahn	acea6e9cfa	[Passes] Only run extra vector passes if loops have been vectorized. This patch uses a similar trick as in D113947 to only run the extra passes after vectorization on functions where loops have been vectorized. The reason for running the 'extra vector passes' is simplification/unswitching of the runtime checks created by LV, there should be no need to run them if nothing got vectorized To do that, a new dummy analysis ShouldRunExtraVectorPasses has been added. If loops have been vectorized for a function, LV will cache the analysis. At the moment it uses MadeCFGChanges as proxy for loop vectorized, which isn't perfect (it could be too aggressive, e.g. because no runtime checks have been added), but should be good enough for now. The extra passes are now managed by a new FunctionPassManager that runs its passes only if ShouldRunExtraVectorPasses has been cached. Without this patch, `-extra-vectorizer-passes` has the following compile-time impact: NewPM-O3: +4.86% NewPM-ReleaseThinLTO: +3.56% NewPM-ReleaseLTO-g: +7.17% http://llvm-compile-time-tracker.com/compare.php?from=ead3979a92fc33add4710c4510d6906260dcb4ad&to=c292da649e2c6e88a31e702fdc474727d09c72bc&stat=instructions With this patch, that gets reduced to NewPM-O3: +1.43% NewPM-ReleaseThinLTO: +1.00% NewPM-ReleaseLTO-g: +1.58% http://llvm-compile-time-tracker.com/compare.php?from=ead3979a92fc33add4710c4510d6906260dcb4ad&to=e67d86b57810011cf285eb9aa1944781be6096f0&stat=instructions It is probably still too high to enable by default, but much better. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D115052	2021-12-10 11:42:45 +00:00
Arthur Eubanks	c4ce426504	Revert "A new hidden option test-changed=exe that calls exe after each time IR changes" This reverts commit f9235e45fd1f5ca21f95105427184a6afd0f9d95. Causes breakages on Windows: http://45.33.8.238/win/50453/step_11.txt.	2021-12-08 13:58:10 -08:00
Jamie Schmeiser	f9235e45fd	A new hidden option test-changed=exe that calls exe after each time IR changes Summary: A new option test-changed is defined that allows one to specify an exe that is called after each pass in the opt pipeline that changes the IR. The test-changed=exe option saves the IR in a temporary file and calls exe with the name of the file and the name of the pass that just changed it after each pass alters the IR. exe is also called with the initial IR. This can be used, for example, to determine which pass corrupts the IR by having exe as a script that calls llc and runs a test to see after which pass the results change. The print-changed filtering options are respected. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) Differential Revision: https://reviews.llvm.org/D110776	2021-12-08 14:23:31 -05:00
Nikita Popov	8bd62119f9	[NewPM] Test more options in pipeline test (NFC) As suggested on D115098, this tests the positioning of HotColdSplitting, IROutliner and MergeFunctions in the optimization pipeline.	2021-12-04 17:30:30 +01:00
Anton Afanasyev	c34d157fc7	[Passes] Move AggressiveInstCombine after InstCombine Swap AIC and IC neighbouring in pipeline. This looks more natural and even almost has no effect for now (three slightly touched tests of test-suite). Also this could be the first step towards merging AIC (or its part) to -O2 pipeline. After several changes in AIC (like D108091, D108201, D107766, D109515, D109236) there've been observed several regressions (like PR52078, PR52253, PR52289) that were fixed in different passes (see D111330, D112721) by extending their functionality, but these regressions were exposed since changed AIC prevents IC from making some of early optimizations. This is common problem and it should be fixed by just moving AIC after IC which looks more logically by itself: make aggressive instruction combining only after failed ordinary one. Fixes PR52289 Reviewed By: spatel, RKSimon Differential Revision: https://reviews.llvm.org/D113179	2021-12-04 14:22:43 +03:00
Florian Hahn	31413c4555	[Passes] Adjust SLPVectorizer placement in test. SLPVectorizer runs after the extra vector passes.	2021-12-03 20:27:09 +00:00
Florian Hahn	5da920bf3a	[Passes] Improve opt-pipeline-vector-passes.ll test. Add -NOT lines to ensure that no extra passes are run if -extra-vectorizer-passes is not specified. Also add a loop that actually gets vectorized in preparation for D115052.	2021-12-03 20:15:59 +00:00
Florian Hahn	770a50b28c	[AnnotationRemarks] Support generating annotation remarks with -O0. This matches the legacy pass manager behavior. If remarks are not enabled the pass is effectively a no-op.	2021-12-02 15:01:02 +00:00
Jamie Schmeiser	2a1bf9e0cf	Expand testing of necessary features for print-changed=dot-cfg. Summary: Expand the testing for whether the lit tests for print-changed=dot-cfg are supported to include checking whether dot supports pdf output. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: hvdijk (Harald van Dijk) Differential Revision: https://reviews.llvm.org/D113187	2021-12-02 08:16:18 -05:00
Markus Lavin	ce22b7f17b	[NPM] Fix LoopNestPasses in -print-pipeline-passes Fix printing of LoopNestPasses when using the opt pipeline printer option -print-pipeline-passes. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D114771	2021-12-01 07:57:17 +01:00
Bjorn Pettersson	8ebb3eac02	[test] Use -passes syntax when specifying pipeline in some more tests The legacy PM is deprecated, so update a bunch of lit tests running opt to use the new PM syntax when specifying the pipeline. In this patch focus has been put on test cases for ConstantMerge, ConstraintElimination, CorrelatedValuePropagation, GlobalDCE, GlobalOpt, SCCP, TailCallElim and PredicateInfo. Differential Revision: https://reviews.llvm.org/D114516	2021-11-27 09:52:55 +01:00
Arthur Eubanks	e3e25b5112	[NewPM] Add option to prevent rerunning function pipeline on functions in CGSCC adaptor In a CGSCC pass manager, we may visit the same function multiple times due to SCC mutations. In the inliner pipeline, this results in running the function simplification pipeline on a function multiple times even if it hasn't been changed since the last function simplification pipeline run. We use a newly introduced analysis to keep track of whether or not a function has changed since the last time the function simplification pipeline has run on it. If we see this analysis available for a function in a CGSCCToFunctionPassAdaptor, we skip running the function passes on the function. The analysis is queried at the end of the function passes so that it's available after the first time the function simplification pipeline runs on a function. This is a per-adaptor option so it doesn't apply to every adaptor. The goal of this is to improve compile times. However, currently we can't turn this on by default at least for the higher optimization levels since the function simplification pipeline is not robust enough to be idempotent in many cases, resulting in performance regressions if we stop running the function simplification pipeline on a function multiple times. We may be able to turn this on for -O1 in the near future, but turning this on for higher optimization levels would require more investment in the function simplification pipeline. Heavily inspired by D98103. Example compile time improvements with flag turned on: https://llvm-compile-time-tracker.com/compare.php?from=998dc4a5d3491d2ae8cbe742d2e13bc1b0cacc5f&to=5c27c913687d3d5559ef3ab42b5a3d513531d61c&stat=instructions Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D113947	2021-11-17 09:06:46 -08:00
Arthur Eubanks	19867de9e7	[NewPM] Only invalidate modified functions' analyses in CGSCC passes + turn on eagerly invalidate analyses Previously, any change in any function in an SCC would cause all analyses for all functions in the SCC to be invalidated. With this change, we now manually invalidate analyses for functions we modify, then let the pass manager know that all function analyses should be preserved since we've already handled function analysis invalidation. So far this only touches the inliner, argpromotion, function-attrs, and updateCGAndAnalysisManager(), since they are the most used. This is part of an effort to investigate running the function simplification pipeline less on functions we visit multiple times in the inliner pipeline. However, this causes major memory regressions especially on larger IR. To counteract this, turn on the option to eagerly invalidate function analyses. This invalidates analyses on functions immediately after they're processed in a module or scc to function adaptor for specific parts of the pipeline. Within an SCC, if a pass only modifies one function, other functions in the SCC do not have their analyses invalidated, so in later function passes in the SCC pass manager the analyses may still be cached. It is only after the function passes that the eager invalidation takes effect. For the default pipelines this makes sense because the inliner pipeline runs the function simplification pipeline after all other SCC passes (except CoroSplit which doesn't request any analyses). Overall this has mostly positive effects on compile time and positive effects on memory usage. https://llvm-compile-time-tracker.com/compare.php?from=7f627596977624730f9298a1b69883af1555765e&to=39e824e0d3ca8a517502f13032dfa67304841c90&stat=instructions https://llvm-compile-time-tracker.com/compare.php?from=7f627596977624730f9298a1b69883af1555765e&to=39e824e0d3ca8a517502f13032dfa67304841c90&stat=max-rss D113196 shows that we slightly regressed compile times in exchange for some memory improvements when turning on eager invalidation. D100917 shows that we slightly improved compile times in exchange for major memory regressions in some cases when invalidating less in SCC passes. Turning these on at the same time keeps the memory improvements while keeping compile times neutral/slightly positive. Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D113304	2021-11-15 14:44:53 -08:00
Arthur Eubanks	1d8750c3da	[NFC] Rename GVN -> GVNPass and SROA -> SROAPass To be more consistent with other pass struct names. There are still more passes that don't end with "Pass", but these are the important ones. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D112935	2021-11-09 10:35:58 -08:00
Michael Liao	bf225939bc	[InferAddressSpaces] Support assumed addrspaces from addrspace predicates. - CUDA cannot associate memory space with pointer types. Even though Clang could add extra attributes to specify the address space explicitly on a pointer type, it breaks the portability between Clang and NVCC. - This change proposes to assume the address space from a pointer from the assumption built upon target-specific address space predicates, such as `__isGlobal` from CUDA. E.g., ``` foo(float *p) { __builtin_assume(__isGlobal(p)); // From there, we could assume p is a global pointer instead of a // generic one. } ``` This makes the code portable without introducing the implementation-specific features. Note that NVCC starts to support __builtin_assume from version 11. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D112041	2021-11-08 16:51:57 -05:00
Whitney Tsang	93421108d2	Add NoOpLoopNestPass and LOOPNEST_PASS macro Having a NoOpLoopNestPass can ensure that only outermost loop is invoked for a LoopNestPass with a lit test. There are some existing passes that are implemented as LoopNestPass, but they are still using LOOP_PASS macro. It would be easier to identify LoopNestPasses with a LOOPNEST_PASS macro. Differential Revision: https://reviews.llvm.org/D113185	2021-11-05 16:11:48 +00:00
Arthur Eubanks	7175886a0f	[NewPM] Make eager analysis invalidation per-adaptor Follow-up change to D111575. We don't need eager invalidation on every adaptor. Most notably, adaptors running passes that use very few analyses, or passes that purely invalidate specific analyses. Also allow testing of this via a pipeline string "function<eager-inv>()". The compile time/memory impact of this is very comparable to D111575. https://llvm-compile-time-tracker.com/compare.php?from=9a2eec512a29df45c90c2fcb741e9d5c693b1383&to=b9f20bcdea138060967d95a98eab87ce725b22bb&stat=instructions Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D113196	2021-11-04 17:16:11 -07:00
Kirill Stoimenov	b3145323b5	Revert "[ASan] Process functions in Asan module pass" This reverts commit 76ea87b94e5cba335d691e4e18e3464ad45c8b52. Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D113129	2021-11-03 18:01:01 +00:00

1 2 3 4 5 ...

905 Commits