llvm-project

Author	SHA1	Message	Date
Hans Wennborg	fd0ca2c4e9	Revert "Reland "[pgo] Avoid introducing relocations by using private alias"" This caused lld on mac to assert when building instrumented clang (or instrumented code in general). See comment on the code review for reproducer. > In many cases, we can use an alias to avoid a symbolic relocations, > instead of using the public, interposable symbol. When the instrumented > function is in a COMDAT, we can use a hidden alias, and still avoid > references to discarded sections. > > New compiler-rt tests are Linux only for now. > > Previous versions of this patch allowed the compiler to name the > generated alias, but that would only be valid when the functions were > local. Since the alias may be used across TUs we use a more > deterministic naming convention, and add a `.local` suffix to the alias > name just as we do for relative vtables aliases. > > Reviewed By: phosek > > Differential Revision: https://reviews.llvm.org/D137982 This reverts commit c42e50fede53bbcce79095e7c8115f26826c81ae.	2022-12-21 15:29:00 +01:00
Paul Kirth	c42e50fede	Reland "[pgo] Avoid introducing relocations by using private alias" In many cases, we can use an alias to avoid a symbolic relocations, instead of using the public, interposable symbol. When the instrumented function is in a COMDAT, we can use a hidden alias, and still avoid references to discarded sections. New compiler-rt tests are Linux only for now. Previous versions of this patch allowed the compiler to name the generated alias, but that would only be valid when the functions were local. Since the alias may be used across TUs we use a more deterministic naming convention, and add a `.local` suffix to the alias name just as we do for relative vtables aliases. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D137982	2022-12-19 18:45:58 +00:00
Paul Kirth	7597beac9d	Revert "Reland "[pgo] Avoid introducing relocations by using private alias"" This reverts commit e89e8dcfad364d23515de25ac87d26dfe25badbb. Some configurations still have references to discarded sections. I'm reverting until these can be addressed.	2022-12-09 04:33:09 +00:00
Paul Kirth	e89e8dcfad	Reland "[pgo] Avoid introducing relocations by using private alias" In many cases, we can use an alias to avoid a symbolic relocations, instead of using the public, interposable symbol. When the instrumented function is in a COMDAT, we can use a hidden alias, and still avoid references to discarded sections. This version makes the new runtime test a Linux only test. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D137982	2022-12-09 01:28:24 +00:00
Paul Kirth	7710e0e270	Revert "Reland "[pgo] Avoid introducing relocations by using private alias"" This reverts commit 3cfaea2d499970c2fd14389ade59f792156be0f8. The runtime test fails on Mac.	2022-12-09 00:23:57 +00:00
Paul Kirth	3cfaea2d49	Reland "[pgo] Avoid introducing relocations by using private alias" In many cases, we can use an alias to avoid a symbolic relocations, instead of using the public, interposable symbol. When the instrumented function is in a COMDAT, we can use a hidden alias, and still avoid references to discarded sections. This disables the failing runtime test on Windows, since the compiler options (-fPIC) are unsupported on that platform. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D137982	2022-12-08 21:03:37 +00:00
Paul Kirth	0adc577c11	Revert "Reland "[pgo] Avoid introducing relocations by using private alias""" This reverts commit 071c39df8632561b599f7b1debd81b3cf6b5b396. One of the new runtimes tests causes a failure with MSVC, so I'm reverting until the test can be fixed.	2022-12-08 18:15:55 +00:00
Paul Kirth	071c39df86	Reland "[pgo] Avoid introducing relocations by using private alias"" In many cases, we can use an alias to avoid a symbolic relocations, instead of using the public, interposable symbol. When the instrumented function is in a COMDAT, we can use a hidden alias, and still avoid references to discarded sections. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D137982	2022-12-08 17:37:17 +00:00
Matt Arsenault	be2ca02555	PGOProfile: Only keep opaque pointer tests The typed pointer tests were already converted, these were identical now.	2022-12-05 11:42:11 -05:00
Matt Arsenault	a74c5707be	Fix some test files with executable permissions	2022-12-02 17:12:03 -05:00
Rong Xu	0b2473936d	[PGO][indirect-call-promotion] Add extra parameter check for musttail callsite Indirect-call-promtion for musttail callsite needs stricter type check with parameter. This patch adds the needed extra parameter type check. Differential Revision: https://reviews.llvm.org/D139051	2022-12-02 10:50:55 -08:00
Fangrui Song	34ade57019	PGOProfile: Convert tests to opaque pointers	2022-11-27 18:13:09 -08:00
Matt Arsenault	16663d8ab3	IndirectCallPromotion: Fix illegal promotion with opaque pointers This was doing a type check on the argument types and skipping all other safety checks if they matched.	2022-11-27 09:39:42 -05:00
Rong Xu	6327d263f5	[CHR] Add a threshold for the code duplication ControlHeightReduction (CHR) clones the code region to reduce the branches in the hot code path. The number of clones is linear to the depth of the region. Currently it does not have control over the code size increase. We are seeing one ~9000 BB functions get expanded to ~250000 BBs, an 25x increase. This creates a big compile time issue for the downstream optimizations. This patch adds a cap for number of clones for one region. Differential Revision: https://reviews.llvm.org/D138333	2022-11-22 11:36:40 -08:00
Paul Kirth	0cc8752fa1	Revert "[pgo] Avoid introducing relocations by using private alias" This reverts commit 2b8917f8ad792c4c484975b82039bbdebe556881. This breaks with lld and gold	2022-11-16 03:38:14 +00:00
Paul Kirth	2b8917f8ad	[pgo] Avoid introducing relocations by using private alias Instead of using the public, interposable symbol, we can use a private alias and avoid relocations and addends. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D137982	2022-11-15 16:05:24 +00:00
Matthew Voss	a4b543a5a5	[llvm-profdata] Check for all duplicate entries in MemOpSize table Previously, we only checked for duplicate zero entries when merging a MemOPSize table (see D92074), but a user recently provided a reproducer demonstrating that other entries can also be duplicated. As demonstrated by the test in this patch, PGOMemOPSizeOpt can potentially generate invalid IR for non-zero, non-consecutive duplicate entries. This seems to be a rare case, since the duplicate entry is often below the threshold, but possible. This patch extends the existing warning to check for any duplicate values in the table, both in the optimization and in llvm-profdata. Differential Revision: https://reviews.llvm.org/D136211	2022-11-04 17:08:54 -07:00
Rong Xu	8acb881c19	[PGO] Add a threshold for number of critical edges in PGO For some auto-generated sources, we have a huge number of critical edges (like from switch statements). We have seen instance of 183777 critical edges in one function. After we split the critical edges in PGO instrumentation/profile-use pass, the CFG is so large that we have compiler time issues in downstream passes (like in machine CSE and block placement). Here I add a threshold to skip PGO if the number of critical edges are too large. The threshold is large enough so that it will not affect the majority of PGO compilation. Also sync the logic for skipping instrumentation and profile-use. I think this is the correct thing to do. Differential Revision: https://reviews.llvm.org/D137184	2022-11-02 10:14:04 -07:00
Alex Brachet	443e2a10f6	Reland "[PGO] Make emitted symbols hidden" This was reverted because it was breaking when targeting Darwin which tried to export these symbols which are now hidden. It should be safe to just stop attempting to export these symbols in the clang driver, though Apple folks will need to change their TAPI allow list described in the commit where these symbols were originally exported `f538018562` Then reverted again because it broke tests on MacOS, they should be fixed now. Bug: https://github.com/llvm/llvm-project/issues/58265 Differential Revision: https://reviews.llvm.org/D135340	2022-10-26 17:13:05 +00:00
Nico Weber	76745d2b58	Revert "[PGO] Make emitted symbols hidden" This reverts commit 04877284b4592e9286cab43467662c1b4ff81861. Looks like this is still breaking the test Profile-x86_64 :: instrprof-darwin-dead-strip.c (see comment on https://reviews.llvm.org/D135340).	2022-10-25 08:54:47 -04:00
Alex Brachet	04877284b4	[PGO] Make emitted symbols hidden This was reverted because it was breaking when targeting Darwin which tried to export these symbols which are now hidden. It should be safe to just stop attempting to export these symbols in the clang driver, though Apple folks will need to change their TAPI allow list described in the commit where these symbols were originally exported `f538018562` Bug: https://github.com/llvm/llvm-project/issues/58265 Differential Revision: https://reviews.llvm.org/D135340	2022-10-24 19:05:10 +00:00
Zequan Wu	82035ec777	Revert "[PGO] Make emitted symbols hidden" This reverts commit ecac223b0e4b05a65cf918f90824380db6b9ce64. The commit causes instrprof-darwin-dead-strip.c to fail on mac.	2022-10-14 15:23:26 -07:00
Wolfgang Pieb	b43a1d1bd9	[PGO] Do not create block count annotations when all weights are 0, avoiding an assertion. A BB with a nonzero count, whose successor blocks all have 0 counts, could cause an assertion. Don't create any branch weights in this case. Reviewed By: xur Differential Revision: https://reviews.llvm.org/D134203	2022-10-13 14:57:42 -07:00
Alex Brachet	ecac223b0e	[PGO] Make emitted symbols hidden This was reverted because it was breaking when targeting Darwin which tried to export these symbols which are now hidden. It should be safe to just stop attempting to export these symbols in the clang driver, though Apple folks will need to change their TAPI allow list described in the commit where these symbols were originally exported `f538018562` Bug: https://github.com/llvm/llvm-project/issues/58265 Differential Revision: https://reviews.llvm.org/D135340	2022-10-13 19:47:15 +00:00
Alex Brachet	deb82d4a20	Revert "[PGO] Make emitted symbols hidden" This reverts commit 4ea1a647ff0973c683dd71fec77e6fe7f6dfd2ca. This breaks on Darwin which tries to export these symbols `ebb258d3b0/clang/lib/Driver/ToolChains/Darwin.cpp (L1363)` I'll try to reland which that removed and approval from Apple folks.	2022-10-10 14:37:59 +00:00
Alex Brachet	4ea1a647ff	[PGO] Make emitted symbols hidden Differential Revision: https://reviews.llvm.org/D135340	2022-10-06 18:28:16 +00:00
Teresa Johnson	b1926f308f	Restore "[MemProf] Memprof profile matching and annotation" This reverts commit 794b7ea960ccc3222f2af582efadbc5e5c464292, and thus restores commit a212d8da94d08e229aa8d65283e4b116310bba10, and follow on fixes 0cd6763fa93159b84d70a5bb602c24996acaafaa, e9ff53d42feac7fc157718523275619a8106f2f3, and 37c6a25e9ab230e5e21fa34e246d9fec55275df0. Use a hash function (BLAKE3) instead of hash_combine/hash_code which are not guaranteed to be stable across executions. Additionally, it adds a "REQUIRES: x86_64-linux" to the tests that have raw profile inputs to avoid failures on big endian bots. Reviewers: snehasish, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D128142	2022-09-23 11:38:47 -07:00
Teresa Johnson	794b7ea960	Revert "[MemProf] Memprof profile matching and annotation" This reverts commit a212d8da94d08e229aa8d65283e4b116310bba10, and follow on fixes 0cd6763fa93159b84d70a5bb602c24996acaafaa, e9ff53d42feac7fc157718523275619a8106f2f3, and 37c6a25e9ab230e5e21fa34e246d9fec55275df0. After re-reading the documentation for hash_combine, I don't think this is the appropriate hash function to use for computing the hash to use as a stack id in the metadata, since it is not guaranteed to produce stable values across executions. I have not hit this problem, but plan to switch to using an MD5 hash. I am hitting an issue with one of the bots (https://lab.llvm.org/buildbot/#/builders/171/builds/20732) where the values produced are only the lower 32 bits of the expected hash values, however, which I assume is related to the implementation of hash_combine and hash_code. I believe I fixed all of the other bot failures with the follow on fixes, which I'll merge into the new version before reapplying.	2022-09-22 16:08:03 -07:00
Teresa Johnson	0cd6763fa9	[MemProf] Fix buildbot errors by requiring zlib This should fix errors like the following from using the raw memprof_pgo.profraw profile in a212d8da94d08e229aa8d65283e4b116310bba10: profile uses zlib compression but the profile reader was built without zlib support E.g. https://lab.llvm.org/buildbot/#/builders/196/builds/18536	2022-09-22 13:13:54 -07:00
Teresa Johnson	a212d8da94	[MemProf] Memprof profile matching and annotation Profile matching and IR annotation for memprof profiles. See also related RFCs: RFC: Sanitizer-based Heap Profiler [1] RFC: A binary serialization format for MemProf [2] RFC: IR metadata format for MemProf [3]* * Note that the IR metadata format has changed from the RFC during implementation, as described in the preceeding patch adding the basic metadata and verification support. The matching is performed during the normal PGO annotation phase, to ensure that the inlines applied in the IR at that point are a subset of the inlines in the profiled binary and thus reflected in the profile's call stacks. This is important because the call frames are associated with functions in the profile based on the inlining in the symbolized call stacks, and this simplifies locating the subset of profile data relevant for matching onto each function's IR. The PGOInstrumentationUse pass is enhanced to perform matching for whatever combination of memprof and regular PGO profile data exists in the profile. Using the utilities introduced in D128854: The memprof profile data for each context is converted to "cold" or "notcold" based on parameterized thresholds for size, access count, and lifetime. The memprof allocation contexts are trimmed to the minimal amount of context required to uniquely identify whether the context is cold or not cold. For allocations where all profiled contexts have the same allocation type, no memprof metadata is attached and instead the allocation call is directly annotated with an attribute specifying the alloction type. This is the same attributed that will be applied to allocation calls once cloned for different contexts, and later used during LibCall simplification to emit allocation hints [4]. Depends on D128141 and D128854. [1] https://lists.llvm.org/pipermail/llvm-dev/2020-June/142744.html [2] https://lists.llvm.org/pipermail/llvm-dev/2021-September/153007.html [3] https://discourse.llvm.org/t/rfc-ir-metadata-format-for-memprof/59165 [4] `ab87cf382d` Differential Revision: https://reviews.llvm.org/D128142	2022-09-22 12:48:31 -07:00
Rong Xu	0caa4a9559	[PGO] Support PGO annotation of CallBrInst We currently instrument CallBrInst but do not annotate it with the branch weight. This patch enables PGO annotation of CallBrInst. Differential Revision: https://reviews.llvm.org/D133040	2022-09-01 14:13:50 -07:00
Rong Xu	d7ef0c3970	[llvm-profdata] Improve profile supplementation Current implementation promotes a non-cold function in the SampleFDO profile into a hot function in the FDO profile. This is too aggressive. This patch promotes a hot functions in the SampleFDO profile into a hot function, and a warm function in SampleFDO into a warm function in FDO. Differential Revision: https://reviews.llvm.org/D132601	2022-08-29 16:50:42 -07:00
Ellis Hoag	6f61594d8c	[InstrProf] Add option to avoid instrumenting small functions If a function only has a few instructions, instrumentation can significantly increase the size and performance overhead of that function. Add the `-pgo-function-size-threshold` option to select a size threshold so these small functions are not instrumented. A similar option `-fxray-instruction-threshold=<N>` is used for XRay to reduce binary size overhead [1]. [1] https://www.llvm.org/docs/XRay.html Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D131816	2022-08-17 06:47:15 -07:00
Ellis Hoag	12e78ff881	[InstrProf] Add the skipprofile attribute As discussed in [0], this diff adds the `skipprofile` attribute to prevent the function from being profiled while allowing profiled functions to be inlined into it. The `noprofile` attribute remains unchanged. The `noprofile` attribute is used for functions where it is dangerous to add instrumentation to while the `skipprofile` attribute is used to reduce code size or performance overhead. [0] https://discourse.llvm.org/t/why-does-the-noprofile-attribute-restrict-inlining/64108 Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D130807	2022-08-04 08:45:27 -07:00
Fangrui Song	a9af1b9a30	[test] Change -function-import tests to use -passes=	2022-07-17 21:43:00 -07:00
Fangrui Song	9ff36df5a4	[PGO][InstrProf][test] Convert to opaque pointers and fix some stale tests	2022-07-16 11:42:35 -07:00
Fangrui Song	f9d6f37201	[LegacyPM] Remove ControlHeightReductionLegacyPass This pass tries to reduce the number of conditional branches in the hot path based on profile. It's mostly a no-op after legacy PGO passes are moved.	2022-07-16 01:35:56 -07:00
Fangrui Song	8658cf999d	[test] Migrate PGO related -instrprof tests to new PM	2022-07-16 01:21:12 -07:00
Rong Xu	5e0443292b	[PGO] Report number of counts being dropped when a hash-mismatch happens This patch reports number of counts being dropped when a hash-mismatch happens. This information will be helpful to the users -- if the dropped counts are large, the user should redo the instrumentation build and recollect the profile. Differential Revision: https://reviews.llvm.org/D129001	2022-07-15 14:53:59 -07:00
Rong Xu	4a40fa82c0	[PGO] Don't cross reference CSFDO profile and non-CSFDO profile Don't cross reference CSFDO profile and non-CSFDO profile when checking the function hash. Only return hash_mismatch when CS bits match, and return unknown_function otherwise. Differential Revision: https://reviews.llvm.org/D129000	2022-07-15 13:57:23 -07:00
Nikita Popov	2a721374ae	[IR] Don't use blockaddresses as callbr arguments Following some recent discussions, this changes the representation of callbrs in IR. The current blockaddress arguments are replaced with `!` label constraints that refer directly to callbr indirect destinations: ; Before: %res = callbr i8* asm "", "=r,r,i"(i8* %x, i8* blockaddress(@test8, %foo)) to label %asm.fallthrough [label %foo] ; After: %res = callbr i8* asm "", "=r,r,!i"(i8* %x) to label %asm.fallthrough [label %foo] The benefit of this is that we can easily update the successors of a callbr, without having to worry about also updating blockaddress references. This should allow us to remove some limitations: * Allow unrolling/peeling/rotation of callbr, or any other clone-based optimizations (https://github.com/llvm/llvm-project/issues/41834) * Allow duplicate successors (https://github.com/llvm/llvm-project/issues/45248) This is just the IR representation change though, I will follow up with patches to remove limtations in various transformation passes that are no longer needed. Differential Revision: https://reviews.llvm.org/D129288	2022-07-15 10:18:17 +02:00
Nikita Popov	41d5033eb1	[IR] Enable opaque pointers by default This enabled opaque pointers by default in LLVM. The effect of this is twofold: * If IR that contains neither explicit ptr nor %T* types is passed to tools, we will now use opaque pointer mode, unless -opaque-pointers=0 has been explicitly passed. * Users of LLVM as a library will now default to opaque pointers. It is possible to opt-out by calling setOpaquePointers(false) on LLVMContext. A cmake option to toggle this default will not be provided. Frontends or other tools that want to (temporarily) keep using typed pointers should disable opaque pointers via LLVMContext. Differential Revision: https://reviews.llvm.org/D126689	2022-06-02 09:40:56 +02:00
Nikita Popov	c8eb83f2d0	[ControlHeightReduction] Use logical and Use logical instead of bitwise and to combine conditions, to avoid propagating poison from a later condition if an earlier one is already false. This avoids introducing branch on poison. Differential Revision: https://reviews.llvm.org/D125898	2022-05-26 18:03:35 +02:00
Ivan Kosarev	ad1d60c3be	[FileCheck] Catch missspelled directives. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D125604	2022-05-26 11:37:19 +01:00
Nikita Popov	b2a13d3e2d	[InstCombine] Use IRBuilder in freeze pushing transform (PR55619) Use IRBuilder so that the newly created freeze instructions automatically gets inserted back into the IC worklist. The changed worklist processing order leads to some cosmetic differences in tests. Fixes https://github.com/llvm/llvm-project/issues/55619.	2022-05-24 15:48:28 +02:00
Ellis Hoag	9a90ea1fdc	[InstrProf] Fix promoter when using counter relocations When using counter relocations, two instructions are emitted to compute the address of the counter variable. ``` %BiasAdd = add i64 ptrtoint <__profc_>, <__llvm_profile_counter_bias> %Addr = inttoptr i64 %BiasAdd to i64* ``` When promoting a counter, these instructions might not be available in the block, so we need to copy these instructions. This fixes https://github.com/llvm/llvm-project/issues/55125 Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D125710	2022-05-16 14:32:39 -07:00
Nikita Popov	7ba484660b	[ControlHeightReduction] Freeze condition when converting select to branch While select conditions can be poison, branch on poison is immediate UB. As such, we need to freeze the condition when converting a select into a branch. Differential Revision: https://reviews.llvm.org/D125398	2022-05-16 10:37:26 +02:00
Wei Wang	26a0d53b15	[CHR] Skip region containing llvm.coro.id When a block containing llvm.coro.id is cloned during CHR, it inserts an invalid PHI node with token type to the beginning of the block containing llvm.coro.begin. To avoid such case, we exclude regions with llvm.coro.id. Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D124418	2022-04-27 10:27:25 -07:00
Paul Kirth	bac6cd5bf8	[misexpect] Re-implement MisExpect Diagnostics Reimplements MisExpect diagnostics from D66324 to reconstruct its original checking methodology only using MD_prof branch_weights metadata. New checks rely on 2 invariants: 1) For frontend instrumentation, MD_prof branch_weights will always be populated before llvm.expect intrinsics are lowered. 2) for IR and sample profiling, llvm.expect intrinsics will always be lowered before branch_weights are populated from the IR profiles. These invariants allow the checking to assume how the existing branch weights are populated depending on the profiling method used, and emit the correct diagnostics. If these invariants are ever invalidated, the MisExpect related checks would need to be updated, potentially by re-introducing MD_misexpect metadata, and ensuring it always will be transformed the same way as branch_weights in other optimization passes. Frontend based profiling is now enabled without using LLVM Args, by introducing a new CodeGen option, and checking if the -Wmisexpect flag has been passed on the command line. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D115907	2022-04-19 21:23:48 +00:00
Fangrui Song	98c22f68e6	[PGO][test] Fix memop_size_opt.ll	2022-04-14 21:16:04 -07:00

1 2 3 4 5 ...

329 Commits