llvm-project

Author	SHA1	Message	Date
Shilei Tian	d4ecd1241c	Revert "[OpenMP] Introduce kernel environment" This reverts commit 35cfadfbe2decd9633560b3046fa6c17523b2fa9. It makes a couple of buildbots unhappy because of the following test failures: - `Transforms/OpenMP/add_attributes.ll'` - `mapping/declare_mapper_target_data.cpp` on AMDGPU	2023-04-22 20:56:35 -04:00
Shilei Tian	35cfadfbe2	[OpenMP] Introduce kernel environment This patch introduces per kernel environment. Previously, flags such as execution mode are set through global variables with name like `__kernel_name_exec_mode`. They are accessible on the host by reading the corresponding global variable, but not from the device. Besides, some assumptions, such as no nested parallelism, are not per kernel basis, preventing us applying per kernel optimization in the device runtime. This is a combination and refinement of patch series D116908, D116909, and D116910. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D142569	2023-04-22 20:46:38 -04:00
Bjorn Pettersson	a20f7efbc5	Remove several no longer needed includes. NFCI Mostly removing includes of InitializePasses.h and Pass.h in passes that no longer has support for the legacy PM.	2023-04-17 13:54:19 +02:00
Joseph Huber	46ee1021d9	[OpenMP] Replace HeapToShared's initial value with `poison` There's a desire to move away from `undef` in LLVM. Currently we want to have the `addressspace(3)` variables use `poison` instead. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D147719	2023-04-14 09:39:32 -05:00
Johannes Doerfert	94d14536a9	[OpenMP][FIX] More AAExecutionDomain fixes We missed certain updates, mostly to call site information, and dependent AAs did not get recomputed. We also did not properly distinguish and propagate incoming and outgoing information of call sites. The runtime tests passes now, I'll add a proper test for AAExecutionDomain soon that covers all the cases and ensures we haven't forgotten more updates. To help unblock some apps, I'll put the fix first.	2023-03-27 21:36:21 -07:00
Johannes Doerfert	3a7cb3d45a	[OpenMP] Adjust generic state machine simplification CB This callback caused us to potentially miss out on call edges if we were expecting a custom state machine since the custom state machine was not created but the workers also did not enter the generic one. I have not observed an issue and don't know how to create a test for sure, but it is saver to err on the conservative side for now.	2023-03-27 21:30:23 -07:00
Johannes Doerfert	7f7e1749c5	[OpenMP] Be smarter about the insertion point for deduplication We can use dominance and avoid the special handling of kernels and prevent inserting code before allocas accidentally (as happend in the runtime test).	2023-03-27 21:30:23 -07:00
Kazu Hirata	b9c4b95b11	[llvm] Use ConstantInt::{isZero,isOne} (NFC)	2023-03-21 17:40:35 -07:00
Ishaan Gandhi	aead502b11	[Attributor] Add convergent abstract attribute This patch adds the AANonConvergent abstract attribute. It removes the convergent attribute from functions that only call non-convergent functions. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D143228	2023-03-20 22:33:50 -07:00
Johannes Doerfert	8f47fd05d5	[OpenMPOpt][FIX] Avoid removing barriers in callees We could be smarter about this, e.g., if the callee has a single call site, but for now we first avoid the miscompile.	2023-03-20 17:44:24 -07:00
Johannes Doerfert	b89558a2ae	[OpenMP][FIX] Properly track and lookup Execution Domains This is a two part fix. First, we need two Execution Domains (ED) to track the values of a function. One for incoming values and one for outgoing values. This was conflated before. Second, at the function entry we need to look at the incoming information from call sites not iterate over non-existing predecessors.	2023-03-20 17:44:24 -07:00
Johannes Doerfert	578d507359	[OpenMP][FIX] Ensure to determine aligned regions properly There were missing checks in the aligned region code, copy-paste errors (= usage of the IsReachedFromAlignedBarrierOnly value instead of IsReachingAlignedBarrierOnly value on the forward pass), and a missing update of the call state for sync declarations and definitions. Partially fixes https://github.com/llvm/llvm-project/issues/60425	2023-02-02 02:28:10 -08:00
Johannes Doerfert	18a2975b57	[Attributor][FIX] Ensure we use the right AAExecutionDomain Before we might have ended up queriying the AAExecutionDomain of a different function, which resulted in wrong optimistic results. Partially fixes https://github.com/llvm/llvm-project/issues/60425	2023-02-02 02:27:54 -08:00
Guillaume Chatelet	ffc1205bde	[reland][NFC] Transition GlobalObject alignment from MaybeAlign to Align This is a follow up on https://reviews.llvm.org/D142459#4081179. This first patch adds an overload to `GlobalObject::setAlignment` that accepts an `Align` type. This already handles most of the calls. This patch also converts a few call sites to the new type when this is safe. Here is the list of the remaining call sites: - [clang/lib/CodeGen/CodeGenModule.cpp:1688](`e195e6bad6/clang/lib/CodeGen/CodeGenModule.cpp (L1688)`) - [llvm/lib/AsmParser/LLParser.cpp:1309](`e195e6bad6/llvm/lib/AsmParser/LLParser.cpp (L1309)`) - [llvm/lib/AsmParser/LLParser.cpp:6050](`e195e6bad6/llvm/lib/AsmParser/LLParser.cpp (L6050)`) - [llvm/lib/Bitcode/Reader/BitcodeReader.cpp:3871](`e195e6bad6/llvm/lib/Bitcode/Reader/BitcodeReader.cpp (L3871)`) - [llvm/lib/Bitcode/Reader/BitcodeReader.cpp:4030](`e195e6bad6/llvm/lib/Bitcode/Reader/BitcodeReader.cpp (L4030)`) - [llvm/lib/IR/Core.cpp:2018](`e195e6bad6/llvm/lib/IR/Core.cpp (L2018)`) - [llvm/lib/IR/Globals.cpp:141](`e195e6bad6/llvm/lib/IR/Globals.cpp (L141)`) - [llvm/lib/Linker/IRMover.cpp:660](`e195e6bad6/llvm/lib/Linker/IRMover.cpp (L660)`) - [llvm/lib/Linker/LinkModules.cpp:361](`e195e6bad6/llvm/lib/Linker/LinkModules.cpp (L361)`) - [llvm/lib/Linker/LinkModules.cpp:362](`e195e6bad6/llvm/lib/Linker/LinkModules.cpp (L362)`) - [llvm/lib/Transforms/IPO/MergeFunctions.cpp:782](`e195e6bad6/llvm/lib/Transforms/IPO/MergeFunctions.cpp (L782)`) - [llvm/lib/Transforms/IPO/MergeFunctions.cpp:840](`e195e6bad6/llvm/lib/Transforms/IPO/MergeFunctions.cpp (L840)`) - [llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp:1813](`e195e6bad6/llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp (L1813)`) - [llvm/tools/llvm-reduce/deltas/ReduceGlobalObjects.cpp:27](`e195e6bad6/llvm/tools/llvm-reduce/deltas/ReduceGlobalObjects.cpp (L27)`) Differential Revision: https://reviews.llvm.org/D142708	2023-01-31 15:09:10 +00:00
Guillaume Chatelet	e098ee726e	Revert D142708 "[NFC] Transition GlobalObject alignment from MaybeAlign to Align" This is breaking the build bots. e.g., https://lab.llvm.org/buildbot/#/builders/121/builds/27549 This reverts commit 6717efe74da825214cb4d307ad35e5fbda353301.	2023-01-31 14:12:51 +00:00
Guillaume Chatelet	6717efe74d	[NFC] Transition GlobalObject alignment from MaybeAlign to Align This is a follow up on https://reviews.llvm.org/D142459#4081179. This first patch adds an overload to `GlobalObject::setAlignment` that accepts an `Align` type. This already handles most of the calls. This patch also converts a few call sites to the new type when this is safe. Here is the list of the remaining call sites: - [clang/lib/CodeGen/CodeGenModule.cpp:1688](`e195e6bad6/clang/lib/CodeGen/CodeGenModule.cpp (L1688)`) - [llvm/lib/AsmParser/LLParser.cpp:1309](`e195e6bad6/llvm/lib/AsmParser/LLParser.cpp (L1309)`) - [llvm/lib/AsmParser/LLParser.cpp:6050](`e195e6bad6/llvm/lib/AsmParser/LLParser.cpp (L6050)`) - [llvm/lib/Bitcode/Reader/BitcodeReader.cpp:3871](`e195e6bad6/llvm/lib/Bitcode/Reader/BitcodeReader.cpp (L3871)`) - [llvm/lib/Bitcode/Reader/BitcodeReader.cpp:4030](`e195e6bad6/llvm/lib/Bitcode/Reader/BitcodeReader.cpp (L4030)`) - [llvm/lib/IR/Core.cpp:2018](`e195e6bad6/llvm/lib/IR/Core.cpp (L2018)`) - [llvm/lib/IR/Globals.cpp:141](`e195e6bad6/llvm/lib/IR/Globals.cpp (L141)`) - [llvm/lib/Linker/IRMover.cpp:660](`e195e6bad6/llvm/lib/Linker/IRMover.cpp (L660)`) - [llvm/lib/Linker/LinkModules.cpp:361](`e195e6bad6/llvm/lib/Linker/LinkModules.cpp (L361)`) - [llvm/lib/Linker/LinkModules.cpp:362](`e195e6bad6/llvm/lib/Linker/LinkModules.cpp (L362)`) - [llvm/lib/Transforms/IPO/MergeFunctions.cpp:782](`e195e6bad6/llvm/lib/Transforms/IPO/MergeFunctions.cpp (L782)`) - [llvm/lib/Transforms/IPO/MergeFunctions.cpp:840](`e195e6bad6/llvm/lib/Transforms/IPO/MergeFunctions.cpp (L840)`) - [llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp:1813](`e195e6bad6/llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp (L1813)`) - [llvm/tools/llvm-reduce/deltas/ReduceGlobalObjects.cpp:27](`e195e6bad6/llvm/tools/llvm-reduce/deltas/ReduceGlobalObjects.cpp (L27)`) Differential Revision: https://reviews.llvm.org/D142708	2023-01-31 13:59:58 +00:00
Joseph Huber	0bdde9dfb9	[OpenMP] Make OpenMPOpt aware of the OpenMP runtime's status The `OpenMPOpt` pass contains optimizations that generate new calls into the OpenMP runtime. This causes problems if we are in a state where the runtime has already been linked statically. Generating these new calls will result in them never being resolved. We should indicate if we are in a "post-link" LTO phase and prevent OpenMPOpt from generating new runtime calls. Generally, it's not desireable for passes to maintain state about the context in which they're called. But this is the only reasonable solution to static linking when we have a pass that generates new runtime calls. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D142646	2023-01-26 13:23:44 -06:00
Shilei Tian	e13179db7a	[NFC] clang-format OpenMPOpt.cpp	2023-01-25 19:05:53 -05:00
Johannes Doerfert	5238df7ed5	[Attributor] Allow (inter-procedural) "CFG" reasoning for aligned regions If an instruction is executed in an aligned region we can ignore threading effects and use CFG reasoning (dominance and reachability). This is true because all threads are together in an aligned region and there cannot be one waiting for a signal at a place not connected via the control flow. More dedicated tests will follow. More details can be found here: "Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution", IPDPS 2022, https://www.osti.gov/servlets/purl/1890094	2023-01-23 22:45:48 -08:00
Johannes Doerfert	d41f93dae9	[OpenMP] Readnone calls do not have non-local side-effects	2023-01-23 22:45:47 -08:00
Kazu Hirata	7d3306fa42	[llvm] Fix warnings This patch fixes: llvm/lib/IR/DataLayout.cpp:942:13: warning: unused variable ‘VecTy’ [-Wunused-variable] llvm/lib/Transforms/IPO/OpenMPOpt.cpp:2899:27: warning: unused variable ‘MI’ [-Wunused-variable]	2023-01-23 10:57:56 -08:00
Johannes Doerfert	129faec711	[OpenMP] Identify non-aligned barriers executed in an aligned context Even if a barrier does not enforce aligned execution, it will effectively be like an aligned barrier if it is executed by all threads in an aligned way. We lack control flow divergence analysis here so we can only do (basic block) local reasoning for now.	2023-01-22 21:42:07 -08:00
Johannes Doerfert	8e124515cd	[OpenMP][FIX] Ensure not to dereference a nullptr	2023-01-22 20:06:23 -08:00
Johannes Doerfert	43c1c59f73	[OpenMP] Merge barrier elimination into AAExecutionDomain With this patch we track aligned barriers in AAExecutionDomain and also delete unnecessary barriers there. This allows us to eliminate barriers across blocks, across functions, and in the presence of complex accesses that do not force a barrier. Further, we can use the collected information to enable store-load forwarding in a threaded environment (follow up patch). Differential Revision: https://reviews.llvm.org/D140463	2023-01-22 16:34:59 -08:00
Johannes Doerfert	2275e325e4	[OpenMP] Guarding restrictions are required only for guarding If we do not guard code during SPMDzation, we do not need to check conditions for successfull guarding. That is, even if some code is executed in different modes, it does not prevent SPMDzation if there is no guarded code in there.	2023-01-22 15:53:42 -08:00
Johannes Doerfert	ea3c24932a	[OpenMP][FIX] Properly update ParallelLevels tracker	2023-01-22 15:52:45 -08:00
Johannes Doerfert	7bc88cbe5c	[OpenMP] Simplify `llvm.assume` operands in device code	2023-01-22 01:27:41 -08:00
Johannes Doerfert	cefa5cefdc	[OpenMP] Replace ExternalizationRAII with virtual uses The externalization was always a stopgap solution. One of the drawbacks is that it is very conservative no matter if we actually require the functions at the end of the pass. The new concept is more generic and properly integrates into the dependence graph. Whenever we might need a function, it has a "virtual use" that cannot be analyzed. If we do not because of some AA state, there will be a dependence to ensure state changes trigger revisits of uses, including a potentially new virtual use.	2023-01-12 00:14:06 -08:00
Johannes Doerfert	cddcbfae14	[OpenMP][FIX] Avoid performance regression accidentally introduced	2023-01-11 00:58:34 -08:00
Johannes Doerfert	b2a8d2c69b	[OpenMP] Avoid running openmp-opt on dead functions The Attributor has logic to run only on assumed live functions and this is exposed to users now. OpenMP-opt will (mostly) ignore dead internal functions now but run the same deduction as before if an internal function is marked live. This should lower compile time as we run on less code and delete more code early on. For the full OpenMC module compiled with noinline and JITed at runtime, we save ~25%, or ~10s on my machine during JITing.	2023-01-10 15:03:51 -08:00
Johannes Doerfert	c3de9c1c7b	[OpenMP] Ensure AAHeapToShared is only looking at one function When we collect and process allocations we did not verify the call against the anchor scope / associated function. This should be done to avoid processing calls multiple times and generally looking at calls not in the AAs scope.	2023-01-10 15:03:51 -08:00
Johannes Doerfert	d1033e3cad	[OpenMP] Disable ICV deduction by default. This is not tested well and needs to be revisited in the future.	2023-01-10 15:03:51 -08:00
Johannes Doerfert	22c898dbfd	[OpenMP] Use Attributor to find underlying objects of stores When we see a store in generic mode we need to decide if we should guard it for SPMDzation. This patch changes the getUnderlyingObjects call to the more optimistic getAssumedUnderlyingObjects call to identify more thread local pointers.	2023-01-09 23:34:52 -08:00
Rafael A Herrera Guaitero	13b909ef27	OpenMPOpt: Check nested parallelism in target region Analysis that determines if a parallel region can reach another parallel region in any target region of the TU. A new global var is emitted with the name of the kernel + "_nested_parallelism", which is either 0 or 1 depending on the result. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D141010	2023-01-09 15:55:30 -06:00
Matt Arsenault	2e7640e6dc	OpenMPOpt: Fix null dereference on missing declaration cache Found by llvm-reduce fuzzing.	2023-01-03 16:26:37 -05:00
Matt Arsenault	c3054aeb5a	OpenMPOpt: Fix using wrong address space for alloca Using the function's address space makes no sense. Copied from the existing test, with more addrspace variation. Could just replace the existing one with this version if it's redundant.	2023-01-03 16:26:37 -05:00
Matt Arsenault	a7425e299e	OpenMPOpt: Use getFnAttributeAsParsedInteger	2023-01-03 11:40:42 -05:00
Matt Arsenault	a87de3a6dc	OpenMPOpt: Fix introducing empty nvvm.annotations into module	2023-01-03 10:32:10 -05:00
Joseph Huber	7ae3db66e8	[OpenMP] Fix leftover use of removed function Summary: Didn't notice this one floating around as it was still cached somewhere. Delete it.	2022-12-20 13:43:00 -06:00
Joseph Huber	bb4c6e7a06	[OpenMP] Remove folding logic for removed runtime function This function was removed from the device runtime at some point but we still have specialized code for it and an entry in the runtime kinds. Remove it as it is no longer necessary. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D140402	2022-12-20 13:37:38 -06:00
Johannes Doerfert	d4f3d8212a	[OpenMP][FIX] Ensure to inline `ompx::` functions after the rename in D140334	2022-12-19 16:41:49 -08:00
Fangrui Song	21c4dc7997	std::optional::value => operator*/operator-> value() has undesired exception checking semantics and calls __throw_bad_optional_access in libc++. Moreover, the API is unavailable without _LIBCPP_NO_EXCEPTIONS on older Mach-O platforms (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS). This fixes clang.	2022-12-17 00:42:05 +00:00
Johannes Doerfert	dde21c1983	[OpenMP][FIX] Remove accidental and somewhat random change	2022-12-13 19:38:15 -08:00
Johannes Doerfert	90609fb68f	[OpenMP][NFCI] Remove effectively dead code in clang and the runtime Differential Revision: https://reviews.llvm.org/D136903	2022-12-13 18:44:19 -08:00
Johannes Doerfert	f9c29878b0	Revert "[OpenMP][NFCI] Remove effectively dead code in clang and the runtime" This reverts commit c1c8cbbf5f29257d084a23a2f6c4236c40b7afb9. One of the tests seems to be flaky/non-deterministic.	2022-12-12 22:08:28 -08:00
Johannes Doerfert	c1c8cbbf5f	[OpenMP][NFCI] Remove effectively dead code in clang and the runtime	2022-12-12 20:55:36 -08:00
Johannes Doerfert	2dd158d655	[OpenMP] Make barrier elimination work in the presence of llvm.assume Assumptions are droppable and eliminating them to eliminate barriers seems reasonable.	2022-12-07 22:37:57 -08:00
Kazu Hirata	1f421b6d7e	[llvm] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-06 22:45:17 -08:00
Fangrui Song	75801e3b45	Transforms/IPO: llvm::Optional => std::optional	2022-12-05 07:07:19 +00:00
Kazu Hirata	3c09ed006a	[llvm] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 17:12:44 -08:00

1 2 3 4 5 ...

339 Commits