llvm-project

Author	SHA1	Message	Date
Shenghang Tsai	7610b13729	[MLIR] Split ExecutionEngine Initialization out of ctor into an explicit method call (#153524) Retry landing https://github.com/llvm/llvm-project/pull/153373 ## Major changes from previous attempt - remove the test in CAPI because no existing tests in CAPI deal with sanitizer exemptions - update `mlir/docs/Dialects/GPU.md` to reflect the new behavior: load GPU binary in global ctors, instead of loading them at call site. - skip the test on Aarch64 since we have an issue with initialization there --------- Co-authored-by: Mehdi Amini <joker.eph@gmail.com>	2025-08-17 23:07:24 +02:00
Mehdi Amini	bfd490e0cd	Revert "[MLIR] Split ExecutionEngine Initialization out of ctor into an explicit method call" (#153477) Reverts llvm/llvm-project#153373 Sanitizer bot is broken	2025-08-13 19:43:04 +00:00
Shenghang Tsai	2f93693f76	[MLIR] Split ExecutionEngine Initialization out of ctor into an explicit method call (#153373) This PR introduces a mechanism to defer JIT engine initialization, enabling registration of required symbols before global constructor execution. ## Problem Modules containing `gpu.module` generate global constructors (e.g., kernel load/unload) that execute during engine creation. This can force premature symbol resolution, causing failures when: - Symbols are registered via `mlirExecutionEngineRegisterSymbol` after creation - Global constructors exist (even if not directly using unresolved symbols, e.g., an external function declaration) - GPU modules introduce mandatory binary loading logic ## Usage ```c // Create engine without initialization MlirExecutionEngine jit = mlirExecutionEngineCreate(...); // Register required symbols mlirExecutionEngineRegisterSymbol(jit, ...); // Explicitly initialize (runs global constructors) mlirExecutionEngineInitialize(jit); ``` --------- Co-authored-by: Mehdi Amini <joker.eph@gmail.com>	2025-08-13 15:22:01 +02:00
Karlo Basioli	cd585864c0	Pass memory buffer to RuntimeDyld::MemoryManager factory (#142930) `RTDyldObjectLinkingLayer` is currently creating a memory manager without any parameters. In this PR I am passing the MemoryBuffer that will be emitted to the MemoryManager so that the user can use it to configure the behaviour of the MemoryManager.	2025-06-06 00:44:39 +01:00
Nikita Popov	979c275097	[IR] Store Triple in Module (NFC) (#129868) The module currently stores the target triple as a string. This means that any code that wants to actually use the triple first has to instantiate a Triple, which is somewhat expensive. The change in #121652 caused a moderate compile-time regression due to this. While it would be easy enough to work around, I think that architecturally, it makes more sense to store the parsed Triple in the module, so that it can always be directly queried. For this change, I've opted not to add any magic conversions between std::string and Triple for backwards-compatibility purposes, and instead write out needed Triple()s or str()s explicitly. This is because I think a decent number of them should be changed to work on Triple as well, to avoid unnecessary conversions back and forth. The only interesting part in this patch is that the default triple is Triple("") instead of Triple() to preserve existing behavior. The former defaults to using the ELF object format instead of unknown object format. We should fix that as well.	2025-03-06 10:27:47 +01:00
Lang Hames	b18e5b6a36	Re-apply "[ORC] Remove the Triple argument from LLJITBuilder::..." with fixes. This re-applies f905bf3e1ef860c4d6fe67fb64901b6bbe698a91, which was reverted in c861c1a046eb8c1e546a8767e0010904a3c8c385 due to compiler errors, with a fix for MLIR.	2025-03-06 17:17:05 +11:00
JOE1994	884221eddb	[mlir] Tidy uses of llvm::raw_string_ostream (NFC) As specified in the docs, 1) raw_string_ostream is always unbuffered and 2) the underlying buffer may be used directly ( 65b13610a5226b84889b923bae884ba395ad084d for further reference ) * Don't call raw_string_ostream::flush(), which is essentially a no-op. * Avoid unneeded calls to raw_string_ostream::str(), to avoid excess indirection.	2024-09-16 23:23:25 -04:00
Fabian Mora	01dbc5da33	Reland [mlir][ExecutionEngine] Add support for global constructors and destructors #78070 (#78170) This patch adds support for executing global constructors and destructors in the ExecutionEngine.	2024-01-15 12:10:14 -05:00
Cullen Rhodes	3295b88a66	Revert "[mlir][ExecutionEngine] Add support for global constructors and destructors" (#78164) this is causing test failures on AArch64 linux, hitting the following assert: # | mlir-cpu-runner: /home/culrho01/llvm-project/llvm/lib/ExecutionEngine/RuntimeDyld/RuntimeDyldELF.cpp:519: void llvm::RuntimeDyldELF::resolveAArch64Relocation(const SectionEntry &, uint64_t, uint64_t, uint32_t, int64_t): Assertion `isInt<33>(Result) && "overflow check failed for relocation"' failed. Seeing the same in buildbot as well, e.g. https://lab.llvm.org/buildbot/#/builders/179/builds/9094/steps/12/logs/FAIL__MLIR__sparse_codegen_dim_mlir Reverts llvm/llvm-project#78070	2024-01-15 14:21:41 +00:00
Fabian Mora	48e8cd8345	[mlir][ExecutionEngine] Add support for global constructors and destructors (#78070) This patch adds support for executing global constructors and destructors in the `ExecutionEngine`.	2024-01-14 21:41:23 -05:00
Fangrui Song	a3ef858968	[mlir,polly] Replace uses of IRBuilder::getInt8PtrTy with getPtrTy. NFC	2023-11-27 20:58:25 -08:00
Youngsuk Kim	645b7795d4	[mlir] Remove no-op ptr-to-ptr bitcasts (NFC) Opaque pointer cleanup effort. NFC.	2023-10-26 13:01:23 -05:00
JOE1994	204883623e	[NFC] Replace uses of Type::getPointerTo Replace some uses of `Type::getPointerTo` via 2 ways * Remove entirely if it's only used to support an unnecessary bitcast (remove the bitcast as well). * Replace with `PointerType::get`/`PointerType::getUnqual` NFC opaque pointer clean-up effort.	2023-09-29 21:38:53 -04:00
Job Noorman	8de9f2b558	Move SubtargetFeature.h from MC to TargetParser SubtargetFeature.h is currently part of MC while it doesn't depend on anything in MC. Since some LLVM components might have the need to work with target features without necessarily needing MC, it might be worthwhile to move SubtargetFeature.h to a different location. This will reduce the dependencies of said components. Note that I chose TargetParser as the destination because that's where Triple lives and SubtargetFeatures feels related to that. This issue came up during a JITLink review (D149522). JITLink would like to avoid a dependency on MC while still needing to store target features. Reviewed By: MaskRay, arsenm Differential Revision: https://reviews.llvm.org/D150549	2023-06-26 11:20:08 +02:00
Ingo Müller	0eb0fecbc5	[mlir][ExecutionEngine] Only load JITDyLibs without init/destroy funcs. In https://reviews.llvm.org/D153029, I moved the loading/unloading mechanisms of shared libraries from the JIT runner to the execution engine in order to make that mechanism available in the latter (including its Python bindings). However, I realized that I introduced a small change in semantic: previously, the JIT runner checked for the presence of init/destroy functions and only loaded the library as JITDyLib if they were not present. After I moved the code, all libraries were loaded as JITDyLib, even if they registered their symbols explicitly in their init function. I am not sure if this is really a problem but (1) the previous behavior was different and (2) I guess it could cause a problem if some symbols are exported through the init function and have public visibility. This patch reestablishes the original behaviour in the new place of the code. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D153249	2023-06-19 07:38:51 +00:00
Ingo Müller	0b3841eb97	[mlir] Move symbol loading from mlir-cpu-runner to ExecutionEngine. Both the mlir-cpu-runner and the execution engine allow providing a list of shared libraries that should be loaded into the process such that the jitted code can use the symbols from those libraries. The runner had implemented a protocol that allowed libraries to control which symbols they want to provide in that context (with a function called __mlir_runner_init). In absence of that, the runner would rely on the loading mechanism of the execution engine, which didn't do anything particular with the symbols, i.e., only symbols with public visibility were visible to jitted code. Libraries used a mix of the two mechanisms: while the runner utils and C runner utils libs (and potentially others) used public visibility, the async runtime lib (as the only one in the monorepo) used the loading protocol. As a consequence, the async runtime library could not be used through the Python bindings of the execution engine. This patch moves the loading protocol from the runner to the execution engine. For the runner, this should not change anything: it lets the execution engine handle the loading which now implements the same protocol that the runner had implemented before. However, the Python bindings now get to benefit from the loading protocol as well, so the async runtime library (and potentially other out-of-tree libraries) can now be used in that context. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D153029	2023-06-16 14:50:14 +00:00
Andrzej Warzynski	fb0b035e35	[mlir-cpu-runner] Add support for `-mattr` and `-march` flags This patch adds support for `-mattr` and `-march` in mlir-cpu-runner. With this change, one should be able to consistently use mlir-cpu-runner for MLIR's integration tests (instead of e.g. resorting to lli when some additional flags are needed). This is demonstrated in concatenate_dim_1.mlir. In order to support the new flags, this patch makes sure that MLIR's ExecutionEngine/JITRunner (that mlir-cpu-runner is built on top of): * takes into account the new command line flags when creating TargetMachine, * avoids recreating TargetMachine if one is already available, * creates LLVM's DataLayout based on the previously configured TargetMachine. This is necessary in order to make sure that the command line configuration is propagated correctly to the backend code generator. A few additional updates are made in order to facilitate this change, including support for debug dumps from JITRunner. Differential Revision: https://reviews.llvm.org/D146917	2023-03-31 07:34:24 +00:00
Jakub Kuderski	8c258fda1f	[ADT][mlir][NFCI] Do not use non-const lvalue-refs with enumerate Replace references to enumerate results with either result_pairs (reference wrapper type) or structured bindings. I did not use structured bindings everywhere as it wasn't clear to me it would improve readability. This is in preparation to the switch to zip semantics which won't support non-const lvalue reference to elements: https://reviews.llvm.org/D144503. I chose to use values instead of const lvalue-refs because MLIR is biased towards avoiding `const` local variables. This won't degrade performance because currently `result_pair` is cheap to copy (size_t + iterator), and in the future, the enumerator iterator dereference will return temporaries anyway. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D146006	2023-03-15 10:43:56 -04:00
Archibald Elliott	d768bf994f	[NFC][TargetParser] Replace uses of llvm/Support/Host.h The forwarding header is left in place because of its use in `polly/lib/External/isl/interface/extract_interface.cc`, but I have added a GCC warning about the fact it is deprecated, because it is used in `isl` from where it is included by Polly.	2023-02-10 09:59:46 +00:00
Benjamin Kramer	fcf4e360ba	Iterate over StringMaps using structured bindings. NFCI.	2022-12-04 18:36:41 +01:00
Mehdi Amini	a4ef4445a0	Apply clang-tidy fixes for readability-container-size-empty in ExecutionEngine.cpp (NFC)	2022-11-12 23:47:38 +00:00
Denys Shabalin	95c083f579	[mlir] Fix and test python bindings for dump_to_object_file Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D136334	2022-10-20 15:53:16 +02:00
rkayaith	200889fbd9	[mlir-cpu-runner] Support parsing operations other than 'builtin.module' as top-level This adds a `--no-implicit-module` option, which disables the insertion of a top-level `builtin.module` during parsing. The top-level op is required to have the `SymbolTable` trait. The majority of the change here is removing `ModuleOp` from interfaces. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D134238	2022-10-03 15:36:59 -04:00
Kazu Hirata	6d5fc1e3d5	[mlir] Don't use Optional::getValue (NFC)	2022-06-20 23:20:25 -07:00
River Riddle	8bb5b657fe	[mlir:ExecutionEngine] Update use of getAddress now that lookup returns ExecutorAddr This was changed in 16dcbb53dc7968a3752661aac731172ebe0faf64	2022-05-05 14:24:32 -07:00
Emilio Cota	b24de9f684	[mlir] ExecutionEngine: default enableObjectCache to false The enableObjectCache option was added in https://reviews.llvm.org/rG06e8101034e, defaulting to false. However, the init code added there got its logic reversed (cache(enableObjectCache ? nullptr : new SimpleObjectCache()), which was fixed in https://reviews.llvm.org/rGd1186fcb04 by setting the default to true, thereby preserving the existing behavior even if it was unintentional. Default now the object cache to false as it was originally intended. While at it, mention in enableObjectCache's documentation how the cache can be dumped. Reviewed-by: mehdi_amini Differential Revision: https://reviews.llvm.org/D121291	2022-03-10 11:24:48 -05:00
Emilio Cota	011f653265	[mlir] Add sectionMemoryMapper to ExecutionEngineOptions By specifying a sectionMemoryMapper, users can control how memory for JIT code is allocated. In particular, I need this in order to use a named memory region so that profilers such as perf(1) can correctly label execution cycles coming from JIT'ed code. Reviewed-by: ezhulenev Differential Revision: https://reviews.llvm.org/D120415	2022-02-23 14:56:50 -05:00
Emilio Cota	a7db3c611b	[mlir][NFC] Use options struct in ExecutionEngine::create Its number of optional parameters has grown too large, which makes adding new optional parameters quite a chore. Fix this by using an options struct. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D120380	2022-02-23 10:21:46 -05:00
Nicolas Vasilache	f68ecdd458	[mlir] Add CMake flags to properly enable Jit event listeners. By default, the listeners do nothing unless linked in. This revision allows the "Perf" and "Intel" Jit event listeners to be used. The "OProfile" event listener is not enabled at this time, the associated library structure is not well-isolated. Differential Revision: https://reviews.llvm.org/D116552	2022-01-04 02:11:02 -05:00
Nicolas Vasilache	f1f5a85af8	[mlir] NFC - Format ExecutionEngine.cpp	2022-01-03 17:17:29 -05:00
Mehdi Amini	02b6fb218e	Fix clang-tidy issues in mlir/ (NFC) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115956	2021-12-20 20:25:01 +00:00
Tres Popp	106f307499	Rename MlirExecutionEngine lookup to lookupPacked The purpose of the change is to make clear whether the user is retrieving the original function or the wrapper function, in line with the invoke commands. This new functionality is useful for users that already have defined their own packed interface, so they do not want the extra layer of indirection, or for users wanting to look at the resulting primary function rather than the wrapper function. All locations, except the python bindings, now have a `lookupPacked` method that matches the original `lookup` functionality. `lookup` still exists, but with new semantics. - `lookup` returns the function with a given name. If `bool f(int,int)` is compiled, `lookup` will return a reference to `bool(f)(int,int)`. - `lookupPacked` returns the packed wrapper of the function with the given name. If `bool f(int,int)` is compiled, `lookupPacked` will return `void(mlir_f)(void**)`. Differential Revision: https://reviews.llvm.org/D114352	2021-11-22 14:12:09 +01:00
Reid Kleckner	89b57061f7	Move TargetRegistry.(h|cpp) from Support to MC This moves the registry higher in the LLVM library dependency stack. Every client of the target registry needs to link against MC anyway to actually use the target, so we might as well move this out of Support. This allows us to ensure that Support doesn't have includes from MC/*. Differential Revision: https://reviews.llvm.org/D111454	2021-10-08 14:51:48 -07:00
Nikita Popov	ffe94738ed	[ExecutionEngine] Fix GEP type Fix bug introduced in 2c68ecccc9ee1fb37eca318a9b3572813a137cd5, the GEP type was off-by-ptr. Apparently I didn't run the MLIR tests.	2021-07-17 23:45:00 +02:00
Nikita Popov	2c68ecccc9	[OpaquePtr] Remove uses of CreateGEP() without element type Remove uses of to-be-deprecated API. In cases where the correct element type was not immediately obvious to me, fall back to explicit getPointerElementType().	2021-07-17 22:56:27 +02:00
Nikita Popov	f3f0c6cd47	[mlir] Remove uses of type-less CreateLoad() APIs (NFC) For the use in LLVMOps.td I used the getPointerElementType() escape hatch, as it's not obvious to me how the load type should be properly obtained here.	2021-03-11 18:39:20 +01:00
Kern Handa	3c4cdd0b6a	[mlir] ExecutionEngine needs special handling for COFF binaries Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97141	2021-02-23 17:34:19 -08:00
Alex Zinenko	ce8f10d6cb	[mlir] Simplify ModuleTranslation for LLVM IR A series of preceding patches changed the mechanism for translating MLIR to LLVM IR to use dialect interface with delayed registration. It is no longer necessary for specific dialects to derive from ModuleTranslation. Remove all virtual methods from ModuleTranslation and factor out the entry point to be a free function. Also perform some cleanups in ModuleTranslation internals. Depends On D96774 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96775	2021-02-16 18:42:52 +01:00
Mehdi Amini	d6efb6fc86	Rework ExecutionEngine::invoke() to make it more friendly to use from C++ This new invoke will pack a list of arguments before calling the `invokePacked` method. It accepts the returned value as an output argument wrapped in `ExecutionEngine::Result<T>`, and delegates the packing of arguments to a trait to allow for customization for some types. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95961	2021-02-06 01:32:50 +00:00
Ella Ma	1756d67934	[llvm][clang][mlir] Add checks for the return values from Target::createXXX to prevent potential null deref All these potential null pointer dereferences are reported by my static analyzer for null smart pointer dereferences, which has a different implementation from `alpha.cplusplus.SmartPtr`. The checked pointers in this patch are initialized by Target::createXXX functions. When the creator function pointer is not correctly set, a null pointer will be returned, or the creator function may originally return a null pointer. Some of them may not make sense as they may be checked before entering the function, but I fixed them all in this patch. I submit this fix because 1) similar checks are found in some other places in the LLVM codebase for the same return value of the function; and, 2) some of the pointers are dereferenced before they are checked, which may definitely trigger a null pointer dereference if the return value is nullptr. Reviewed By: tejohnson, MaskRay, jpienaar Differential Revision: https://reviews.llvm.org/D91410	2020-11-21 21:04:12 -08:00
River Riddle	65fcddff24	[mlir][BuiltinDialect] Resolve comments from D91571 * Move ops to a BuiltinOps.h * Add file comments	2020-11-19 11:12:49 -08:00
River Riddle	73ca690df8	[mlir][NFC] Remove references to Module.h and Function.h These includes have been deprecated in favor of BuiltinDialect.h, which contains the definitions of ModuleOp and FuncOp. Differential Revision: https://reviews.llvm.org/D91572	2020-11-17 00:55:47 -08:00
George Mitenkov	89808ce734	[MLIR][mlir-spirv-cpu-runner] A SPIR-V cpu runner prototype This patch introduces a SPIR-V runner. The aim is to run a gpu kernel on a CPU via GPU -> SPIRV -> LLVM conversions. This is a first prototype, so more features will be added in due time. - Overview The runner follows similar flow as the other runners in-tree. However, having converted the kernel to SPIR-V, we encode the bind attributes of global variables that represent kernel arguments. Then SPIR-V module is converted to LLVM. On the host side, we emulate passing the data to device by creating in main module globals with the same symbolic name as in kernel module. These global variables are later linked with ones from the nested module. We copy data from kernel arguments to globals, call the kernel function from nested module and then copy the data back. - Current state At the moment, the runner is capable of running 2 modules, nested one in another. The kernel module must contain exactly one kernel function. Also, the runner supports rank 1 integer memref types as arguments (to be scaled). - Enhancement of JitRunner and ExecutionEngine To translate nested modules to LLVM IR, JitRunner and ExecutionEngine were altered to take an optional (default to `nullptr`) function reference that is a custom LLVM IR module builder. This allows to customize LLVM IR module creation from MLIR modules. Reviewed By: ftynse, mravishankar Differential Revision: https://reviews.llvm.org/D86108	2020-10-26 09:09:29 -04:00
Aden Grue	670063eb22	Preserve the error message when MemoryBuffer creation fails Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D86326	2020-08-21 18:03:25 +00:00
Alex Zinenko	db1c197bf8	[mlir] take LLVMContext in MLIR-to-LLVM-IR translation Due to the original type system implementation, LLVMDialect in MLIR contains an LLVMContext in which the relevant objects (types, metadata) are created. When an MLIR module using the LLVM dialect (and related intrinsic-based dialects NVVM, ROCDL, AVX512) is converted to LLVM IR, it could only live in the LLVMContext owned by the dialect. The type system no longer relies on the LLVMContext, so this limitation can be removed. Instead, translation functions now take a reference to an LLVMContext in which the LLVM IR module should be constructed. The caller of the translation functions is responsible for ensuring the same LLVMContext is not used concurrently as the translation no longer uses a dialect-wide context lock. As an additional bonus, this change removes the need to recreate the LLVM IR module in a different LLVMContext through printing and parsing back, decreasing the compilation overhead in JIT and GPU-kernel-to-blob passes. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D85443	2020-08-07 14:22:30 +02:00
River Riddle	9db53a1827	[mlir][NFC] Remove usernames and google bug numbers from TODO comments. These were largely leftover from when MLIR was a google project, and don't really follow LLVM guidelines.	2020-07-07 01:40:52 -07:00
Haruki Imai	9f2ce5b915	[mlir][SystemZ] Fix incompatible datalayout in SystemZ MLIR tests in "mlir/test/mlir-cpu-runner" fails in SystemZ (z14) because of incompatible datalayout error. This patch fixes it by setting host CPU name in createTargetMachine() Differential Revision: https://reviews.llvm.org/D80130	2020-05-20 03:46:26 +00:00
Eugene Zhulenev	3a11ca7bed	[MLIR] Add symbol map to mlir ExecutionEngine Add additional symbol mapping to be able to provide custom symbols to jitted code at runtime. Differential Revision: https://reviews.llvm.org/D79812	2020-05-14 22:30:03 +02:00
Eugene Zhulenev	3c5dd5863c	[MLIR] Register JIT event listeners with RTDyldObjectLinkingLayer Use a new API to register JIT event listeners. Differential Revision: https://reviews.llvm.org/D78435	2020-05-09 11:17:22 +02:00
Stephan Herhut	69040d5b0b	[MLIR] Allow for multiple gpu modules during translation. This change makes the ModuleTranslation threadsafe by locking on the LLVMContext. Furthermore, we now clone the llvm module into a new context when compiling to PTX similar to what the OrcJit does. Differential Revision: https://reviews.llvm.org/D78207	2020-04-16 14:18:31 +02:00


99 Commits