llvm-project

Author	SHA1	Message	Date
Jacques Pienaar	07967d4af8	[mlir] Switch to new LDBG macro (#150616 ) Change local variants to use new central one.	2025-07-25 18:22:46 +02:00
Kazu Hirata	0925d7572a	[mlir] Remove unused includes (NFC) (#150266 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-07-23 15:18:53 -07:00
Maksim Levental	dce6679cf5	[mlir][NFC] update `mlir/Dialect` create APIs (16/n) (#149922 ) See https://github.com/llvm/llvm-project/pull/147168 for more info.	2025-07-21 19:57:30 -04:00
Kazu Hirata	54bd936ec9	[mlir] Remove unused includes (NFC) (#147455 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-07-07 23:40:44 -07:00
Nicolas Vasilache	2b28d10022	[mlir][SCF][GPU] Add DeviceMaskingAttrInterface (#146943 ) This revision adds DeviceMaskingAttrInterface and extends DeviceMappingArrayAttr to accept a union of DeviceMappingAttrInterface and DeviceMaskingAttrInterface. Support is added to GPUTransformOps to take advantage of this information and lower to block/warpgroup/warp/thread specialization when mapped to linear ids. The revision also connects to scf::ForallOp and uses the new attribute to implement warp specialization. The implementation is in the form of a GPUMappingMaskAttr, which can be additionally passed to the scf.forall.mapping attribute to specify a mask on compute resources that should be active. In the first implementation the masking is a bitfield that specifies for each processing unit whether it is active or not. In the future, we may want to implement this as a symbol to refer to dynamically defined values. Extending op semantics with an operand is deemed too intrusive at this time. --------- Co-authored-by: Oleksandr "Alex" Zinenko <git@ozinenko.com>	2025-07-07 18:06:41 +02:00
Nicolas Vasilache	ea62de5b1d	[mlir] NFC - refactor id builder and avoid leaking impl details (#146922 )	2025-07-07 15:42:48 +02:00
Nicolas Vasilache	0a62836969	[mlir][gpu][transforms] Add support for mapping to lanes (#146912 ) This revision adds a new attribute for mapping `scf.forall` to linear lane ids. Example: ``` // %arg2 and %arg3 map to lanes [0, 6) and are turned into epxressions // involving threadIdx.x/y by the map_nested_forall_to_threads // transformation. This results in a if (linear_thread_id < 6) conditional. scf.forall (%arg2, %arg3) in (2, 3) { ... } {mapping = [#gpu.lane<linear_dim_0>, #gpu.lane<linear_dim_1>]} ``` --------- Co-authored-by: Oleksandr "Alex" Zinenko <git@ozinenko.com>	2025-07-07 15:14:52 +02:00
Jakub Kuderski	198c5dac37	[mlir][transform] Clean up prints. NFC. (#136401 ) Use `llvm::interleaved` from #135517 to simplify printing.	2025-04-19 12:11:06 -04:00
Guray Ozen	baf27862dd	[MLIR][NVGPU] Move max threads/blocks size to dialect (NFC) (#124454 ) This PR moves maximum number of threads in a block and block in a grid to nvgpu dialect to avoid replicated code. The limits are defined here: https://docs.nvidia.com/cuda/cuda-c-programming-guide/#features-and-technical-specifications-technical-specifications-per-compute-capability	2025-02-05 12:38:37 +01:00
Kazu Hirata	129f1001c3	[Dialect] Migrate away from PointerUnion::{is,get} (NFC) (#120818 ) Note that PointerUnion::{is,get} have been soft deprecated in PointerUnion.h: // FIXME: Replace the uses of is(), get() and dyn_cast() with // isa<T>, cast<T> and the llvm::dyn_cast<T> I'm not touching PointerUnion::dyn_cast for now because it's a bit complicated; we could blindly migrate it to dyn_cast_if_present, but we should probably use dyn_cast when the operand is known to be non-null.	2024-12-21 08:17:51 -08:00
Kazu Hirata	5262865aac	[mlir] Construct SmallVector with ArrayRef (NFC) (#101896 )	2024-08-04 11:43:05 -07:00
Oleksandr "Alex" Zinenko	5a9bdd85ee	[mlir] split transform interfaces into a separate library (#85221 ) Transform interfaces are implemented, direction or via extensions, in libraries belonging to multiple other dialects. Those dialects don't need to depend on the non-interface part of the transform dialect, which includes the growing number of ops and transitive dependency footprint. Split out the interfaces into a separate library. This in turn requires flipping the dependency from the interface on the dialect that has crept in because both co-existed in one library. The interface shouldn't depend on the transform dialect either. As a consequence of splitting, the capability of the interpreter to automatically walk the payload IR to identify payload ops of a certain kind based on the type used for the entry point symbol argument is disabled. This is a good move by itself as it simplifies the interpreter logic. This functionality can be trivially replaced by a `transform.structured.match` operation.	2024-03-20 22:15:17 +01:00
Mehdi Amini	9a2a6a728a	Apply clang-tidy fixes for readability-identifier-naming in Utils.cpp (NFC)	2024-01-17 08:51:41 -08:00
Mehdi Amini	74cf9bcf71	Apply clang-tidy fixes for performance-unnecessary-value-param in Utils.cpp (NFC)	2024-01-17 08:51:41 -08:00
Mehdi Amini	d8ed736c0e	Apply clang-tidy fixes for bugprone-macro-parentheses in Utils.cpp (NFC)	2024-01-15 20:59:13 -08:00
Nicolas Vasilache	44e6318cea	[mlir][transforms] Revamp the implementation of mapping loops to GPUs This revision significantly simplifies the specification and implementation of mapping loops to GPU ids. Each type of mapping (block, warpgroup, warp, thread) now comes with 2 mapping modes: 1. a 3-D "grid-like" mode, subject to alignment considerations on threadIdx.x, on which predication may occur on a per-dimension 3-D sub-rectangle basis. 2. a n-D linearized mode, on which predication may only occur on a linear basis. In the process, better size and alignment requirement inference are introduced along with improved runtime verification messages. The `warp_dims` attribute was deemed confusing and is removed from the transform in favor of better size inference. Differential Revision: https://reviews.llvm.org/D155941	2023-07-26 00:09:08 +02:00
Nicolas Vasilache	90ecfa2a40	[mlir][linalg] NFC - Move some utils in preparation for revamping mapping of scf.forall	2023-07-25 01:19:57 +02:00

17 Commits