llvm-project

Author	SHA1	Message	Date
Maksim Levental	c3f381ccfe	[mlir-python] Fix duplicate EnumAttr builder registration across dialects. (#187191 ) When multiple dialects share td `#includes` (e.g. `affine` includes `arith`), each dialect's `*_enum_gen.py` file registers attribute builders under the same keys, causing "already registered" errors on the second import; the first commit checks in such a case which currently fails on main: ``` # \| RuntimeError: Attribute builder for 'Arith_CmpFPredicateAttr' is already registered with func: <function _arith_cmpfpredicateattr at 0x78d13cbe9a80> ``` This PR implements a two-pronged fix: 1. Add `allow_existing=True` to `register_attribute_builder` (and the underlying C++ `registerAttributeBuilder`). When set, silently skips registration if the key already exists (first-wins semantics). This handles `EnumInfo`-based builders which have no dialect prefix (e.g. `AtomicRMWKindAttr`, `Arith_CmpFPredicateAttr`), which may be emitted by every dialect whose td file includes the defining file; 2. Filter `EnumAttr` builders by `-bind-dialect` in `EnumPythonBindingGen.cpp` and register them under dialect qualified keys (`"dialect.AttrName"`). Update `OpPythonBindingGen.cpp` to look up the same qualified keys for EnumAttr typed op attributes (detected via `isSubClassOf("EnumAttr")`). Pass `-bind-dialect` from `AddMLIRPython.cmake`. This approach incurs no changes to `ir.py` registrations (no "builtin." prefix), and no manual builder additions to individual dialect Python files (unlike the previous attempt https://github.com/llvm/llvm-project/pull/117918). Note, this PR was "clauded" not "coded".	2026-03-19 21:02:23 -07:00
Twice	044776691a	[MLIR][Python] Refine the behavior of Python-defined dialect reloading (#186128 ) This includes several changes: - `Dialect.load(reload=False)` will fail if the dialect was already loaded in a different context. To prevent the further program abortion. - `Dialect.load(reload=True)` implies `replace=True` in dialect/operation registering. - `PyGlobals::registerDialectImpl` now has a parameter `replace`. - `register_dialect` and `register_operation` is no longer exposed in `mlir.dialects.ext`. This should solve the registering problem found in writing transform test cases by @rolfmorel.	2026-03-15 10:25:24 +08:00
RattataKing	8f6866c9e9	[MLIR][Python] Clean remaining LLVM dependencies in MLIR-PY bindings (#181779 ) This PR fixed [issues](https://github.com/iree-org/iree/actions/runs/21956878131/job/63423389868#step:7:211) caused by dropping `LLVMSupport` in PR #180986, dropped the remaining direct llvm dependencies from mlir-python binding files. Previously `LLVMSupport` was dropped while some uncleaned `mlir/CAPI/*` sources were still being pulled into mlir-py, and those files still directly depended on LLVM headers. The issue was masked via a global `include_directories(${LLVM_INCLUDE_DIRS})` in `mlir/CMakeLists.txt`, out-of-tree builds (e.g., IREE) that define Python module targets outside the mlir/ directory tree would fail with "no such llvm file" errors.	2026-02-17 15:23:10 -05:00
RattataKing	d380b29a7c	[MLIR][Python] Remove partial LLVM APIs in python bindings (5/n) (#180644 ) This PR continues work from https://github.com/llvm/llvm-project/pull/178290 Added local helper functions to avoid dependency on LLVM APIs. --------- Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>	2026-02-10 15:24:56 -05:00
RattataKing	71a8973a3e	[MLIR][Python] Remove partial LLVM APIs in python bindings (4/n) (#180256 ) This PR continues work from #178290 It replaces some LLVM utilities with straightforward `std::` equivalents.	2026-02-06 17:04:49 -05:00
RattataKing	b20fca82ab	[MLIR][Python] Remove partial LLVM APIs in python bindings (3/n) (#178984 ) This PR continues work from #178290 It cleans up multiple LLVM utilities in .h files under `mlir/Bindings/python`, along with the corresponding .cpp files.	2026-02-04 13:18:32 -05:00
Twice	426374ed12	[MLIR][Python] Ignore the returned status of `loadDialectModule` in lookup functions (#179609 ) Since `loadDialectModule` doesn't work for Python-defined dialects (`mlir.dialects.ext`), currently we should lookup for dialect/operation/opadaptor class even if the `loadDialectModule` function fails. It's also because users can import some modules manually, and we do already ignore it in some cases: `e2061328a8/mlir/lib/Bindings/Python/Globals.cpp (L163-L166)` Related to https://github.com/llvm/llvm-project/pull/176920#discussion_r2762029022.	2026-02-04 15:25:07 +08:00
Twice	f992f9719f	[MLIR][Python] Support dialect conversion in python bindings (#177782 ) This PR adds dialect conversion support to the MLIR Python bindings. Because it introduces a number of new APIs, it’s a fairly large PR. It mainly includes the following parts: * Add a set of types and APIs to the C API, including `MlirConversionTarget`, `MlirConversionPattern`, `MlirTypeConverter`, `MlirConversionPatternRewriter`, and others. * Add the corresponding types and APIs to the Python bindings. * Extend `mlir-tblgen` with codegen for Python adaptor classes, which generates an adaptor class for each op. Note that this PR only adds support for 1-to-1 conversions, 1-to-N type/value conversions are not supported yet. --------- Co-authored-by: Maksim Levental <maksim.levental@gmail.com>	2026-01-31 12:37:49 +08:00
Maksim Levental	f0ef5dba6d	[mlir][Python] create MLIRPythonSupport (#171775 ) # What This PR adds a shared library `MLIRPythonSupport` which contains all of the CRTP classes ike `PyConcreteValue`, `PyConcreteType`, `PyConcreteAttribute`, as well as other useful code like `Defaulting` and etc enabling their reuse in downstream projects. Downstream projects can now do ```c++ struct PyTestType : mlir::python::MLIR_BINDINGS_PYTHON_DOMAIN::PyConcreteType<PyTestType> { ... }; class PyTestAttr : public mlir::python::MLIR_BINDINGS_PYTHON_DOMAIN::PyConcreteAttribute<PyTestAttr> { ... } NB_MODULE(_mlirPythonTestNanobind, m) { PyTestType::bind(m); PyTestAttr::bind(m); } ``` instead of using the discordant alternative `mlir_type_subclass`/`mlir_attr_subclass` (same goes for `PyConcreteValue`/`mlir_value_subclass`). # Why This PR is mostly code motion (along with CMake) but before I describe the changes I want to state the goals/benefits: 1. Currently upstream "core" extensions and "dialect" extensions ([all of the `Dialect` extensions here](`d7c734b5a1/mlir/lib/Bindings/Python`)) are a two-tier system; a. [core extensions](https://github.com/llvm/llvm-project/blob/main/mlir/lib/Bindings/Python/IRTypes.cpp#L361) enjoy first class support as far as type inference[^3], type stub generation, and ease of implementation, while dialect extensions [have poorer support](https://reviews.llvm.org/D150927), incorrect type stub generation much more tedious (boilerplate) implementation; b. Crucially, this two-tiered system is reflected in the fact that the two sets of types/attributes are not in the same Python object hierarchy. To wit: `isinstance(..., Type)` and `isinstance(..., Attribute)` are not supported for the dialect extensions[^2]; c. Since these types are not exposed in public headers, downstream users (dialect extensions or not) cannot write functions that overload on e.g. `PyFloat8Type` - that's quite a [useful feature](`fdbee98df8/cpp_ext/TorchOps.cpp (L29-L69)`)! 2. The dialect extensions incur a sizeable performance penalty relative to the core extensions in that every single trip across the wire (either `python->cpp` or `cpp->python`) requires work in addition to nanobind's own casting/construction pipeline; a. When going from `python->cpp`, [we extract the capsule object from the Python object](https://github.com/llvm/llvm-project/blob/main/mlir/include/mlir/Bindings/Python/NanobindAdaptors.h#L219C24-L219C46) and then extract from the capsule the `Mlir` opaque struct/ptr. This side isn't so onerous; b. When going from `cpp->python` we call long-hand call Python `import` APIs and construct the Python object using `_CAPICreate`. Note, there at least 2 `attr` calls incurred in addition to `_CAPICreate`; this is already much more [efficiently handled by nanobind itself](`4ba51fcf79/src/nb_internals.h (L381-L382)`)! 3. This division blocks various features: in some configurations[^1] we trigger a circular import bug because "dialect" types and attributes perform an [import of the root `_mlir` module](`bd9651bf78/mlir/include/mlir/Bindings/Python/NanobindAdaptors.h (L585)`) when they are created (the types themselves, not even instances of those types). This blocks type stub generation for dialect extensions (i.e., the reason we currently only generate type stubs for `_mlir`). # How Prior this was not done/possible because of "ODR" issues but I have resolved those issues; the basic idea for how we solve this is "move things we want to share into shared libraries": 1. Move IRCore (stuff like `PyConcreteValue`, `PyConcreteType`, `PyConcreteAttribute`) into `MLIRPythonSupport`; - Note, we move the rest of the things in `IRModule.h` (renamed to `IRCore.h`) because `PyConcreteValue`, `PyConcreteType`, `PyConcreteAttribute` depend on them. This makes for a bigger PR than one would hope for but ultimately I think we should give people access to these classes to use as they see fit (specifically inherit from, but also liberally use in bindings signatures instead of the opaque `Mlir` struct wrappers). 2. Put all of this code into a nested namespace `MLIR_BINDINGS_PYTHON_DOMAIN` which is determined by a compile time define (and tied to `MLIR_BINDINGS_PYTHON_NB_DOMAIN`). This is necessary in order to prevent conflicts on both symbol name and* typeid (necessary for nanobind to not double register binded types) between multiple bindings libraries (e.g., `torch-mlir`, and `jax`). Note [nanobind doesn't support `module_local` like pybind11](https://nanobind.readthedocs.io/en/latest/porting.html#removed-features). It does support `NB_DOMAIN` but that is not sufficient for disambiguating typeids across projects (to wit: we currently define `NB_DOMAIN` and it was still necessary to move everything to a nested namespace); 3. Build the [nanobind library itself as a shared object](https://github.com/wjakob/nanobind/blob/master/cmake/nanobind-config.cmake#L127) (and link it to both the extensions and `MLIRPythonSupport`). 4. CMake to make this work, in-tree, out-of-tree, downstream, upstream, etc. # Testing Three tests are added here 1. `PythonTestModuleNanobind` is ported to use `PyConcreteType<PyTestType>` instead of `mlir_type_subclass` and `PyConcreteAttribute<PyTestAttr>` instead of `mlir_atrr_subclass`, verifying this works for non-core extensions in-tree; 2. `StandaloneExtensionNanobind` is ported to use `struct PyCustomType : mlir::python::MLIR_BINDINGS_PYTHON_DOMAIN::PyConcreteType<PyCustomType>` instead of `mlir_type_subclass` verifying this works for non-core extensions out-of-tree; 3. `StandaloneExtensionNanobind`'s `smoketest` is extended to also load another bindings package (namely `mlir`) verifying `MLIR_BINDINGS_PYTHON_DOMAIN` successfully disambiguates symbols and typeids. I have also tested this downstream: https://github.com/llvm/eudsl/pull/287 as well run the following builder bots: mlir-nvidia-gcc7: https://lab.llvm.org/buildbot/#/buildrequests/6654424?redirect_to_build=true I have also tested against IREE: https://github.com/iree-org/iree/pull/21916 # Integration It is highly recommended to set the CMake var `MLIR_BINDINGS_PYTHON_NB_DOMAIN` (which will also determine `MLIR_BINDINGS_PYTHON_DOMAIN`) to something unique for each downstream. This can also be passed explicitly to `add_mlir_python_modules` if your project builds multiple bindings packages. I added a `WARNING` to this effect in `AddMLIRPython.cmake`. [^3]: Python values being typed correctly when exiting from cpp; [^1]: Specifically when the modules are imported using `importlib`, which occurs with nanobind's [stubgen](https://github.com/wjakob/nanobind/blob/master/src/stubgen.py#L965); [^2]: The workaround we implemented was a class method for the dialect bindings called `Class.isinstance(...)`;	2026-01-05 09:08:13 -08:00

9 Commits