llvm-project

Author	SHA1	Message	Date
lfrenot	40afff7bd9	[mlir][LLVM] Add disjoint flag (#115855 ) The implementation is mostly based on the one existing for the exact flag. disjoint means that for each bit, that bit is zero in at least one of the inputs. This allows the Or to be treated as an Add since no carry can occur from any bit. If the disjoint keyword is present, the result value of the or is a [poison value](https://llvm.org/docs/LangRef.html#poisonvalues) if both inputs have a one in the same bit position. For vectors, only the element containing the bit is poison.	2024-11-15 13:48:01 +01:00
Sirui Mu	1dfb104eac	[mlir][LLVMIR] Add operand bundle support for llvm.intr.assume (#112143 ) This patch adds operand bundle support for `llvm.intr.assume`. This patch actually contains two parts: - `llvm.intr.assume` now accepts operand bundle related attributes and operands. `llvm.intr.assume` does not take constraint on the operand bundles, but obviously only a few set of operand bundles are meaningful. I plan to add some of those (e.g. `aligned` and `separate_storage` are what interest me but other people may be interested in other operand bundles as well) in future patches. - The definitions of `llvm.call`, `llvm.invoke`, and `llvm.call_intrinsic` actually define `op_bundle_tags` as an operation property. It turns out this approach would introduce some unnecessary burden if applied equally to the intrinsic operations because properties are not available through `Operation ` but we have to operate on `Operation ` during the import/export of intrinsics, so this PR changes it from a property to an array attribute. This patch relands commit d8fadad07c952c4aea967aefb0900e4e43ad0555.	2024-10-16 20:49:02 +08:00
Sirui Mu	484c02780b	Revert "[mlir][LLVMIR] Add operand bundle support for llvm.intr.assume (#112143 )" This reverts commit d8fadad07c952c4aea967aefb0900e4e43ad0555. The commit breaks the following CI builds: - ppc64le-mlir-rhel-clang: https://lab.llvm.org/buildbot/#/builders/129/builds/7685 - ppc64le-flang-rhel-clang: https://lab.llvm.org/buildbot/#/builders/157/builds/10338	2024-10-16 14:15:31 +08:00
Sirui Mu	d8fadad07c	[mlir][LLVMIR] Add operand bundle support for llvm.intr.assume (#112143 ) This patch adds operand bundle support for `llvm.intr.assume`. This patch actually contains two parts: - `llvm.intr.assume` now accepts operand bundle related attributes and operands. `llvm.intr.assume` does not take constraint on the operand bundles, but obviously only a few set of operand bundles are meaningful. I plan to add some of those (e.g. `aligned` and `separate_storage` are what interest me but other people may be interested in other operand bundles as well) in future patches. - The definitions of `llvm.call`, `llvm.invoke`, and `llvm.call_intrinsic` actually define `op_bundle_tags` as an operation property. It turns out this approach would introduce some unnecessary burden if applied equally to the intrinsic operations because properties are not available through `Operation ` but we have to operate on `Operation ` during the import/export of intrinsics, so this PR changes it from a property to an array attribute.	2024-10-16 12:51:50 +08:00
Tarun Prabhu	839344f025	[clang][flang][mlir] Reapply "Support -frecord-command-line option (#102975 )" The underlying issue was caused by a file included in two different places which resulted in duplicate definition errors when linking individual shared libraries. This was fixed in c3201ddaeac02a2c86a38b [#109874].	2024-10-14 08:44:24 -06:00
Abid Qadeer	cd12ffb622	[mlir][debug] Allow multiple DIGlobalVariableExpression on globals. (#111981 ) Currently, we allow only one DIGlobalVariableExpressionAttr per global. It is especially evident in import where we pick the first from the list and ignore the rest. In contrast, LLVM allows multiple DIGlobalVariableExpression to be attached to the global. They are needed for correct working of things like DICommonBlock. This PR removes this restriction in mlir. Changes are mostly mechanical. One thing on which I went a bit back and forth was the representation inside GlobalOp. I would be happy to change if there are better ways to do this. --------- Co-authored-by: Tobias Gysi <tobias.gysi@nextsilicon.com>	2024-10-13 23:36:00 +01:00
Rahul Joshi	fa789dffb1	[NFC] Rename `Intrinsic::getDeclaration` to `getOrInsertDeclaration` (#111752 ) Rename the function to reflect its correct behavior and to be consistent with `Module::getOrInsertFunction`. This is also in preparation of adding a new `Intrinsic::getDeclaration` that will have behavior similar to `Module::getFunction` (i.e, just lookup, no creation).	2024-10-11 05:26:03 -07:00
Abid Qadeer	36c34ec967	[mlir][debug] Support DICommonBlock. (#111706 ) A COMMON block is a named area of memory that holds a collection of variables. Fortran subprograms may map the COMMON block memory area to a list of variables. A common block is represented in LLVM debug by DICommonBlock. This PR adds support for this in MLIR. The changes are mostly mechanical apart from small change to access the DICompileUnit when the scope of the variable is DICommonBlock. --------- Co-authored-by: Tobias Gysi <tobias.gysi@nextsilicon.com>	2024-10-10 18:07:06 +01:00
Youngsuk Kim	123e8c735d	[mlir] Don't call llvm::raw_string_ostream::flush() (NFC) Don't call raw_string_ostream::flush(), which is essentially a no-op. As specified in the docs, raw_string_ostream is always unbuffered. ( 65b13610a5226b84889b923bae884ba395ad084d for further reference )	2024-09-22 15:37:34 -05:00
David Spickett	737c414e1d	Revert "[clang][flang][mlir] Support -frecord-command-line option (#102975 )" This reverts commit b3533a156da92262eb19429d8c12f53e87f5ccec. It caused test failures in shared library builds: https://lab.llvm.org/buildbot/#/builders/80/builds/3854	2024-09-20 11:30:50 +00:00
Tarun Prabhu	b3533a156d	[clang][flang][mlir] Support -frecord-command-line option (#102975 ) Add support for the -frecord-command-line option that will produce the llvm.commandline metadata which will eventually be saved in the object file. This behavior is also supported in clang. Some refactoring of the code in flang to handle these command line options was carried out. The corresponding -grecord-command-line option which saves the command line in the debug information has not yet been enabled for flang.	2024-09-19 18:28:50 -06:00
Mogball	80cd2141eb	[mlir][llvm] Add `externally_initialized` support to GlobalOp This maps the `externally_initialized` flag in `llvm::GlobalVariable` to `GlobalOp` and adds exported support.	2024-09-05 14:49:39 +01:00
Sirui Mu	318b0678e3	[mlir][LLVM] Add support for constant struct with multiple fields (#102752 ) Currently `mlir.llvm.constant` of structure types restricts that the structure type effectively represents a complex type -- it must have exactly two fields of the same type and the field type must be either an integer type or a float type. This PR relaxes this restriction and it allows the structure type to have an arbitrary number of fields.	2024-08-22 15:32:55 +02:00
Abid Qadeer	00a1a45a7d	[mlir][llvmir][debug] Correctly generate location for phi nodes. (#105534 ) In [convertBlockImpl](`87eeed1f0e/mlir/lib/Target/LLVMIR/ModuleTranslation.cpp (L959)`), the debug location is set on the builder before the op is processed. This results in correct location being given to corresponding llvm instructions. But same is not done when phi nodes are created a few lines above. This result is phi nodes getting whatever the current debug location of the builder is. It can be nothing or in worst case a stale location. Fixed by calling SetCurrentDebugLocation before generating phi nodes.	2024-08-22 10:08:04 +01:00
Tarun Prabhu	90aac06c7f	[flang][mlir] Add llvm.ident metadata when compiling with flang This brings the behavior of flang in line with clang which also adds this metadata unconditionally. Co-authored-by: Tarun Prabhu <tarun.prabhu@gmail.com>	2024-08-12 11:56:19 -06:00
Victor Perez	6d2bbba187	[MLIR][LLVM] Attach kernel metadata representation to `llvm.func` (#101314 ) Add optional attributes to `llvm.func` representing LLVM so-called "kernel" metadata: - [`vec_type_hint`](https://clang.llvm.org/docs/AttributeReference.html#vec-type-hint) - [`work_group_size_hint`](https://clang.llvm.org/docs/AttributeReference.html#work-group-size-hint) - [`reqd_work_group_size`](https://clang.llvm.org/docs/AttributeReference.html#reqd-work-group-size) - [`intel_reqd_sub_group_size`](https://clang.llvm.org/docs/AttributeReference.html#intel-reqd-sub-group-size). --------- Signed-off-by: Victor Perez <victor.perez@codeplay.com>	2024-08-02 11:53:30 +02:00
Finlay	2eea9d6a2d	[mlir] Rename memory attribute to memory_effects in llvmir dialect (#100108 ) This commit renames the memory attribute on operations to "memory_effects" to be in line with the naming LLVM IR is using.	2024-07-23 14:17:05 +02:00
Finlay	aa21ee7926	[MLIR] Add attributes no_unwind and will_return to the LLVMIR dialect (#98921 ) And testing. These are being added to be used in the GPU to LLVM SPV pass. --------- Co-authored-by: Victor Perez <victor.perez@codeplay.com>	2024-07-17 11:26:25 +02:00
Alexis Perry-Holby	f1d3fe7aae	Add basic -mtune support (#98517 ) Initial implementation for the -mtune flag in Flang. This PR is a clean version of PR #96688, which is a re-land of PR #95043	2024-07-16 16:48:24 +01:00
Abid Qadeer	dab36b288a	[mlir][debug] Allow global with local scope. (#98358 ) Currently, only those global variables in debug info are allowed which have a compile unit or module scope. But global variable with function scope are used in debug info to represent certain type of variables. One example will be static local variables in C. Here the variable is represented as global variable with parent function as its scope. See the code and debug metadata as generated by clang below. You can see that scope of `DIGlobalVariable` is `DISubprogram`. ``` int test() { static int a = 0; return a++; } !0 = !DIGlobalVariableExpression(var: !1...) !1 = !DIGlobalVariable(name: "a", scope: !2 ...) !2 = !DISubprogram(name: "test" ...) !7 = !DICompileUnit(... globals: !8...) !8 = !{!0} ``` Similar issue exist in fortran where global variable with function scope are used to represent local variable with save attribute. This PR will allows such variables during module translation.	2024-07-11 16:31:11 +01:00
Mats Petersson	ad315ebfee	[MLIR][NFC]Fix unusued variable warning. (#97813 ) This just changes a dyn_cast to and isa call to check for type, which avoids getting a warning in clang. The variable `ity` is not used in the next block. No functional change.	2024-07-09 10:35:06 +01:00
runseny	e5865ec93c	[MLIR][LLVM] Add ftz and fuse FP ops related function attribute support (#97812 ) Adds `denormal-fp-math-f32`, `denormal-fp-math`, `fp-contract` to llvmFuncOp attributes. `denormal-fp-math-f32` and `denormal-fp-math` can enable the ftz, that is , flushing denormal to zero. `fp-contract` can enable the fma fusion such as `mul + add -> fma`	2024-07-09 09:34:43 +01:00
Finlay	d6df01870a	[MLIR] Add the convergent attribute to LLVM Dialect (#97709 ) In order to use the convergent attribute in the GPUToLLVMSPV pass, I've added the attribute to the LLVM dialect. Some details on the convergent attribute https://llvm.org/docs/ConvergentOperations.html#convergent-operations	2024-07-05 09:33:56 +02:00
Ramkumar Ramachandra	db791b278a	mlir/LogicalResult: move into llvm (#97309 ) This patch is part of a project to move the Presburger library into LLVM.	2024-07-02 10:42:33 +01:00
Tarun Prabhu	8dd9494056	Revert "[flang] Add basic -mtune support" (#96678 ) Reverts llvm/llvm-project#95043	2024-06-25 13:25:39 -06:00
Alexis Perry-Holby	a790279bf2	[flang] Add basic -mtune support (#95043 ) This PR adds -mtune as a valid flang flag and passes the information through to LLVM IR as an attribute on all functions. No specific architecture optimizations are added at this time.	2024-06-25 18:39:35 +01:00
Stephen Tozer	48f8d95f33	[MLIR][DebugInfo] Set debug info format in MLIR->IR translation (#95329 ) MLIR's LLVM dialect does not internally support debug records, only converting to/from debug intrinsics. To smooth the transition from intrinsics to records, there is a step prior to IR->MLIR translation that switches the IR module to intrinsic-form; this patch adds the equivalent conversion to record-form at MLIR->IR translation. This is a partial reapply of https://github.com/llvm/llvm-project/pull/95098 which can be landed once the flang frontend has been updated by https://github.com/llvm/llvm-project/pull/95306. This is the counterpart to the earlier patch https://github.com/llvm/llvm-project/pull/89735 which handled the IR->MLIR conversion.	2024-06-13 16:27:15 +01:00
Johannes de Fine Licht	c012e487b7	[MLIR][LLVM] Promote noinline/alwaysinline/optnone out of passthrough (#95110 ) The `noinline`, `alwaysinline`, and `optnone` function attributes are already being used in MLIR code for the LLVM inlining interface and in some SPIR-V lowering, despite residing in the passthrough dictionary, which is intended as exactly that -- a pass through MLIR -- and not to model any actual semantics being handled in MLIR itself. Promote the `noinline`, `alwaysinline`, and `optnone` attributes out of the passthrough dictionary on `llvm.func` into first class unit attributes, updating the import and export accordingly. Add a verifier to `llvm.func` that checks that these attributes are not set in an incompatible way according to the LLVM specification. Update the LLVM dialect inlining interface to use the first class attributes to check whether inlining is possible.	2024-06-12 08:29:02 +02:00
Stephen Tozer	d83f37f039	Revert#2 "[MLIR][Flang][DebugInfo] Set debug info format in MLIR->IR translation (#95098 )" Also reverts "[MLIR][Flang][DebugInfo] Convert debug format in MLIR translators" The patch above introduces behaviour controlled by an LLVM flag into the Flang driver, which is incorrect behaviour. This reverts commits: 3cc2710e0dd53bb82742904fa13014018a1137ed. 460408f78b30720950040e336f7b566aa7203269.	2024-06-11 15:14:31 +01:00
Stephen Tozer	460408f78b	Reapply "[MLIR][Flang][DebugInfo] Set debug info format in MLIR->IR translation (#95098 )" Reapplies the original patch with some additional conversion layers added to the MLIR translator, to ensure that we don't write the new debug info format unless WriteNewDbgInfoFormat is set. This reverts commit 8c5d9c79b96ed8297b381e00d3a706a432cd6c9d.	2024-06-11 13:46:09 +01:00
Stephen Tozer	8c5d9c79b9	Revert "[MLIR][Flang][DebugInfo] Set debug info format in MLIR->IR translation (#95098 )" Reverted due to failure on buildbot due to missing use of the WriteNewDbgInfoFormat flag in MLIR. This reverts commit ca920bb6285e9995f5a202d040af79363e98ab28.	2024-06-11 13:29:38 +01:00
Stephen Tozer	ca920bb628	[MLIR][Flang][DebugInfo] Set debug info format in MLIR->IR translation (#95098 ) MLIR's LLVM dialect does not internally support debug records, only converting to/from debug intrinsics. To smooth the transition from intrinsics to records, there is a step prior to IR->MLIR translation that switches the IR module to intrinsic-form; this patch adds the equivalent conversion to record-form at MLIR->IR translation, and also modifies the flang front end to use the WriteNewDbgInfoFormat flag when it is emitting LLVM IR.	2024-06-11 13:11:41 +01:00
Christian Ulmann	9b2a349991	[MLIR][ModuleTranslation] Add disableVerification parameter (NFC) (#94445 ) This commit adds a boolean parameter that allows downstream users to disable the verification when translating an MLIR module to LLVM IR. This is helpful for debugging broken LLVM IR modules post translation.	2024-06-05 12:47:24 +02:00
Valentin Clement (バレンタインクレメン)	428b9be648	[mlir] Align num elements type to LLVM ArrayType (#93230 ) MLIR LLMArrayType is using `unsigned` for the number of elements while LLVM ArrayType is using `uint64_t` `4ae896fe97/llvm/include/llvm/IR/DerivedTypes.h (L377)` This leads to silent truncation when we use it for globals in flang. ``` program test integer(8), parameter :: large = 2**30 real, dimension(large) :: bigarray common /c/ bigarray bigarray(999) = 666 end ``` The above program would result in a segfault since the global would be of size 0 because of the silent truncation. ``` fir.global common @c_(dense<0> : vector<4294967296xi8>) : !fir.array<4294967296xi8> ``` became ``` llvm.mlir.global common @c_(dense<0> : vector<4294967296xi8>) {addr_space = 0 : i32} : !llvm.array<0 x i8> ``` This patch updates the definition of MLIR ArrayType to take `uint64_t` as argument of the number of elements to be compatible with LLVM.	2024-05-29 13:05:44 -07:00
Christian Ulmann	b00e0c1671	[MLIR][Analysis] Consolidate topological sort utilities (#92563 ) This PR attempts to consolidate the different topological sort utilities into one place. It adds them to the analysis folder because the `SliceAnalysis` uses some of these. There are now two different sorting strategies: 1. Sort only according to SSA use-def chains 2. Sort while taking regions into account. This requires a much more elaborate traversal and cannot be applied on graph regions that easily. This additionally reimplements the region aware topological sorting because the previous implementation had an exponential space complexity. I'm open to suggestions on how to combine this further or how to fuse the test passes.	2024-05-22 08:48:10 +02:00
Christian Ulmann	e919df5777	[MLIR][Transforms] Correct block sorting utils name (NFC) (#92558 ) This commit renames the name of the block sorting utility function to `getBlocksSortedByDominance`. A topological order is not defined on a general directed graph, so the previous name did not make sense.	2024-05-17 17:36:09 +02:00
Abid Qadeer	77b80bd818	[mlir][flang] Improve handling of fortran module variables. (#91604 ) Currently, only those global variables which are at compile unit scope are added to the 'globals' list of the DICompileUnit. This does not work for languages which support modules (e.g. Fortran) where hierarchy can be variable -> module -> compile unit. To fix this, if a variable scope points to a module, we walk one level up and see if module is in the compile unit scope. This was initially part of #91582 which adds debug information for Fortran module variables. @kiranchandramohan pointed out that MLIR changes should go in separate PRs.	2024-05-14 18:08:10 +01:00
Victor Perez	77cbc9bf60	[MLIR][LLVM] Add `llvm.experimental.constrained.fptrunc` operation (#86260 ) Add operation mapping to the LLVM `llvm.experimental.constrained.fptrunc.*` intrinsic. The new operation implements the new `LLVM::FPExceptionBehaviorOpInterface` and `LLVM::RoundingModeOpInterface` interfaces. --------- Signed-off-by: Victor Perez <victor.perez@codeplay.com>	2024-03-26 11:02:50 +01:00
Tobias Gysi	adda597388	[MLIR] Add index bitwidth to the DataLayout (#85927 ) When importing from LLVM IR the data layout of all pointer types contains an index bitwidth that should be used for index computations. This revision adds a getter to the DataLayout that provides access to the already stored bitwidth. The function returns an optional since only pointer-like types have an index bitwidth. Querying the bitwidth of a non-pointer type returns std::nullopt. The new function works for the built-in Index type and, using a type interface, for the LLVMPointerType.	2024-03-21 09:07:57 +01:00
Tom Eccles	27534d69e2	[mlir][LLVM] erase call mappings in forgetMapping() (#84955 ) It looks like the mappings for call instructions were forgotten here. This fixes a bug in OpenMP when in-lining a region containing call operations multiple times. OpenMP array reductions 4/6 Previous PR: https://github.com/llvm/llvm-project/pull/84954 Next PR: https://github.com/llvm/llvm-project/pull/84957	2024-03-20 10:03:26 +00:00
Tobias Gysi	e39e30e952	[mlir][llvm] Fix access group translation (#83257 ) This commit fixes the translation of access group metadata to LLVM IR. Previously, it did not use a temporary metadata node to model the placeholder of the self-referencing access group nodes. This is dangerous since, the translation may produce a metadata list with a null entry that is later on changed changed with a self reference. At the same time, for example the debug info translation may create the same uniqued node, which after setting the self-reference the suddenly references the access group metadata. The commit avoids such breakages.	2024-02-28 14:45:18 +01:00
Jie Fu	ef7417fbc5	[mlir] Fix -Wunused-but-set-variable in ModuleTranslation.cpp (NFC) mlir/lib/Target/LLVMIR/ModuleTranslation.cpp:1050:11: error: variable 'numConstantsHit' set but not used [-Werror,-Wunused-but-set-variable] int numConstantsHit = 0; ^ mlir/lib/Target/LLVMIR/ModuleTranslation.cpp:1051:11: error: variable 'numConstantsErased' set but not used [-Werror,-Wunused-but-set-variable] int numConstantsErased = 0; ^ 2 errors generated.	2024-02-27 10:37:17 +08:00
Xiang Li	c11627c2f4	[MLIR][LLVM] Fix memory explosion when converting global variable bodies in ModuleTranslation (#82708 ) There is memory explosion when converting the body or initializer region of a large global variable, e.g. a constant array. For example, when translating a constant array of 100000 strings: llvm.mlir.global internal constant @cats_strings() {addr_space = 0 : i32, alignment = 16 : i64} : !llvm.array<100000 x ptr<i8>> { %0 = llvm.mlir.undef : !llvm.array<100000 x ptr<i8>> %1 = llvm.mlir.addressof @om_1 : !llvm.ptr<array<1 x i8>> %2 = llvm.getelementptr %1[0, 0] : (!llvm.ptr<array<1 x i8>>) -> !llvm.ptr<i8> %3 = llvm.insertvalue %2, %0[0] : !llvm.array<100000 x ptr<i8>> %4 = llvm.mlir.addressof @om_2 : !llvm.ptr<array<1 x i8>> %5 = llvm.getelementptr %4[0, 0] : (!llvm.ptr<array<1 x i8>>) -> !llvm.ptr<i8> %6 = llvm.insertvalue %5, %3[1] : !llvm.array<100000 x ptr<i8>> %7 = llvm.mlir.addressof @om_3 : !llvm.ptr<array<1 x i8>> %8 = llvm.getelementptr %7[0, 0] : (!llvm.ptr<array<1 x i8>>) -> !llvm.ptr<i8> %9 = llvm.insertvalue %8, %6[2] : !llvm.array<100000 x ptr<i8>> %10 = llvm.mlir.addressof @om_4 : !llvm.ptr<array<1 x i8>> %11 = llvm.getelementptr %10[0, 0] : (!llvm.ptr<array<1 x i8>>) -> !llvm.ptr<i8> %12 = llvm.insertvalue %11, %9[3] : !llvm.array<100000 x ptr<i8>> ... (ignore the remaining part) } where @om_1, @om_2, ... are string global constants. Each time an operation is converted to LLVM, a new constant is created. When it comes to llvm.insertvalue, a new constant array of 100000 elements is created and the old constant array (input) is not destroyed. This causes memory explosion. We observed that, on a system with 128 GB memory, the translation of 100000 elements got killed due to using up all the memory. On a system with 64 GB, 65536 elements was enough to cause the translation killed. There is a previous patch (https://reviews.llvm.org/D148487) which fix this issue but was reverted for https://github.com/llvm/llvm-project/issues/62802 The old patch checks generated constants and destroyed them if there is no use. But the check of use for the constant is too early, which cause the constant be removed before use. This new patch added a map was added a map to save expected use count for a constant. Then decrease when reach each use. And only erase the constant when the use count reach to zero With new patch, the repro in https://github.com/llvm/llvm-project/issues/62802 finished correctly.	2024-02-26 21:15:12 -05:00
Rishi Surendran	fa6850a998	[mlir][nvvm]Add support for grid_constant attribute on LLVM function arguments (#78228 ) Add support for attribute nvvm.grid_constant on LLVM function arguments. The attribute can be attached only to arguments of type llvm.ptr that have llvm.byval attribute. Generate LLVM metadata for functions with nvvm.grid_constant arguments. The metadata node is a list of integers, where each integer n denotes that the nth parameter has the grid_constant annotation (numbering from 1). The generated metadata node will be handled by NVVM compiler. See https://docs.nvidia.com/cuda/nvvm-ir-spec/index.html#supported-properties for documentation on grid_constant property. This patch also adds convertParameterAttr to LLVMTranslationDialectInterface for supporting the translation of derived dialect attributes on function parameters	2024-02-12 13:16:59 -08:00
Sander de Smalen	d313614b60	[AArch64] Replace LLVM IR function attributes for PSTATE.ZA. (#79166 ) Since https://github.com/ARM-software/acle/pull/276 the ACLE defines attributes to better describe the use of a given SME state. Previously the attributes merely described the possibility of it being 'shared' or 'preserved', whereas the new attributes have more semantics and also describe how the data flows through the program. For ZT0 we already had to add new LLVM IR attributes: * aarch64_new_zt0 * aarch64_in_zt0 * aarch64_out_zt0 * aarch64_inout_zt0 * aarch64_preserves_zt0 We have now done the same for ZA, such that we add: * aarch64_new_za (previously `aarch64_pstate_za_new`) * aarch64_in_za (more specific variation of `aarch64_pstate_za_shared`) * aarch64_out_za (more specific variation of `aarch64_pstate_za_shared`) * aarch64_inout_za (more specific variation of `aarch64_pstate_za_shared`) * aarch64_preserves_za (previously `aarch64_pstate_za_shared, aarch64_pstate_za_preserved`) This explicitly removes 'pstate' from the name, because with SME2 and the new ACLE attributes there is a difference between "sharing ZA" (sharing the ZA matrix register with the caller) and "sharing PSTATE.ZA" (sharing either the ZA or ZT0 register, both part of PSTATE.ZA with the caller).	2024-02-01 13:37:37 +00:00
Alex Bradbury	748c295908	[MLIR][LLVM] Add fast-math related function attribute support (#79812 ) Adds unsafe-fp-math, no-infs-fp-math, no-nans-fp-math, approx-func-fp-math, and no-signed-zeros-fp-math function attributes. This allows code generators using the LLVMIR dialect to match the codegen of Clang.	2024-01-30 14:03:51 +00:00
Kunwar Grover	9261ab708e	[mlir][Target] Teach dense_resource conversion to LLVMIR Target (#78958 ) This patch adds support for translating dense_resource attributes to LLVMIR Target. The support added is similar to how DenseElementsAttr is handled, except we don't need to handle splats. Another possible way of doing this is adding iteration on dense_resource, but that is non-trivial as DenseResourceAttr is not meant to be something you should directly access. It has subclasses which you are supposed to use to iterate on it.	2024-01-23 13:30:34 -08:00
Sergio Afonso	8fb685fb7e	[MLIR][LLVM] Add explicit target_cpu attribute to llvm.func (#78287 ) This patch adds the target_cpu attribute to llvm.func MLIR operations and updates the translation to/from LLVM IR to match "target-cpu" function attributes.	2024-01-17 14:55:02 +00:00
Christian Ulmann	fa5255eee2	[MLIR][LLVM] Enable export of DISubprograms on function declarations (#78026 ) This commit changes the MLIR to LLVMIR export to also attach subprogram debug attachements to function declarations. This commit additonally fixes the two passes that produce subprograms to not attach the "Definition" flag to function declarations. This otherwise results in invalid LLVM IR.	2024-01-15 07:34:13 +01:00
Mats Petersson	21e1bf2d00	Add more ZA modes (#77361 ) Add more ZA modes Adds the arm_shared_za and arm_preserves_za attributes to the existing arm_new_za attribute. The functionality already exists in LLVM, so just "linking the pieces together". For more details see: https://arm-software.github.io/acle/main/acle.html#sme-attributes-relating-to-za	2024-01-11 18:49:52 +00:00

1 2 3 4 5 ...

347 Commits