llvm-project

Author	SHA1	Message	Date
BARRET	1666d13078	[CMake]: Remove unnecessary dependencies on LLVM/MLIR (#111255 ) Previous https://github.com/llvm/llvm-project/pull/110362 (reverted) caused breakage. Here is the PR with fix. My build cmdline: ``` cmake ../llvm \ -G Ninja \ -DCMAKE_BUILD_TYPE=Release \ -DCMAKE_INSTALL_PREFIX=install \ -DCMAKE_C_COMPILER=gcc-9 \ -DCMAKE_CXX_COMPILER=g++-9 \ -DCMAKE_CUDA_COMPILER=$(which nvcc) \ -DLLVM_ENABLE_LLD=OFF \ -DLLVM_ENABLE_ASSERTIONS=ON \ -DLLVM_BUILD_EXAMPLES=ON \ -DCOMPILER_RT_BUILD_LIBFUZZER=OFF \ -DLLVM_CCACHE_BUILD=ON \ -DMLIR_ENABLE_BINDINGS_PYTHON=ON \ -DBUILD_SHARED_LIBS=ON \ -DLLVM_ENABLE_PROJECTS='llvm;mlir' ```	2024-10-07 15:52:43 +02:00
Mehdi Amini	8b47711e84	Revert "CMake: Remove unnecessary dependencies on LLVM/MLIR" (#110594 ) Reverts llvm/llvm-project#110362 Multiple bots are broken.	2024-10-01 00:44:21 +02:00
BARRET	4980f2177e	CMake: Remove unnecessary dependencies on LLVM/MLIR (#110362 ) There are some spurious libraries which can be removed. I'm trying to bundle MLIR/LLVM library dependencies for our own libraries. We're utilizing cmake function to recursively collect MLIR/LLVM related dependencies. However, we identified certain library dependencies as redundant and safe for removal.	2024-09-30 23:57:13 +02:00
Frank Schlimbach	baabcb2898	[mlir][mesh] Shardingcontrol (#102598 ) This is a fixed copy of #98145 (necessary after it got reverted). @sogartar @yaochengji This PR adds the following to #98145: - `UpdateHaloOp` accepts a `memref` (instead of a tensor) and not returning a result to clarify its inplace-semantics - `UpdateHaloOp` accepts `split_axis` to allow multiple mesh-axes per tensor/memref-axis (similar to `mesh.sharding`) - The implementation of `Shardinginterface` for tensor operation (`tensor.empty` for now) moved from the tensor library to the mesh interface library. `spmdize` uses features from `mesh` dialect. @rengolin agreed that `tensor` should not depend on `mesh` so this functionality cannot live in a `tensor`s lib. The unfulfilled dependency caused the issues leading to reverting #98145. Such cases are generally possible and might lead to re-considering the current structure (like for tosa ops). - rebased onto latest main -------------------------- Replacing `#mesh.sharding` attribute with operation `mesh.sharding` - extended semantics now allow providing optional `halo_sizes` and `sharded_dims_sizes` - internally a sharding is represented as a non-IR class `mesh::MeshSharding` What previously was ```mlir %sharded0 = mesh.shard %arg0 <@mesh0, [[0]]> : tensor<4x8xf32> %sharded1 = mesh.shard %arg1 <@mesh0, [[0]]> annotate_for_users : tensor<16x8xf32> ``` is now ```mlir %sharding = mesh.sharding @mesh0, [[0]] : !mesh.sharding %0 = mesh.shard %arg0 to %sharding : tensor<4x8xf32> %1 = mesh.shard %arg1 to %sharding annotate_for_users : tensor<16x8xf32> ``` and allows additional annotations to control the shard sizes: ```mlir mesh.mesh @mesh0 (shape = 4) %sharding0 = mesh.sharding @mesh0, [[0]] halo_sizes = [1, 2] : !mesh.sharding %0 = mesh.shard %arg0 to %sharding0 : tensor<4x8xf32> %sharding1 = mesh.sharding @mesh0, [[0]] sharded_dims_sizes = [3, 5, 5, 3] : !mesh.sharding %1 = mesh.shard %arg1 to %sharding1 annotate_for_users : tensor<16x8xf32> ``` - `mesh.shard` op accepts additional optional attribute `force`, useful for halo updates - Some initial spmdization support for the new semantics - Support for `tensor.empty` reacting on `sharded_dims_sizes` and `halo_sizes` in the sharding - New collective operation `mesh.update_halo` as a spmdized target for shardings with `halo_sizes` --------- Co-authored-by: frank.schlimbach <fschlimb@smtp.igk.intel.com> Co-authored-by: Jie Fu <jiefu@tencent.com>	2024-08-12 12:20:58 +01:00
Renato Golin	3968942f10	Revert "[mlir][mesh] adding shard-size control (#98145 )" This reverts commit fca69838caf19854769ada21a71da91fcfcbde73. Also reverts the fixup: "[mlir] Fix -Wunused-variable in MeshOps.cpp (NFC)" This reverts commit fc737368fe6e27d6ecf76e522cb43a32aaca992a.	2024-08-07 15:12:37 +01:00
Frank Schlimbach	fca69838ca	[mlir][mesh] adding shard-size control (#98145 ) - Replacing `#mesh.sharding` attribute with operation `mesh.sharding` - extended semantics now allow providing optional `halo_sizes` and `sharded_dims_sizes` - internally a sharding is represented as a non-IR class `mesh::MeshSharding` What previously was ```mlir %sharded0 = mesh.shard %arg0 <@mesh0, [[0]]> : tensor<4x8xf32> %sharded1 = mesh.shard %arg1 <@mesh0, [[0]]> annotate_for_users : tensor<16x8xf32> ``` is now ```mlir %sharding = mesh.sharding @mesh0, [[0]] : !mesh.sharding %0 = mesh.shard %arg0 to %sharding : tensor<4x8xf32> %1 = mesh.shard %arg1 to %sharding annotate_for_users : tensor<16x8xf32> ``` and allows additional annotations to control the shard sizes: ```mlir mesh.mesh @mesh0 (shape = 4) %sharding0 = mesh.sharding @mesh0, [[0]] halo_sizes = [1, 2] : !mesh.sharding %0 = mesh.shard %arg0 to %sharding0 : tensor<4x8xf32> %sharding1 = mesh.sharding @mesh0, [[0]] sharded_dims_sizes = [3, 5, 5, 3] : !mesh.sharding %1 = mesh.shard %arg1 to %sharding1 annotate_for_users : tensor<16x8xf32> ``` - `mesh.shard` op accepts additional optional attribute `force`, useful for halo updates - Some initial spmdization support for the new semantics - Support for `tensor.empty` reacting on `sharded_dims_sizes` and `halo_sizes` in the sharding - New collective operation `mesh.update_halo` as a spmdized target for shardings with `halo_sizes` @sogartar @yaochengji	2024-08-07 13:34:57 +01:00
Ramkumar Ramachandra	db791b278a	mlir/LogicalResult: move into llvm (#97309 ) This patch is part of a project to move the Presburger library into LLVM.	2024-07-02 10:42:33 +01:00
Arda Unal	01a429c432	[mlir][mesh] Fix wrong argument passed to targetShardingInUnsplitLastAxis (#95059 ) In unsplitLastAxisInResharding, wrong argument was passed when calling targetShardingInUnsplitLastAxis.There weren't any tests to uncover this. I added one in mesh-spmdization.mlir for Linalg and one in resharding-spmdization.mlir for Mesh dialects.	2024-06-13 15:09:47 -07:00
Kazu Hirata	bc0cdefffe	[mlir] Fix warnings This patch fixes: mlir/lib/Dialect/Mesh/Transforms/ShardingPropagation.cpp:73:27: error: unused function 'operator<<' [-Werror,-Wunused-function] mlir/lib/Dialect/Mesh/Transforms/ShardingPropagation.cpp:97:27: error: unused function 'operator<<' [-Werror,-Wunused-function]	2024-05-22 16:40:27 -07:00
Boian Petkantchin	d635b860f3	[mlir][mesh] Insert resharding during sharding propagation (#84514 ) If there are conflicts between the sharding annotations of some op, insert resharding. Make the Spmdization pass more forgiving to allow for more than 2 chained `mesh.shard` ops. Implement `getReductionLoopIteratorKinds` in ShardingInterface for linalg ops.	2024-05-22 13:33:25 -07:00
Christian Sigg	a5757c5b65	Switch member calls to `isa/dyn_cast/cast/...` to free function calls. (#89356 ) This change cleans up call sites. Next step is to mark the member functions deprecated. See https://mlir.llvm.org/deprecation and https://discourse.llvm.org/t/preferred-casting-style-going-forward.	2024-04-19 15:58:27 +02:00
Jakub Kuderski	e93489c434	[mlir] Add missing build deps for Mesh transforms (#84581 )	2024-03-08 17:43:56 -05:00
Boian Petkantchin	abfac563f5	[mlir][mesh] Make sharding propagation and spmdization work on FuncOpInterface (#84415 ) Make them more general instead of only supporting `func::FuncOp`.	2024-03-08 08:14:36 -08:00
Boian Petkantchin	fb582b6ace	[mlir] Implement Mesh's ShardingInterface for Linalg ops (#82284 ) Allows linalg structured operations to be handled during spmdization and sharding propagation. There is only support for projected permutation indexing maps.	2024-03-07 17:05:44 -08:00
Boian Petkantchin	4f7ab789bf	[mlir][mesh] add support in spmdization for incomplete sharding annotations (#82442 ) Don't require that `mesh.shard` operations come in pairs. If there is only a single `mesh.shard` operation we assume that the producer result and consumer operand have the same sharding.	2024-02-22 11:06:14 -08:00
Boian Petkantchin	ff2720d190	[mlir][mesh] Dedublicate iterator type and partial type information (#81920 ) The two types duplicated mostly the same values. Here they are decomposed to carry orthogonal and complimentary information. Use `utils::IteratorType` instead of `mesh::IteratorType`. It now has only parallel and reduction values. Rename `Partial` to `ReductionKind`. Add `getReductionLoopIteratorKinds` method to `ShardingInterface`.	2024-02-16 07:10:46 -08:00
Boian Petkantchin	dc3258c617	[mlir][mesh] Add all-slice operation (#81218 ) This op is the inverse of all-gather. It is useful to have an explicit concise representation instead of having a blob of slicing logic. Add lowering for the op that slices from the tensor based on the in-group process index. Make resharding generate an all-slice instead of inserting the slicing logic directly.	2024-02-15 13:03:58 -08:00
Boian Petkantchin	adbf21f12b	[mlir][mesh] Add spmdization pass (#80518 ) Add a pass that converts a function that has sharding annotations into SPMD form.	2024-02-06 20:55:14 -08:00
Boian Petkantchin	31fc0a12e1	[mlir][mesh] Refactoring code organization, tests and docs (#79606 ) * Split out `MeshDialect.h` form `MeshOps.h` that defines the dialect class. Reduces include clutter if you care only about the dialect and not the ops. * Expose functions `getMesh` and `collectiveProcessGroupSize`. There functions are useful for outside users of the dialect. * Remove unused code. * Remove examples and tests of mesh.shard attribute in tensor encoding. Per the decision that Spmdization would be performed on sharding annotations and there will be no tensors with sharding specified in the type. For more info see this RFC comment: https://discourse.llvm.org/t/rfc-sharding-framework-design-for-device-mesh/73533/81	2024-01-31 07:20:14 -08:00
Boian Petkantchin	9a8437f504	[mlir][mesh] Rename cluster to mesh (#79484 ) Rename * Op mesh.cluster -> mesh.mesh * Op mesh.cluster_shape -> mesh.mesh_shape * variables and attributes. The name `mesh` is more specific to what it really represents. It is a mesh of devices. The name `cluster` implies a broader posibility of device configurations. When just the word `mesh` is used the meaning can often be inferred from the context whether it refers to the mesh dialect or a device mesh. The full name can be used when needed.	2024-01-26 07:03:29 -08:00
Boian Petkantchin	5df2c00af3	[mlir][mesh] Remove rank attribute and rename dim_sizes to shape in ClusterOp (#77838 ) Remove the somewhat redundant rank attribute. Before this change ``` mesh.cluster @mesh(rank = 3, dim_sizes = 2x3) ``` After ``` mesh.cluster @mesh(shape = 2x3x?) ``` The rank is instead determined by the provided shape. With this change no longer `getDimSizes()` can be wrongly assumed to have size equal to the cluster rank. Now `getShape().size()` will always equal `getRank()`.	2024-01-15 07:39:09 -08:00
Matthias Springer	0cb024b357	[mlir][Mesh] Fix invalid IR in rewrite pattern (#78094 ) This commit fixes `test/Dialect/Mesh/folding.mlir` when running with `MLIR_ENABLE_EXPENSIVE_PATTERN_API_CHECKS`. ``` /usr/local/google/home/springerm/mlir_public/llvm-project/mlir/test/Dialect/Mesh/folding.mlir:19:10: error: Unexpected number of results 0. Expected 2. %0:2 = mesh.cluster_shape @mesh1 : index, index ^ /usr/local/google/home/springerm/mlir_public/llvm-project/mlir/test/Dialect/Mesh/folding.mlir:19:10: note: see current operation: "mesh.cluster_shape"() <{axes = array<i16>, mesh = @mesh1}> : () -> () mlir-asm-printer: Verifying operation: builtin.module Unexpected number of results 0. Expected 2. mlir-asm-printer: 'builtin.module' failed to verify and will be printed in generic form "builtin.module"() ({ "mesh.cluster"() <{dim_sizes = array<i64: 2, 3>, rank = 2 : i64, sym_name = "mesh1"}> : () -> () "func.func"() <{function_type = () -> (index, index), sym_name = "cluster_shape_op_folding_all_axes_static_mesh"}> ({ %0 = "arith.constant"() <{value = 2 : index}> : () -> index %1 = "arith.constant"() <{value = 3 : index}> : () -> index "mesh.cluster_shape"() <{axes = array<i16>, mesh = @mesh1}> : () -> () %2:2 = "mesh.cluster_shape"() <{axes = array<i16>, mesh = @mesh1}> : () -> (index, index) "func.return"(%0, %1) : (index, index) -> () }) : () -> () }) : () -> () LLVM ERROR: IR failed to verify after pattern application ``` If `axes` is empty, the op verifier assumes that all dimensions are queried. (Expected 2 results.)	2024-01-15 09:00:43 +01:00
Boian Petkantchin	79aa776267	[mlir][mesh] Add lowering of process multi-index op (#77490 ) * Rename mesh.process_index -> mesh.process_multi_index. * Add mesh.process_linear_index op. * Add lowering of mesh.process_multi_index into an expression using mesh.process_linear_index, mesh.cluster_shape and affine.delinearize_index. This is useful to lower mesh ops and prepare them for further lowering where the runtime may have only the linear index of a device/process. For example in MPI we have a rank (linear index) in a communicator.	2024-01-10 07:01:16 -08:00
Boian Petkantchin	ab590377a3	[mlir][mesh] Add folding of ClusterShapeOp (#77033 ) If the mesh has static size on some of the requested axes, the result is substituted with a constant.	2024-01-09 13:42:56 -08:00
Boian Petkantchin	7a4c49756d	[mlir][mesh] Use one type for mesh axis (#76830 ) Make all ops and attributes use the types MeshAxis and MeshAxesAttr instead of int16_t, int32_t, DenseI16ArrayAttr and DenseI32ArrayAttr.	2024-01-03 15:47:11 -08:00
Jie Fu	ab43cf26ca	[mlir][mesh] Fix -Wunused-variable in Spmdization.cpp (NFC) llvm-project/mlir/lib/Dialect/Mesh/Transforms/Spmdization.cpp:573:14: error: unused variable 'targetShardType' [-Werror,-Wunused-variable] ShapedType targetShardType = ^ 1 error generated.	2024-01-03 09:29:14 +08:00
Boian Petkantchin	1a8fb88719	[mlir][mesh] Add resharding spmdization on a 1D device mesh (#76179 ) The current implementation supports only sharding of tensor axes that have size divisible by the mesh axis size.	2024-01-02 15:50:07 -08:00
Boian Petkantchin	4b3446771f	[mlir][mesh] Add endomorphism simplification for all-reduce (#73150 ) Does transformations like all_reduce(x) + all_reduce(y) -> all_reduce(x + y) max(all_reduce(x), all_reduce(y)) -> all_reduce(max(x, y)) when the all_reduce element-wise op is max. Added general rewrite pattern HomomorphismSimplification and EndomorphismSimplification that encapsulate the general algorithm. Made specialization for all-reduce with respect to addf, addi, minsi, maxsi, minimumf and maximumf in the Arithmetic dialect.	2023-12-12 10:21:52 -08:00
Chengji Yao	b0d5b4d252	[MLIR][Mesh] Add sharding propagation pass (#71261 ) Add a pass that propagates sharding information throughout the graph. After this pass, each of the operations' operands and results is annotated with a mesh.shard operation. The pass is driven by a newly added ShardingInterface, and an implementation for element-wise and matmul ops in the TOSA dialect is provided.	2023-11-03 21:07:31 -07:00
Mehdi Amini	466abaf152	Revert "[MLIR][Mesh] Add sharding propagation pass (#69665 )" This reverts commit 9d9400d7de9b928e3018af97e8b381a4a6ba5162. This reverts commit bda763aea0b854178c01eac9f309042d9aaa823b. The buildbot is broken and tests are failing.	2023-11-03 17:52:41 -07:00
Chengji Yao	9d9400d7de	[MLIR][Mesh] Add sharding propagation pass (#69665 ) Add a pass that propagates sharding information throughout the graph. After this pass, each of the operations' operands and results is annotated with a `mesh.shard` operation, and the operations themselves are added with sharding option attributes. The pass is driven by a newly added `ShardingInterface`, and an implementation for element-wise and matmul ops in the TOSA dialect is provided.	2023-11-03 17:12:42 -07:00

31 Commits